Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Text segment is deduped

I uploaded my model.jsonl file to train my model in order to use Vertex entity extraction

My file looks like this:
{"textGcsUri":"gs://cnbv-sitiaa-prod-ocr-bucket/TrainingModels/TextFiles/2023_116915_3.txt","languageCode":"es-MX","textSegmentAnnotations":[{"displayName":"solicitado_nombre","startOffset":678,"endOffset":759,"annotationResourceLabels":{"aiplatform.googleapis.com/annotation_set_name":"8902360821032550400"}},{"displayName":"solicitado_nombre","startOffset":7913,"endOffset":7994,"annotationResourceLabels":{"aiplatform.googleapis.com/annotation_set_name":"8902360821032550400"}},
.
.
...... (multiple annotationResourceLabels tags]


After uploading my file I got this warning:
Text segment `[678, 759)` with display name `solicitado_nombre` is deduped. for: gs://cnbv-sitiaa-prod-ocr-bucket/TrainingModels/JsonFiles/2023_116915_3.jsonl line 1

But I have checked my file and there is no other segment like that, Am I doing something wrong?


0 0 95
0 REPLIES 0