Hello,
I am experiencing issues while training and testing a custom model using Document AI. Despite having provided a sufficient amount of labeled training data, the model frequently misclassifies documents during the testing phase.
Questions:
I’d appreciate any insights, suggestions, or resources to address this issue and improve the model's performance.
Thank you!
Hi @tootsieroll,
Welcome to Google Cloud Community!
I understand that you are encountering issues on training and testing a custom model using Document AI. There are several reasons why your model misclassify documents despite having sufficient labeled training data.
Here are possible causes of misclassify documents and how to address them :
Data Quality - Make sure that there are no labeling errors or inconsistencies. Ensure labels are consistent by following clear guidelines and having multiple reviewers check the data.
Data Diversity - Include different types of documents and formats in your training data to match what the model will see in real use.
Model Configuration - Check if certain types or features of documents are causing problems repeatedly.
Regular Retraining - Monitor the models performance with new training data and continue retraining to help the model improve overtime.
For more information about Custom Document Classifier you can read this documentation.
Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.
Thank you for your response. I truly appreciate it. I have a few additional questions:
Your guidance on these points would be greatly appreciated. Thanks again.
User | Count |
---|---|
2 | |
2 | |
1 | |
1 | |
1 |