Issue in Detail:
I'm experiencing difficulties in achieving high accuracy when using Google Document AI for parsing invoices and expenses. I am currently working with custom datasets that include approximately 1,500 images/documents per parser (Invoice Parser and Expense Parser). My goal is to reliably extract key data from these documents, but due to the diverse layouts and formats in my dataset, the F1 scores remain low, ranging between 0.75 and 0.85. This performance issue becomes even more apparent when testing with new images or documents; in these cases, predictions are frequently inaccurate or sometimes miss important fields.
What I've Already Tried:
I would appreciate any insights or recommendations for improving model accuracy, particularly with datasets that contain various layouts. Thank you!
Hi @steven_tan10,
Welcome to Google Cloud Community!
It's great that you're actively working to improve your Document AI model's accuracy. Here are some strategies to address the challenges you're facing with diverse layouts and inconsistent predictions:
Data Augmentation and Preprocessing:
1. Layout Normalization: Consider using image processing techniques to normalize the layout of your documents. This could involve:
2. Data Augmentation: Generate synthetic variations of your existing documents to increase the diversity of your training data. This can include:
3. OCR Preprocessing: Prior to inputting documents into Document AI, it's beneficial to use a reliable OCR engine to enhance text extraction accuracy. This step can help mitigate problems caused by blurry or low-quality images.
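As a rough illustration of the normalization and augmentation ideas in the list above, here is a minimal sketch in plain Python. It is not part of Document AI, and every function name in it (`stretch_contrast`, `shift_brightness`, `add_noise`, `translate`, `augment`) is hypothetical. It assumes a grayscale page represented as a list of rows of 0-255 integers; in practice you would more likely use an imaging library, and the specific transformations and ranges chosen here are assumptions rather than recommended settings.

```python
import random


def stretch_contrast(image):
    """Min-max normalize a grayscale image (rows of 0-255 ints) to the full 0-255 range."""
    flat = [p for row in image for p in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        # Flat image: nothing to stretch, return a copy unchanged.
        return [row[:] for row in image]
    scale = 255 / (hi - lo)
    return [[round((p - lo) * scale) for p in row] for row in image]


def shift_brightness(image, delta):
    """Add delta to every pixel, clamping the result to 0-255."""
    return [[max(0, min(255, p + delta)) for p in row] for row in image]


def add_noise(image, amplitude, rng):
    """Add uniform integer noise in [-amplitude, amplitude] to every pixel, clamped to 0-255."""
    return [
        [max(0, min(255, p + rng.randint(-amplitude, amplitude))) for p in row]
        for row in image
    ]


def translate(image, dx, fill=255):
    """Shift the image right by dx pixels (left if dx is negative), padding with fill."""
    width = len(image[0])
    dx = max(-width, min(width, dx))
    if dx >= 0:
        return [[fill] * dx + row[: width - dx] for row in image]
    return [row[-dx:] + [fill] * (-dx) for row in image]


def augment(image, n_variants, seed=0):
    """Return n_variants perturbed copies of a contrast-normalized image."""
    rng = random.Random(seed)  # seeded so the variants are reproducible
    base = stretch_contrast(image)
    variants = []
    for _ in range(n_variants):
        variant = shift_brightness(base, rng.randint(-20, 20))
        variant = add_noise(variant, 5, rng)
        variant = translate(variant, rng.randint(-2, 2))
        variants.append(variant)
    return variants
```

Calling `augment(page, 4)` on one scanned page would give four slightly different copies that keep the original size. Keep the perturbations small enough that the text stays legible, since the labels from the original document are reused for each variant.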
You can also refer to the following documentation for additional insights and guidance:
In addition, I came across an article/blog that covers Data Augmentation in Document AI which could be helpful for you.
I hope the above information is helpful.
Apologies for the late response, and thank you @dawnberdan for your suggestion. If I understand your explanation correctly, we need to perform data preprocessing or data cleaning before importing the data into Document AI. Is that correct?
Hi @steven_tan10,
Yes, performing data preprocessing or data cleaning before importing documents into Document AI is a key step in improving accuracy, especially when working with diverse layouts. By normalizing the layout, correcting distortions, and ensuring that the input data is clean, you give Document AI a better chance of making accurate predictions.
Once you have preprocessed your documents, you can then input them into Document AI for further analysis and extraction. This will likely lead to better results in terms of accuracy and consistency.
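To check whether preprocessing actually helps, one option is to score the same held-out documents before and after. The following is a minimal sketch, assuming exact-match comparison of extracted values and micro-averaging across all fields. The function name `field_f1` and the field names `total` and `date` are hypothetical, and Document AI's built-in evaluation may compute its F1 score differently.

```python
def field_f1(predictions, ground_truth):
    """Micro-averaged precision, recall and F1 over extracted fields.

    Both arguments are lists with one dict per document, mapping a field
    name to its extracted (or labeled) value. A prediction counts as
    correct only if it matches the labeled value exactly.
    """
    tp = fp = fn = 0
    for pred, truth in zip(predictions, ground_truth):
        for field, value in pred.items():
            if field in truth and truth[field] == value:
                tp += 1
            else:
                fp += 1  # spurious field or wrong value
        for field, value in truth.items():
            if field not in pred or pred[field] != value:
                fn += 1  # labeled field that was missed or extracted wrongly
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


# Hypothetical labels and predictions for two documents.
truth = [
    {"total": "100.00", "date": "2024-01-05"},
    {"total": "42.10", "date": "2024-02-11"},
]
raw_predictions = [
    {"total": "100.00"},                        # date missed
    {"total": "42.70", "date": "2024-02-11"},   # total misread
]
cleaned_predictions = [
    {"total": "100.00", "date": "2024-01-05"},
    {"total": "42.10", "date": "2024-02-11"},
]

print("raw:", field_f1(raw_predictions, truth))
print("cleaned:", field_f1(cleaned_predictions, truth))
```

Running the same comparison per layout or per vendor, rather than over the whole test set, can show which document types are pulling the overall score down.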
I hope the above information is helpful.