Hello,
I’m planning to extract text and checkboxes from a handwritten survey in PDF format. Previously, the training samples were ineffective because they didn’t display the grey label when I selected “Text” during labeling. I believe this was due to the original form not being recognized.
So, I’ve decided to design a new form/survey using MS Word. This time, the checkboxes are larger, but some of them can’t be detected. I understand that blurring may occur after printing, filling, scanning to pdf, and rendering with Document AI. This is unavoidable and beyond my control.
Could you advise on the optimal size or conditions for the checkboxes to achieve better results? Is there anything else I can do?
FYI, I am using Template-Based model.
Solved! Go to Solution.
Hi @KT-K,
You're correct! Achieving consistent sizing across various formats can be challenging. To enhance OCR accuracy and minimize discrepancies, consider these tips for optimizing your print and scan settings:
Print Settings:
Scan Settings:
Additional Tips:
By fine-tuning your print and scan settings, you can greatly improve OCR accuracy and manage challenges related to document size and image quality.
I hope the above information is helpful.
Hi @KT-K,
Welcome to Google Cloud Community!
The absence of a gray label indicates that the checkbox definitions in your template might be incorrect. As a result, the model struggles to align your annotations with the actual checkbox elements in the document.
Additionally, adding more samples that lack the gray label won’t improve the model’s understanding of what a checkbox is. Instead, the model will likely learn to see those areas as empty or undefined. The crucial step is to make sure your existing samples have correctly labeled checkboxes with the gray label. This helps the model learn what a checkbox looks like.
Moreover, increasing the checkbox size is a good move, but it's not a complete fix. Challenges like blurring, ink quality, and handwriting variability still impact detection.
To optimize checkbox design for accurate extraction with Document AI, consider the following:
Additional tips include ensuring your Template-Based model is well-aligned with the form's layout to facilitate accurate extraction. Experiment with different checkbox sizes and designs, and test various training datasets. Analyzing the results will help you identify the optimal combination for your specific form and use case.
I hope the above information is helpful.
Hi @dawnberdan
Thank you for the information. It is quite difficult to set 12-15mm because the size differs between MS Word, the printed copy, and the scanned copy.😂
Anyway, as you said, “Challenges like blurring, ink quality, and handwriting variability still impact detection.”
Do you have any suggestions for other stages, like printout settings, scan settings, etc.?
Hi @KT-K,
You're correct! Achieving consistent sizing across various formats can be challenging. To enhance OCR accuracy and minimize discrepancies, consider these tips for optimizing your print and scan settings:
Print Settings:
Scan Settings:
Additional Tips:
By fine-tuning your print and scan settings, you can greatly improve OCR accuracy and manage challenges related to document size and image quality.
I hope the above information is helpful.