My goal is to be able to process large amounts of technical data sheets and extract the material properties within them using this functionality: https://cloud.google.com/vertex-ai/generative-ai/docs/samples/generativeaionvertexai-gemini-pdf
So far, its been 90% accurate. Most of the mistakes are attributing values in a table to the adjacent column or row.
Is this par for the course at the moment when it comes to pdf processing? A 10% difference in can be massive for material properties.
Hello tjohnson818,
Welcome to Google Cloud Community!
A 10% difference in material properties can indeed be massive, depending on the application. Without specific details about your PDF processing method, the nature of the material properties, and the expected accuracy, it's difficult to provide a definitive answer. However, I can provide the best practices and limitations where we can address the challenges in PDF processing and material properties.
You may also check document understanding for additional resources and detailed documentation for the best results.
I hope the above information is helpful.
User | Count |
---|---|
2 | |
2 | |
1 | |
1 | |
1 |