Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Playbook Data retrieval from PDF data store

Hi All,

I created a playbook with tool to get data from datastore which hold indexed pdf.

PDF contains many tables and summaries.

The problem is when i ask questions related to table data inside pdf its not bringing right column data to me sometimes its displaying other column value of table. 

0 2 93
2 REPLIES 2

Hi @Rajavelu,

Welcome to Google Cloud Community!

The issue you're encountering might be due to how the table data in the PDF is indexed or extracted. Make sure the tool you're using accurately identifies and indexes the data for each column individually. You might need to refine the extraction process to improve column alignment or use a more precise table extraction method to ensure proper mapping of columns and data.

If the issue persists, please contact Google Cloud Support and provide detailed information about your case. 

Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.

Thanks for your reply!

Could you please explain below. how do I know if its properly indexed and how to execute extraction method? I am bit confused.

"Make sure the tool you're using accurately identifies and indexes the data for each column individually. You might need to refine the extraction process to improve column alignment or use a more precise table extraction method to ensure proper mapping of columns and data."