Greetings,
Very new to working with unstructured data.
Situation: Collection of PDF Invoices on some Google Drive
Required: Parse out certain data from the PDF files and land the data into Google BigQuery
Q1: How would I setup Document AI to parse the below data ( see highlighted green items )
Q2: Is there any special setup for the line item section ( PurchaseItem1,2,3, etc..)
Thank you for your patience and understanding 👍
Solved! Go to Solution.
You can follow these initial steps in codelabs to get you set up with Document AI's Invoice Parser. Make sure to enable your Document AI API and create the right processor. For your use case, the Invoice Parser.
Once you're set up, you can train your processor and set up a pipeline to extract data from your PDF invoices and to store it in BigQuery.