Our goal is to submit a picture of a document (PDF) together with some prompts, and compare the data on the document against the data saved in our system. This should speed up our manual verification process and help flag missing or potentially incorrect data on the document.
This is our first step toward introducing AI into our system. Everyone on our team is brand new to AI, so we are researching and testing multiple products.
Using any of the Google Vision products (Vision, Document AI, Vertex AI), is there a way to send a base64-encoded document along with some prompts and receive a response? We have been able to use the Vertex UI for some initial testing, but we would now like to start testing from a Java application.
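
To make the question concrete, here is a minimal sketch of what such a request might look like from Java using only the standard library. It assumes the Vertex AI Gemini `generateContent` REST format, where the document is passed as an `inlineData` part (MIME type plus base64 data) next to a `text` part holding the prompt. The class name `DocumentPromptSketch` and its method names are hypothetical, and the endpoint URL and access token are placeholders that we would have to fill in for our own project. The sketch only builds the request and does not send it.

```java
import java.net.URI;
import java.net.http.HttpRequest;
import java.nio.charset.StandardCharsets;
import java.util.Base64;

// Hypothetical helper class; none of these names come from a Google library.
final class DocumentPromptSketch {

    // Base64-encode the raw document bytes (standard alphabet, no line breaks).
    static String encodeDocument(byte[] documentBytes) {
        return Base64.getEncoder().encodeToString(documentBytes);
    }

    // Escape a string so it can be embedded inside a JSON string literal.
    static String jsonEscape(String s) {
        StringBuilder sb = new StringBuilder();
        for (char c : s.toCharArray()) {
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n");  break;
                case '\r': sb.append("\\r");  break;
                case '\t': sb.append("\\t");  break;
                default:
                    if (c < 0x20) sb.append(String.format("\\u%04x", (int) c));
                    else sb.append(c);
            }
        }
        return sb.toString();
    }

    // Build the JSON body: one user turn with the document part and the prompt part.
    // Assumption: this matches the Gemini generateContent request shape.
    static String buildRequestBody(String base64Document, String mimeType, String prompt) {
        return "{\"contents\":[{\"role\":\"user\",\"parts\":["
            + "{\"inlineData\":{\"mimeType\":\"" + jsonEscape(mimeType)
            + "\",\"data\":\"" + base64Document + "\"}},"
            + "{\"text\":\"" + jsonEscape(prompt) + "\"}]}]}";
    }

    // Build (but do not send) an authenticated POST request.
    // endpointUrl and accessToken are placeholders supplied by the caller.
    static HttpRequest buildRequest(String endpointUrl, String accessToken, String body) {
        return HttpRequest.newBuilder(URI.create(endpointUrl))
            .header("Authorization", "Bearer " + accessToken)
            .header("Content-Type", "application/json")
            .POST(HttpRequest.BodyPublishers.ofString(body, StandardCharsets.UTF_8))
            .build();
    }

    public static void main(String[] args) {
        // Stand-in bytes; a real run would read the PDF with Files.readAllBytes(...).
        byte[] fakePdf = "%PDF-1.4 placeholder".getBytes(StandardCharsets.UTF_8);
        String body = buildRequestBody(
            encodeDocument(fakePdf),
            "application/pdf",
            "Does the invoice number on this document match \"INV-001\"?");
        System.out.println(body);
    }
}
```

If this shape is right, the request would presumably be posted with `java.net.http.HttpClient` to a URL of the form `https://LOCATION-aiplatform.googleapis.com/v1/projects/PROJECT/locations/LOCATION/publishers/google/models/MODEL:generateContent`, using an OAuth 2.0 access token (for example, from `gcloud auth print-access-token` during testing). Whether this or Google's Java client library is the recommended route is part of what we are asking.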