Get hands-on experience with 20+ free Google Cloud products and $300 in free credit for new customers.

Document Ai

Looking for someone with experience in Document AI and OCR integration to help me build a custom configuration for extracting data from Arabic invoices. The goal is to create a solution that can seamlessly switch between Document AI and OCR to accurately extract data, and link it to the correct key. If you have experience in this area and are interested in taking on this project, please reach out to me! #DocumentAI #OCR #CustomConfiguration #ArabicInvoices

0 1 289
1 REPLY 1

Hi @Salem-Mufarreh,

Welcome to Google Cloud Community.

You may try to create a model with a special configuration that will allow data to be extracted from your Arabic invoices based on their unique style and format.

Here are some steps that might help you to set up your Arabic invoices:

  • To serve as training data, compile a set of representative Arabic invoices.
  • To increase the OCR models' recognition of Arabic text, train them using the training data.
  • Utilize the Document AI interface to create a unique template for your Arabic invoices.
  • To make sure the template accurately collects the necessary data, test it on an example of an invoice.
  • To increase the template's correctness, make any necessary adjustments.
  • The template can be used to automatically extract data from your Arabic invoices once it has been finished.

Creating a unique configuration for Document AI and OCR to extract data from Arabic invoices can be a challenging task that calls for knowledge of machine learning, natural language processing, and data extraction.

Here are some references that might help you:
https://cloud.google.com/document-ai/docs/languages?_ga=2.213324372.-1392753435.1676655686
https://cloud.google.com/vision/docs/languages?_ga=2.213324372.-1392753435.1676655686
https://cloud.google.com/document-ai/docs/processors-list?_ga=2.213324372.-1392753435.1676655686
https://cloud.google.com/speech-to-text/docs/speech-to-text-supported-languages?_ga=2.213324372.-139...
https://cloud.google.com/speech-to-text/v2/docs/speech-to-text-supported-languages?_ga=2.213324372.-...