Document AI documentation
Document AI is a document understanding platform that takes unstructured data from documents and transforms it into structured data, making it easier to understand, analyze, and consume.
Document AI uses machine learning and Google Cloud to help you create scalable, end-to-end, cloud-based document processing applications.
To learn more, see Document AI overview.
Start your proof of concept with $300 in free credit
- Get access to Gemini 2.0 Flash Thinking
- Free monthly usage of popular products, including AI APIs and BigQuery
- No automatic charges, no commitment
Keep exploring with 20+ always-free products
Access 20+ free products for common use cases, including AI APIs, VMs, data warehouses, and more.
Documentation resources
Get Started
- Tutorial
- Tutorial
- Tutorial
- Tutorial
- Related
Reference
- Technical
- Technical
- Technical
- Technical
Related resources
Optical Character Recognition (OCR) with Document AI (Python)
This codelab will teach how to perform Optical Character Recognition using the Document AI Python Client Library. You will explore how to perform online and batch processing of documents with a PDF of the classic novel "Winnie the Pooh" by A.A. Milne.
Form Parsing with Document AI (Python)
In this codelab, you will learn how to use the Document AI Form Parser to parse a handwritten form with Python. We will use a simple medical intake form as an example, but this procedure will work with any generalized form.
Specialized Processors with Document AI (Python)
In this codelab, you will learn how to use Document AI Specialized Processors to classify and parse specialized documents with Python. We will use an invoice as an example, but this procedure will work with any specialized document supported by Document AI.
Managing Document AI processors with Python
In this lab, you will focus on managing Document AI processors programmatically with the Python client library.
Document AI Workbench: Uptraining
In this lab, you will create an Invoice Parser processor, configure the processor for uptraining, label example documents, and uptrain the processor.
Document AI Workbench: Custom Document Extractor
In this lab, you will create a Custom Document Extraction processor, import a dataset, label example documents, and train the processor.
Document AI Warehouse
In this lab, you will learn how to ingest, process, and search documents using the Document AI Warehouse user interface.
Automating income taxes with Document AI
For the United States, Tax Day is upon us. Luckily, Lending Document AI can intelligently classify and parse many common documents used for tax preparation. Learn how to build a tax processing pipeline using Document AI.
Smarter applications with Document AI, Workflows and Cloud Functions
With compute solutions on Google Cloud and Document AI, you can create seamless integrations and easy to use applications for your users. Document AI is a platform and a family of solutions that help businesses to transform documents into structured data backed by machine learning. In this blog post we'll walk you through how to use Serverless technology to process documents with Cloud Functions, and with workflows of business processes orchestrating microservices, API calls, and functions, thanks to Workflows.
Diving into your documents with Document AI
We recently announced the GA of the Document AI Platform, Google's solution for automating and validating documents to streamline document workflows. Important business data is not always readily available in computer-readable formats. This is what we consider dark formats such as pdfs, handwritten forms and images.
Using Document AI to automate procurement workflows
We recently announced the GA of the Document AI Platform, Google's solution for automating and validating documents to streamline document workflows. Important business data is not always readily available in computer-readable formats. This is what we consider dark formats such as pdfs, handwritten forms and images.
Building a document understanding pipeline with Google Cloud
Document understanding is the practice of using AI and machine learning to extract data and insights from text and paper sources such as emails, PDFs, scanned documents, and more. In the past, capturing this unstructured or "dark data" has been an expensive, time-consuming, and error-prone process requiring manual data entry. Today, AI and machine learning have made great advances towards automating this process, enabling businesses to derive insights from and take advantage of this data that had been previously untapped.
Automate Data Capture at Scale with Document AI
Earn a skill badge by completing the Automate Data Capture at Scale with Document AI quest, where you learn how to create a document processing pipeline that will automatically process documents. You will learn how to use the Document AI form processor to extract data from the documents and save the data.