Google this morning launched the Document AI (DocAI) platform, a console for document processing hosted in Google Cloud, in preview. The company says it’s aimed at automating and validating documents by extracting data from documents and making them available to business apps and users.
Companies spend an average of $20 to file and store a single document, by some estimates, and only 18% of companies consider themselves paperless. An IDC report revealed that document-related challenges account for a 21.3% productivity loss, and U.S. companies waste a collective $8 billion annually managing paperwork.
Google’s DocAI platform ostensibly solves this by providing access to document parsers, tools, and solutions via an API. It supports the creation and customization of document processing workflows built with Google Cloud’s predefined taxonomy without the need to perform additional data mapping or training. DocAI offers general processors including a form parser, W9 parser, optical character recognition, document splitter, and custom workflows for domain-specific documents. These reside in a unified dashboard from where they can be tested by uploading a document directly in the console.
The parsers can classify information in documents like addresses, account numbers, and signatures as well as extract data like supplier names, invoice dates, and payment terms. Google says it’s working on additional capabilities for the DocAI platform to grow its core capabilities and support additional toolsets.
General parsers such as optical character recognition, the form parser, and the document splitter are available from the DocAI platform console. Access to specialized parsers like W9, 1040, W2, 1099-MISC, 1003, invoice, and receipts must be requested on a per-customer basis.
The launch of Google’s DocAI platform comes after the release of Lending DocAI, a Google Cloud product for the mortgage industry that ostensibly provides “industry-leading” accuracy for documents relevant to lending and processing. Google also recently unveiled PPP Lending AI, an effort to help lenders expedite the processing of applications for the since-exhausted U.S. Small Business Administration’s (SBA) Paycheck Protection Program, and Procurement DocAI, which automates procurement data capture by turning docs like invoices and receipts into structured data.
Lending DocAI and Procurement DocAI are now a part of the DocAI platform.
“We believe that any company that has to manually extract data from complex documents at scale can greatly benefit from Google Cloud AI,” Google product manager Lewis Liu and product marketing manager Yang Liang wrote in a blog post. “Transforming documents into structured data increases the speed of decision making for companies, unlocking measurable business value and helping develop better experiences for customers.”
VentureBeatVentureBeat's mission is to be a digital town square for technical decision-makers to gain knowledge about transformative technology and transact. Our site delivers essential information on data technologies and strategies to guide you as you lead your organizations. We invite you to become a member of our community, to access:
- up-to-date information on the subjects of interest to you
- our newsletters
- gated thought-leader content and discounted access to our prized events, such as Transform
- networking features, and more