Documentai Notebooks Save Abandoned

A centralized repository for AI Platform notebooks using the Google Cloud Document AI API.

Project README

Document AI Notebooks

This repository contains several Jupyter notebooks to be used with the Cloud Document AI Platform. Use the general notebooks to process any form type or the specialized notebooks for any of the solutions such as Procurement DocAI or Lending DocAI. These notebooks help you get started with extracting data from your documents whether you're bring your own form types or using one of our specialized parsers for invoices, receipts, tax forms and more.

gif

Prerequisites

You must have your own GCP project with billing enabled and have working knowledge of the following products:

Set Up Steps

  1. Set up your GCP project for Document AI following the Setup Guide.
  2. Enable the 'Document AI API' in your project in the Document AI Platform.
  3. Create or use an existing instance of AI Platform Notebook with Python 3 using the default configurations.
  4. In the notebook, go to Git > Clone a Repository and paste the repository URL.
  5. Install the required libraries in the notebook terminal python -m pip install -r requirements.txt

Please note Colab and Jupyter notebooks are also work with these samples. However, additional authentication will be required for service accounts.

Instructions

  1. Identify which form type or utility you would like to run through a processor.
  2. Create your processor using the instructions.
  3. Copy your processor id. processorId
  4. Update the PROCESSOR_ID, PROJECT_ID and REGION variables in the notebook.
PROJECT_ID = "YOUR_PROJECT_ID_HERE"
LOCATION = "LOCATION"  # Format is 'us' or 'eu'
PROCESSOR_ID = "PROCESSOR_ID"  # Create processor in Cloud Console

Please note, the location must match the one assigned to the processor.

  1. Run the notebook.
Open Source Agenda is not affiliated with "Documentai Notebooks" Project. README Source: GoogleCloudPlatform/documentai-notebooks
Stars
73
Open Issues
6
Last Commit
1 year ago
License

Open Source Agenda Badge

Open Source Agenda Rating