Split Model Overview

Automatically break a multi-page source file into separate documents and assign a class to each one.

Use Cases

Process a bundle of different documents sent in the same file. The result identifies both the page range and the class of each document, which makes complex workflows possible.

Some common examples, where a single PDF contains:

  • Several different invoices

  • A mix of invoices, receipts, and bank statements

  • A person's driver's license, vehicle registration, and insurance

  • Front and back of an ID card, each on a separate page

  • The same type of document, but from different regions or languages

A file sent to the Split Model may have any number of pages, within limits.
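To make the shape of such a result more concrete, here is a minimal Python sketch that groups the page ranges of a split file by class. The `SplitSegment` type, its field names, the 0-based inclusive page indexing, and the sample values are all assumptions made for illustration; they are not the actual API response format, which is described in the integration documentation.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class SplitSegment:
    """Hypothetical representation of one document found in the file."""
    document_class: str  # class assigned to the document, e.g. "INVOICE"
    first_page: int      # first page of the document (assumed 0-based, inclusive)
    last_page: int       # last page of the document (assumed inclusive)


def group_by_class(segments):
    """Group (first_page, last_page) ranges by class, preserving file order."""
    grouped = defaultdict(list)
    for seg in segments:
        grouped[seg.document_class].append((seg.first_page, seg.last_page))
    return dict(grouped)


# Illustrative data only: a 5-page PDF holding two invoices and one receipt.
segments = [
    SplitSegment("INVOICE", 0, 1),
    SplitSegment("RECEIPT", 2, 2),
    SplitSegment("INVOICE", 3, 4),
]

ranges_by_class = group_by_class(segments)
print(ranges_by_class)
```

Grouping by class is only one possible design; a workflow that cares about document order could iterate over the segments directly instead.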

Create a Split Model

Split models are always custom; there are no templates available in the Catalog. This keeps Split models flexible for different documents and workflows.

Each Split model gets its own unique model ID when you create it.

  1. To create a Split model, click Models, then click Create your document AI model.

  2. Scroll to the Document Utilities section and click Split.

  3. A pop-up will appear, allowing you to enter the classes you want. Each class corresponds to a document type possibly present in the documents you want to process. For example, if the files you are processing contain invoices, receipts, and driving licenses, set the classes as: INVOICE, RECEIPT, DRIVER_LICENSE.

Add the class OTHER if you need the model to identify documents that are not one of the explicitly defined classes.

  4. Once ready, click on Create Utility to create your custom Split Model. This step will also generate the model's unique ID.

  5. You can now use the Live Test tab to process documents, and the Utility Configuration to update your classes.
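As a sketch of how the classes defined in the pop-up might drive a downstream workflow, the Python below maps each example class to a handler and uses OTHER as the catch-all for documents outside the explicitly defined classes. The handler functions, the `HANDLERS` table, and the `route` helper are hypothetical names chosen for this example; none of them are part of the platform.

```python
def process_invoice(pages):
    return f"invoice workflow: pages {pages[0]}-{pages[1]}"


def process_receipt(pages):
    return f"receipt workflow: pages {pages[0]}-{pages[1]}"


def process_driver_license(pages):
    return f"driver license workflow: pages {pages[0]}-{pages[1]}"


def send_to_manual_review(pages):
    return f"manual review: pages {pages[0]}-{pages[1]}"


# Keys mirror the example classes from the pop-up step, plus OTHER.
HANDLERS = {
    "INVOICE": process_invoice,
    "RECEIPT": process_receipt,
    "DRIVER_LICENSE": process_driver_license,
    "OTHER": send_to_manual_review,
}


def route(document_class, pages):
    """Dispatch one split document to the handler registered for its class."""
    return HANDLERS[document_class](pages)


print(route("INVOICE", (0, 1)))
print(route("OTHER", (2, 2)))
```

Keeping the class-to-handler mapping in one table means that a class added in the Utility Configuration needs only one new entry on the client side.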

Your utility is now available in your Models tab.

Integration

Class names are returned in the API response exactly as defined on the platform, including spaces and capitalization.

If class names are changed on the platform, the change takes effect in the API response immediately for all newly sent files.
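Because class names come back verbatim, client code should compare them without normalizing case or whitespace. It can also help to fail loudly when an unexpected name appears, for instance after a rename on the platform. The following is a minimal sketch: `EXPECTED_CLASSES`, `UnknownClassError`, and `check_class_name` are hypothetical names, and the class list is illustrative only.

```python
# Class names exactly as they would be defined on the platform (illustrative).
EXPECTED_CLASSES = frozenset({"INVOICE", "RECEIPT", "Driver License"})


class UnknownClassError(ValueError):
    """Raised when a returned class name is not one the client knows about."""


def check_class_name(name):
    """Return the name unchanged if it matches a known class exactly.

    No lowercasing or stripping is applied, so "driver license" and
    "Driver License " are both rejected.
    """
    if name not in EXPECTED_CLASSES:
        raise UnknownClassError(
            f"Unexpected class {name!r}; was it renamed on the platform?"
        )
    return name


print(check_class_name("Driver License"))

try:
    check_class_name("driver license")
except UnknownClassError as exc:
    print(f"rejected: {exc}")
```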

Once your Split model is created and tested, integration documentation is provided in the "Documentation" page, or here: Split Quick Start.
