Enhancing Data Extraction with Einstein
By default, IDP uses its natural language processing model (IDP NLP) to extract data based on the configured prompts. When you create a document action, you can select Einstein to analyze the document and extract the data.
Use Einstein to answer complex questions about a document, such as the total of an invoice after deducting taxes and other items, or to extract data from non-standard documents, such as a driver’s license or a medical record that contains handwriting.
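To make the invoice example concrete, the following minimal sketch shows the kind of derived value such a question asks for. It is not IDP or Einstein code: the field names (`total`, `taxes`, `other_deductions`) and the amounts are hypothetical, chosen only to illustrate the arithmetic behind a prompt like "What is the total after deducting taxes?".

```python
from decimal import Decimal

# Hypothetical values, as they might appear on an invoice.
# These names and amounts are illustrative assumptions, not IDP output.
invoice = {
    "total": Decimal("1190.00"),
    "taxes": Decimal("190.00"),
    "other_deductions": Decimal("0.00"),
}


def net_total(fields: dict[str, Decimal]) -> Decimal:
    """Return the invoice total after deducting taxes and other deductions."""
    return fields["total"] - fields["taxes"] - fields["other_deductions"]


print(net_total(invoice))  # 1000.00
```

A question like this needs a value that is computed from several fields rather than read directly from the page, which is why it counts as a complex question.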
Einstein supports these predictive models:
- OpenAI’s GPT-4o (gpt-4o-2024-08-06) LLM
- OpenAI’s GPT-4o Mini (gpt-4o-mini-2024-07-18) LLM
Einstein accesses these models through the Salesforce Einstein Trust Layer, which is part of the Salesforce Einstein platform.
Select the model to use during document analysis by configuring Settings in the document action editor.
Document actions created before February 5th support only OpenAI’s GPT-4o (gpt-4o-2024-05-13). To enable model selection, create a new document action.
To analyze any type of document with Einstein and fully customize the output structure, see Analyzing Documents With Custom User-Defined Schemas.
Before You Begin Using Einstein
- Ensure you have any of these Anypoint permissions:
  - Manage Actions: Gives a user complete access to IDP and assigns reviewer permission by default for every document action.
  - Build Actions: Enables a user to create, edit, publish, and execute document actions and assign reviewers to the actions.
- Enable MuleSoft Anypoint Platform to publish MuleSoft assets to Salesforce.
Select the Response from Einstein
To configure Einstein to analyze and extract data for a given prompt:
- In the document action builder, add a new prompt to a document action or edit an existing one.
- Run a document analysis to generate results in the Outputs panel.
- Click the dropdown button next to a prompt’s result.
- Select the response labeled Einstein.
When you query a published document action, IDP uses the model selected in each prompt to extract the data for the corresponding field.
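The per-prompt selection described above can be pictured with a small conceptual sketch. This is not the IDP API: the engine functions, the `ACTION` structure, and the field names are all hypothetical stand-ins, meant only to show that each field is extracted by whichever response was selected for its prompt.

```python
from typing import Callable


# Hypothetical stand-ins for the two extraction engines. Real extraction
# happens inside IDP; these functions only tag which engine was used.
def extract_with_idp_nlp(prompt: str, document: str) -> str:
    return f"[IDP NLP] {prompt}"


def extract_with_einstein(prompt: str, document: str) -> str:
    return f"[Einstein] {prompt}"


ENGINES: dict[str, Callable[[str, str], str]] = {
    "IDP NLP": extract_with_idp_nlp,
    "Einstein": extract_with_einstein,
}

# Hypothetical published document action: one prompt per field, plus the
# response that was selected for that prompt in the builder.
ACTION = {
    "invoice_number": {
        "prompt": "What is the invoice number?",
        "selected": "IDP NLP",
    },
    "net_total": {
        "prompt": "What is the total after deducting taxes?",
        "selected": "Einstein",
    },
}


def run_action(action: dict, document: str) -> dict[str, str]:
    """Extract every field with the engine selected for its prompt."""
    results = {}
    for field, config in action.items():
        engine = ENGINES[config["selected"]]
        results[field] = engine(config["prompt"], document)
    return results


results = run_action(ACTION, "sample document text")
for field, value in results.items():
    print(f"{field}: {value}")
```

In this sketch the selection is stored per field rather than per document action, which mirrors the behavior described above: a single action can mix fields answered by IDP NLP with fields answered by Einstein.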