PDF, PNG, JPG, TIFF (150 DPI or more recommended)
The document action builder does not support previewing TIFF files. However, you can extract data and review results from these files.
Quotas and Limits
The following table shows the quotas and limitations of the IDP service:
Limit | Description | ||||
---|---|---|---|---|---|
Accepted File Formats |
|||||
File Size Limits |
|
||||
Prompt Limits |
|
||||
Prompt Response Limits |
|
||||
Request Limits |
|
||||
Document Languages |
Language support varies depending on the selected large language model (LLM) in document actions. For a list of supported languages by provider, see: |
||||
Polling Limits |
10-second interval for retrieving results |
Token Usage and Estimation
A token is a unit of text that a large language model (LLM) processes. Tokens are essential for breaking down input data into manageable pieces that can be analyzed and generated. These are some estimated conversions that might help understand token lengths:
-
1 token ~= 4 characters
-
1 token ~= ¾ words
-
1-2 sentences ~= 30 tokens
-
1,500 words ~= 2048 tokens
When analyzing images, each model uses a different tokenization strategy that affects how they calculate tokens. Refer to each provider’s documentation for additional guidance:
Document Retention Policy
IDP retains the document files and extracts data as needed to process the document extractions. IDP uses the following features to enhance data retention:
-
Document Action Editor
IDP temporarily stores the files you modify in the editor during the testing process and immediately removes these files after you close the editor. IDP also removes the extracted data and the files when you end the session or upload a new file.
-
Document Action Execution Endpoint
When you use the execution endpoint API, IDP stores the files in an S3 bucket and stores the data extracted during the execution securely.
For all successful executions, IDP retains this data for 24 hours. For executions that require human review, IDP retains this data for seven days after the user task is completed. IDP retains any incomplete tasks for 30 days. Currently, you cannot configure these retention periods.
Data Residency Policy:
IDP uses AWS services such as S3 and RDS to store files and the extracted data. IDP currently supports 2 regions: US (Virginia) and EU (Frankfurt, Germany). IDP stores the data closest to your Anypoint control plane region to ensure GDPR requirements. For example, if your Anypoint control plane runs in the EU region, IDP stores your data in the Frankfurt data center.