V1 Client
- class Client(api_key='', http_client=None)
Mindee API Client.
- Parameters:
api_key (str)
http_client (Client | None)
- close()
Close the HTTP client.
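Because the client exposes close(), it can be wrapped in the standard library's contextlib.closing so that the HTTP client is released even if a call raises. The sketch below is a minimal illustration: StubClient and run_with_client are hypothetical names used so that the example runs without the library. With a real Client the same pattern should apply.

```python
from contextlib import closing


class StubClient:
    """Hypothetical stand-in that mimics only Client.close()."""

    def __init__(self):
        self.closed = False

    def close(self):
        # The real Client closes its HTTP client here.
        self.closed = True


def run_with_client(client, work):
    """Call work(client), then always call client.close()."""
    with closing(client) as active_client:
        return work(active_client)


stub = StubClient()
outcome = run_with_client(stub, lambda client: "parsed")
```

After the with block exits, stub.closed is True whether or not work raised an exception.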
- create_endpoint(endpoint_name, account_name='mindee', version=None)
Add a custom endpoint, created using the Mindee API Builder.
- Parameters:
endpoint_name (str) – The “API name” field in the “Settings” page of the API Builder.
account_name (str, default: 'mindee') – Your organization’s username on the API Builder.
version (str | None, default: None) – If set, locks the version of the model to use. If not set, uses the latest version of the model.
- Return type:
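As a rough sketch of how a custom endpoint might be used, the value returned by create_endpoint() can be passed as the endpoint argument of parse(). RecordingClient is a hypothetical stand-in that only records calls, and the endpoint name, account name, and version shown are placeholders. Only the method names and keyword arguments come from this reference.

```python
class RecordingClient:
    """Hypothetical stand-in that records calls instead of contacting the API."""

    def __init__(self):
        self.calls = []

    def create_endpoint(self, endpoint_name, account_name="mindee", version=None):
        self.calls.append("create_endpoint")
        # The real method returns an endpoint object; a dict stands in for it here.
        return {
            "endpoint_name": endpoint_name,
            "account_name": account_name,
            "version": version,
        }

    def parse(self, product_class, input_source, endpoint=None, **options):
        self.calls.append("parse")
        return {"endpoint": endpoint}


def parse_with_custom_endpoint(client, product_class, input_source,
                               endpoint_name, account_name, version=None):
    """Create the custom endpoint, then pass it to parse()."""
    endpoint = client.create_endpoint(
        endpoint_name, account_name=account_name, version=version
    )
    return client.parse(product_class, input_source, endpoint=endpoint)


recording_client = RecordingClient()
custom_response = parse_with_custom_endpoint(
    recording_client, "SomeProduct", "some-source",
    "my_api", "my_account", version="1",
)
```

Leaving version as None follows the documented default, which uses the latest version of the model.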
- enqueue(product_class, input_source, include_words=False, close_file=True, page_options=None, cropper=False, endpoint=None, full_text=False, workflow_id=None, rag=False)
Enqueues a document to an asynchronous endpoint.
- Parameters:
product_class (type[Inference]) – The document class to use. The response object will be instantiated based on this parameter.
input_source (LocalInputSource | URLInputSource) – The document/source file to use. Has to be created beforehand.
include_words (bool, default: False) – Whether to include the full text for each page. This performs a full OCR operation on the server and will increase response time.
close_file (bool, default: True) – Whether to close() the file after parsing it. Set to False if you need to access the file after this operation.
page_options (PageOptions | None, default: None) – If set, remove pages from the document as specified. This is done before sending the file to the server. It is useful to avoid page limitations.
cropper (bool, default: False) – Whether to include cropper results for each page. This performs a cropping operation on the server and will increase response time.
endpoint (Endpoint | None, default: None) – For custom endpoints, an endpoint has to be given.
full_text (bool, default: False) – Whether to include the full OCR text response in compatible APIs.
workflow_id (str | None, default: None) – Workflow ID.
rag (bool, default: False) – If set, will enable Retrieval-Augmented Generation. Only works if a valid workflow_id is set.
- Return type:
- enqueue_and_parse(product_class, input_source, include_words=False, close_file=True, page_options=None, cropper=False, endpoint=None, initial_delay_sec=2, delay_sec=1.5, max_retries=80, full_text=False, workflow_id=None, rag=False)
Enqueues to an asynchronous endpoint and automatically polls for a response.
- Parameters:
product_class (type[Inference]) – The document class to use. The response object will be instantiated based on this parameter.
input_source (LocalInputSource | URLInputSource) – The document/source file to use. Has to be created beforehand.
include_words (bool, default: False) – Whether to include the full text for each page. This performs a full OCR operation on the server and will increase response time.
close_file (bool, default: True) – Whether to close() the file after parsing it. Set to False if you need to access the file after this operation.
page_options (PageOptions | None, default: None) – If set, remove pages from the document as specified. This is done before sending the file to the server. It is useful to avoid page limitations.
cropper (bool, default: False) – Whether to include cropper results for each page. This performs a cropping operation on the server and will increase response time.
endpoint (Endpoint | None, default: None) – For custom endpoints, an endpoint has to be given.
initial_delay_sec (float, default: 2) – Delay before the first polling attempt. This should not be shorter than 1 second.
delay_sec (float, default: 1.5) – Delay between polling attempts. This should not be shorter than 1 second.
max_retries (int, default: 80) – Total number of polling attempts.
full_text (bool, default: False) – Whether to include the full OCR text response in compatible APIs.
workflow_id (str | None, default: None) – Workflow ID.
rag (bool, default: False) – If set, will enable Retrieval-Augmented Generation. Only works if a valid workflow_id is set.
- Return type:
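The three polling parameters together bound how long enqueue_and_parse() may wait. The helper below is a back-of-the-envelope sketch, not the library's own logic. It assumes one initial wait followed by at most one delay per polling attempt, and it ignores network time. The name max_polling_wait is hypothetical.

```python
def max_polling_wait(initial_delay_sec=2.0, delay_sec=1.5, max_retries=80):
    """Estimate the longest time, in seconds, spent waiting between polls.

    Assumption: one initial delay, then at most one delay_sec wait per attempt.
    """
    if initial_delay_sec < 1 or delay_sec < 1:
        # The reference states that delays should not be shorter than 1 second.
        raise ValueError("delays should not be shorter than 1 second")
    return initial_delay_sec + delay_sec * max_retries


# With the documented defaults: 2 + 1.5 * 80 = 122 seconds.
print(max_polling_wait())  # 122.0
```

Under these assumptions the defaults allow roughly two minutes of waiting before polling gives up.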
- execute_workflow(input_source, workflow_id, options=None, page_options=None)
Send the document to a workflow execution.
- Parameters:
input_source (LocalInputSource | URLInputSource) – The document/source file to use. Has to be created beforehand.
workflow_id (str) – ID of the workflow.
page_options (PageOptions | None, default: None) – If set, remove pages from the document as specified. This is done before sending the file to the server. It is useful to avoid page limitations.
options (WorkflowOptions | None, default: None) – Options for the workflow.
- Return type:
- Returns:
- load_prediction(product_class, local_response)
Load a prediction.
- Parameters:
product_class (type[Inference]) – Class of the product to use.
local_response (LocalResponse) – Local response to load.
- Return type:
- Returns:
A valid prediction.
- parse(product_class, input_source, include_words=False, close_file=True, page_options=None, cropper=False, endpoint=None, full_text=False)
Call prediction API on the document and parse the results.
- Parameters:
product_class (type[Inference]) – The document class to use. The response object will be instantiated based on this parameter.
input_source (LocalInputSource | URLInputSource) – The document/source file to use. Has to be created beforehand.
include_words (bool, default: False) – Whether to include the full text for each page. This performs a full OCR operation on the server and will increase response time. Only available on financial document APIs.
close_file (bool, default: True) – Whether to close() the file after parsing it. Set to False if you need to access the file after this operation.
page_options (PageOptions | None, default: None) – If set, remove pages from the document as specified. This is done before sending the file to the server. It is useful to avoid page limitations.
cropper (bool, default: False) – Whether to include cropper results for each page. This performs a cropping operation on the server and will increase response time.
endpoint (Endpoint | None, default: None) – For custom endpoints, an endpoint has to be given.
full_text (bool, default: False) – Whether to include the full OCR text response in compatible APIs.
- Return type:
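The close_file parameter matters when the same input source is parsed more than once. The sketch below passes close_file=False on every call except the last, so the file stays available between calls. FlagRecordingClient is a hypothetical stand-in that records the flag it receives, and parse_with_each_product is an illustrative helper, not part of the library.

```python
class FlagRecordingClient:
    """Hypothetical stand-in that records the close_file flag of each call."""

    def __init__(self):
        self.close_file_flags = []

    def parse(self, product_class, input_source, close_file=True, **options):
        self.close_file_flags.append(close_file)
        return (product_class, input_source)


def parse_with_each_product(client, product_classes, input_source):
    """Parse one input source with several products, closing it only at the end."""
    responses = []
    last_index = len(product_classes) - 1
    for index, product_class in enumerate(product_classes):
        responses.append(
            client.parse(
                product_class,
                input_source,
                # Keep the file open until the final call.
                close_file=(index == last_index),
            )
        )
    return responses


flag_client = FlagRecordingClient()
all_responses = parse_with_each_product(
    flag_client, ["ProductA", "ProductB", "ProductC"], "shared-source"
)
```

The recorded flags are False, False, True: only the final call is allowed to close the file.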
- parse_queued(product_class, queue_id, endpoint=None)
Parses a queued document.
- Parameters:
product_class (type[Inference]) – The document class to use. The response object will be instantiated based on this parameter.
queue_id (str) – queue_id received from the API.
endpoint (Endpoint | None, default: None) – For custom endpoints, an endpoint has to be given.
- Return type:
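enqueue_and_parse() already polls automatically. When more control is needed, enqueue() and parse_queued() can be combined by hand, roughly as sketched below. This section does not describe how the responses expose the queue ID or the job status, so the sketch takes two caller-supplied functions, get_queue_id and is_finished, which are hypothetical. StubAsyncClient is likewise a stand-in so that the example runs without the library.

```python
import time


class StubAsyncClient:
    """Hypothetical stand-in: reports 'processing' twice, then 'completed'."""

    def __init__(self):
        self.polls = 0

    def enqueue(self, product_class, input_source, **options):
        return {"queue_id": "job-1"}

    def parse_queued(self, product_class, queue_id, endpoint=None):
        self.polls += 1
        status = "completed" if self.polls >= 3 else "processing"
        return {"queue_id": queue_id, "status": status}


def enqueue_then_poll(client, product_class, input_source,
                      get_queue_id, is_finished,
                      initial_delay_sec=2.0, delay_sec=1.5, max_retries=80,
                      sleep=time.sleep):
    """Enqueue a document, then poll parse_queued() until it is finished."""
    queued = client.enqueue(product_class, input_source)
    queue_id = get_queue_id(queued)
    sleep(initial_delay_sec)
    for _ in range(max_retries):
        response = client.parse_queued(product_class, queue_id)
        if is_finished(response):
            return response
        sleep(delay_sec)
    raise TimeoutError(
        f"no result for queue_id {queue_id} after {max_retries} attempts"
    )


waits = []  # Collects the requested delays instead of actually sleeping.
stub_client = StubAsyncClient()
final_response = enqueue_then_poll(
    stub_client, "SomeProduct", "some-source",
    get_queue_id=lambda response: response["queue_id"],
    is_finished=lambda response: response["status"] == "completed",
    sleep=waits.append,
)
```

Passing sleep as a parameter keeps the loop testable. In real use the default time.sleep applies the documented delays.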
- send_feedback(product_class, document_id, feedback, endpoint=None)
Send feedback for a document.
- Parameters:
product_class (type[Inference]) – The document class to use. The response object will be instantiated based on this parameter.
document_id (str) – The id of the document to send feedback to.
feedback (dict[str, Any]) – Feedback to send.
endpoint (Endpoint | None, default: None) – For custom endpoints, an endpoint has to be given.
- Return type:
- api_key: str
API key for all endpoints.
- http_client: Client | None
HTTP client for making requests.
- get_bound_classname(type_var)
Get the name of the bound class.
- Return type:
str