Documentation Index
Fetch the complete documentation index at: https://mintlify.com/reductoai/reducto-python-sdk/llms.txt
Use this file to discover all available pages before exploring further.
extract.run()
Extracts structured data from a document synchronously based on provided instructions.Parameters
The URL of the document to be processed. You can provide one of the following:
- A publicly available URL
- A presigned S3 URL
- A
reducto://prefixed URL obtained from the /upload endpoint after directly uploading a document - A
jobid://prefixed URL obtained from a previous /parse invocation - A list of URLs (for multi-document pipelines, V3 API only)
The instructions to use for the extraction. Define the schema and extraction prompts.
The configuration options for parsing the document. If you are passing in a
jobid:// URL for the file, then this configuration will be ignored.The settings to use for the extraction.
The configuration options for asynchronous processing (default synchronous). Only available when using async mode.
Response
Returns either an
ExtractResponse with the extracted data (sync mode) or an AsyncExtractResponse containing a job_id (async mode).extract.run_job()
Extracts structured data from a document asynchronously and returns a job ID immediately.Parameters
The URL of the document to be processed. You can provide one of the following:
- A publicly available URL
- A presigned S3 URL
- A
reducto://prefixed URL obtained from the /upload endpoint after directly uploading a document - A
jobid://prefixed URL obtained from a previous /parse invocation - A list of URLs (for multi-document pipelines, V3 API only)
The instructions to use for the extraction. Define the schema and extraction prompts.
The configuration options for asynchronous processing (default synchronous).
The configuration options for parsing the document. If you are passing in a
jobid:// URL for the file, then this configuration will be ignored.The settings to use for the extraction.