Pipeline

pipeline.run()

Executes a custom pipeline synchronously on a document.

client.pipeline.run(
    input="https://example.com/document.pdf",
    pipeline_id="your-pipeline-id",
    settings={...}
)

Parameters

input

string | list[string]

required

The URL of the document to be processed. You can provide one of the following:

A publicly available URL
A presigned S3 URL
A reducto:// prefixed URL obtained from the /upload endpoint after directly uploading a document
A jobid:// prefixed URL obtained from a previous /parse invocation
A list of URLs (for multi-document pipelines, V3 API only)

For edit pipelines, this should be a string containing the edit instructions.

pipeline_id

string

required

The ID of the pipeline to use for the document.

settings

object

Settings for pipeline execution that override pipeline defaults.

Response

PipelineResponse

object

Returns the result of the pipeline execution.

result

object

The processed document data according to the pipeline configuration.

pipeline.run_job()

Executes a custom pipeline asynchronously and returns a job ID immediately.

response = client.pipeline.run_job(
    input="https://example.com/document.pdf",
    pipeline_id="your-pipeline-id",
    async_={"webhook": {"url": "https://example.com/webhook"}},
    settings={...}
)

print(response.job_id)  # Use this to check job status later

Parameters

input

string | list[string]

required

The URL of the document to be processed. You can provide one of the following:

A publicly available URL
A presigned S3 URL
A reducto:// prefixed URL obtained from the /upload endpoint after directly uploading a document
A jobid:// prefixed URL obtained from a previous /parse invocation
A list of URLs (for multi-document pipelines, V3 API only)

For edit pipelines, this should be a string containing the edit instructions.

pipeline_id

string

required

The ID of the pipeline to use for the document.

async_

ConfigV3AsyncConfig

The configuration options for asynchronous processing (default synchronous).

settings

object

Settings for pipeline execution that override pipeline defaults.

Response

PipelineRunJobResponse

object

job_id

string

The ID of the asynchronous job. Use client.job.get(job_id) to retrieve the result when the job completes.

Client

Resources

Types

Exceptions

pipeline.run()

Parameters

Response

pipeline.run_job()

Parameters

Response

Build docs developers (and LLMs) love

Client

Resources

Types

Exceptions

Documentation Index

​pipeline.run()

​Parameters

​Response

​pipeline.run_job()

​Parameters

​Response

Build docs developers (and LLMs) love

pipeline.run()

Parameters

Response

pipeline.run_job()

Parameters

Response