The Configuration object controls how Chunkr processes your documents. It includes settings for OCR, segmentation, chunking, and post-processing.

Configuration Fields

ocr_strategy (OcrStrategy, required): Controls the Optical Character Recognition (OCR) strategy.
segmentation_strategy (SegmentationStrategy, required): Controls the segmentation strategy.
chunk_processing (ChunkProcessing, required): Controls the chunking and post-processing of each chunk.
segment_processing (SegmentProcessing, required): Controls the post-processing of each segment type. See Segment Processing for detailed configuration options.
high_resolution (boolean, required, default: true): Whether to use high-resolution images for cropping and post-processing. (Latency penalty: ~7 seconds per page)
error_handling (ErrorHandlingStrategy, required): Controls how errors are handled during processing.
llm_processing (LlmProcessing, required): Controls the LLM used for the task.
expires_in (integer): The number of seconds until the task is deleted. Expired tasks cannot be updated, polled, or accessed via the web interface.
input_file_url (string): The presigned URL of the input file.
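
As a rough illustration of the shape described above, the following sketch checks a plain dictionary for the fields marked required. The field names come from this page; the helper `missing_required`, the constant `REQUIRED_FIELDS`, and every placeholder value are assumptions made for the example and are not part of any Chunkr SDK.

```python
# Field names are taken from the Configuration Fields list on this page.
# The helper and the placeholder values are hypothetical, for illustration only.
REQUIRED_FIELDS = (
    "ocr_strategy",
    "segmentation_strategy",
    "chunk_processing",
    "segment_processing",
    "high_resolution",
    "error_handling",
    "llm_processing",
)


def missing_required(config: dict) -> list:
    """Return the required Configuration fields that are absent from `config`."""
    return [name for name in REQUIRED_FIELDS if name not in config]


# Placeholder strings stand in for real enum members and nested objects,
# which this page does not enumerate.
example_config = {
    "ocr_strategy": "<OcrStrategy>",
    "segmentation_strategy": "<SegmentationStrategy>",
    "chunk_processing": {},
    "segment_processing": {},
    "high_resolution": True,  # documented default
    "error_handling": "<ErrorHandlingStrategy>",
    "llm_processing": {},
    "expires_in": 3600,  # optional: seconds until the task is deleted
}

print(missing_required(example_config))
print(missing_required({"high_resolution": True}))
```

The optional fields (expires_in, input_file_url) are deliberately left out of the check, since the page does not mark them as required.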

Deprecated Fields

target_chunk_length (integer, deprecated): Use chunk_processing.target_length instead.
json_schema (object, deprecated): The extracted JSON schema from the document.
model (Model, deprecated): Model selection.
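
The deprecation note for target_chunk_length suggests a simple migration for configurations stored as dictionaries. The sketch below is one way to do it; the helper name `migrate_target_chunk_length` is hypothetical, and the rule that an explicitly set chunk_processing.target_length takes precedence over the legacy value is an assumption, not documented behavior.

```python
import copy


def migrate_target_chunk_length(config: dict) -> dict:
    """Return a copy of `config` with the deprecated target_chunk_length
    moved to chunk_processing.target_length. The input is left unmodified."""
    migrated = copy.deepcopy(config)
    if "target_chunk_length" in migrated:
        legacy_value = migrated.pop("target_chunk_length")
        # Treat a missing or null chunk_processing as an empty object.
        chunk_processing = migrated.get("chunk_processing") or {}
        # Assumption: an explicit target_length wins over the legacy field.
        chunk_processing.setdefault("target_length", legacy_value)
        migrated["chunk_processing"] = chunk_processing
    return migrated


old_config = {"target_chunk_length": 512, "high_resolution": True}
new_config = migrate_target_chunk_length(old_config)
print(new_config)
```

Returning a deep copy keeps the caller's original dictionary intact, which makes it easier to compare the before and after forms while migrating.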