OutputResponse
The OutputResponse object contains the processed results of a document analysis task.
Collection of document chunks, where each chunk contains one or more segments. See Chunk below.
The name of the file.
The number of pages in the file.
The presigned URL of the PDF file.
DEPRECATED: The extracted JSON from the document.
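As a rough illustration of how these top-level fields fit together, the sketch below summarizes a response that has already been parsed into a dictionary. The key names (`file_name`, `page_count`, `pdf_url`, `chunks`, `segments`, `text`) are assumptions made for the example, since the field names are not shown in the listing above, and the sample payload is invented.

```python
import json

# Hypothetical sample payload. The key names are assumptions for
# illustration only; check them against a real response before relying on them.
sample = json.loads("""
{
  "file_name": "report.pdf",
  "page_count": 2,
  "pdf_url": "https://example.com/report.pdf",
  "chunks": [
    {"segments": [{"text": "Quarterly Report"}, {"text": "Revenue grew."}]},
    {"segments": [{"text": "Outlook"}]}
  ]
}
""")


def summarize_output(output: dict) -> dict:
    """Return basic counts for an OutputResponse-like dictionary."""
    chunks = output.get("chunks", [])
    return {
        "file_name": output.get("file_name"),
        "page_count": output.get("page_count"),
        "chunk_count": len(chunks),
        # Each chunk holds one or more segments.
        "segment_count": sum(len(c.get("segments", [])) for c in chunks),
    }


summary = summarize_output(sample)
print(summary)
```

Using `.get` with defaults keeps the helper tolerant of missing keys, which seems prudent given that at least one field in this object is marked as deprecated.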
Chunk
A Chunk represents a logical grouping of segments from the document. Chunks are created based on the target_length configuration.
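The length-based grouping can be pictured as a greedy pass over the segments: whole segments are added to a chunk until the next one would exceed the target, and a non-positive target yields one segment per chunk. The sketch below is only an illustration of that rule, not the service's actual implementation. The function names are hypothetical, and a whitespace word count stands in for the configured tokenizer.

```python
from typing import Callable, List


def count_tokens(text: str) -> int:
    """Stand-in tokenizer (assumption): counts whitespace-separated words."""
    return len(text.split())


def group_segments(
    segments: List[str],
    target_length: int,
    tokenizer: Callable[[str], int] = count_tokens,
) -> List[List[str]]:
    """Greedily pack whole segments into chunks of at most target_length tokens."""
    if target_length <= 0:
        # No target: every chunk contains exactly one segment.
        return [[seg] for seg in segments]

    chunks: List[List[str]] = []
    current: List[str] = []
    current_len = 0
    for seg in segments:
        seg_len = tokenizer(seg)
        # Close the current chunk if adding this segment would overflow it.
        if current and current_len + seg_len > target_length:
            chunks.append(current)
            current, current_len = [], 0
        # Segments are never split, so an oversized one gets its own chunk.
        current.append(seg)
        current_len += seg_len
    if current:
        chunks.append(current)
    return chunks


segments = ["Intro title", "one two three four", "five six", "a b c d e f g"]
grouped = group_segments(segments, target_length=6)
ungrouped = group_segments(segments, target_length=0)
```

With a target of 6, the first two segments share a chunk, while the seven-word segment exceeds the target on its own and is kept intact in a separate chunk.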
The unique identifier for the chunk.
The total number of tokens in the chunk. Calculated by the configured tokenizer.
Collection of document segments that form this chunk. When target_chunk_length > 0, contains the maximum number of segments that fit within that length (segments remain intact). Otherwise, contains exactly one segment. See Segment below.
Suggested text to be embedded for the chunk. This text is generated by combining the embed content from each segment according to the configured embed sources (HTML, Markdown, LLM, or Content). Can be configured using embed_sources in the SegmentProcessing configuration.
Segment
A Segment represents a logical element within a document page (e.g., title, paragraph, table, image).
Unique identifier for the segment.
The type of the segment. See Segment Types for all possible values.
Bounding box coordinates for the segment.
Page number of the segment (1-indexed).
Width of the page containing the segment.
Height of the page containing the segment.
Content of the segment. This is either HTML or Markdown, depending on the format chosen in the segment processing configuration.
HTML representation of the segment.
Markdown representation of the segment.
Text content of the segment. Calculated from the OCR results.
LLM-generated representation of the segment. Only present if LLM processing is configured for this segment type.
Presigned URL to the cropped image of the segment. Only present if cropping is enabled for this segment type.
Confidence score of the layout analysis model for this segment (0.0 to 1.0).
OCR results for the segment.
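To show how the positional and confidence fields of a segment might be used together, the sketch below groups segments by page, drops low-confidence ones, and converts a bounding box to fractions of the page size. All key names (`bbox`, `left`, `top`, `width`, `height`, `page_number`, `page_width`, `page_height`, `confidence`) are assumptions for the example, as is the shape of the bounding box. The sample data is invented.

```python
from collections import defaultdict
from typing import Dict, List


def normalized_bbox(segment: dict) -> Dict[str, float]:
    """Express the bounding box as fractions of the page dimensions."""
    bbox = segment["bbox"]
    page_w, page_h = segment["page_width"], segment["page_height"]
    return {
        "left": bbox["left"] / page_w,
        "top": bbox["top"] / page_h,
        "width": bbox["width"] / page_w,
        "height": bbox["height"] / page_h,
    }


def segments_by_page(segments: List[dict], min_confidence: float = 0.5) -> Dict[int, List[dict]]:
    """Group segments by 1-indexed page number, skipping low-confidence ones."""
    pages: Dict[int, List[dict]] = defaultdict(list)
    for seg in segments:
        confidence = seg.get("confidence")
        # Keep segments that carry no confidence score at all.
        if confidence is not None and confidence < min_confidence:
            continue
        pages[seg["page_number"]].append(seg)
    return dict(pages)


# Hypothetical segments for demonstration.
sample_segments = [
    {"page_number": 1, "page_width": 1000, "page_height": 2000, "confidence": 0.9,
     "bbox": {"left": 100, "top": 200, "width": 500, "height": 400}},
    {"page_number": 1, "page_width": 1000, "page_height": 2000, "confidence": 0.2,
     "bbox": {"left": 0, "top": 0, "width": 10, "height": 10}},
    {"page_number": 2, "page_width": 1000, "page_height": 2000, "confidence": 0.7,
     "bbox": {"left": 50, "top": 50, "width": 900, "height": 100}},
]

pages = segments_by_page(sample_segments, min_confidence=0.5)
box = normalized_bbox(sample_segments[0])
```

Normalizing against the page width and height makes boxes comparable across pages of different sizes, which is one plausible reason both dimensions are reported per segment.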