Overview
Kokokor provides first-class support for right-to-left (RTL) languages like Arabic. The isRTL option flips x-coordinates to ensure proper text ordering and alignment.
RTL Processing Example
How RTL Processing Works
Coordinate Flipping
When isRTL: true is set, Kokokor flips the x-axis by transforming each observation’s x-coordinate. This ensures that text on the right side of the page is processed first.
Normalization
X-coordinates are normalized using DPI information to ensure consistent alignment regardless of document resolution.
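As a rough illustration of the flipping and normalization steps described in this section, the sketch below mirrors each bounding box across the page width and rescales x-coordinates to a reference resolution. The `RtlObservation` shape, the `flipX` and `normalizeX` helpers, and the choice of 72 DPI as the reference are assumptions made for this example, not Kokokor's actual types or implementation.

```typescript
// Hypothetical observation shape for illustration only (not Kokokor's types).
interface RtlObservation {
    text: string;
    bbox: { x: number; y: number; width: number; height: number };
}

// Mirror a bounding box horizontally inside a page of the given width,
// so that the rightmost box ends up with the smallest x value.
function flipX(obs: RtlObservation, pageWidth: number): RtlObservation {
    const { bbox } = obs;
    return { ...obs, bbox: { ...bbox, x: pageWidth - (bbox.x + bbox.width) } };
}

// Rescale an x-coordinate from the document's DPI to a reference DPI
// (72 is an assumed reference value).
function normalizeX(x: number, dpi: number, referenceDpi = 72): number {
    return (x * referenceDpi) / dpi;
}

const pageWidth = 1000;
const sample: RtlObservation[] = [
    { text: 'left', bbox: { x: 100, y: 50, width: 200, height: 40 } },
    { text: 'right', bbox: { x: 700, y: 50, width: 200, height: 40 } },
];

// After flipping, an ascending sort on x visits the right-hand text first.
const flipped = sample.map((obs) => flipX(obs, pageWidth));
const readingOrder = [...flipped]
    .sort((a, b) => a.bbox.x - b.bbox.x)
    .map((obs) => obs.text);

console.log(readingOrder); // [ 'right', 'left' ]
console.log(normalizeX(300, 300)); // 72
```

Flipping first and then sorting in ascending order lets the rest of a pipeline treat RTL and LTR input the same way.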
Complete Arabic Document Example
Configuration Options
Enable RTL text processing. Set to true for Arabic, Hebrew, and other RTL languages.
Vertical tolerance in pixels (at 72 DPI) for grouping observations into lines.
Options for detecting Arabic poetry patterns, including hemistichs.
Delimiter used when merging detected poetry hemistichs into a single line.
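To make the vertical tolerance option above more concrete, here is a minimal sketch of how observations might be grouped into lines. The `Word` shape, the `groupIntoLines` helper, and the way the tolerance is scaled from 72 DPI to the document's DPI are assumptions for illustration; Kokokor's own grouping logic may differ.

```typescript
// Hypothetical word shape for illustration only (not Kokokor's types).
interface Word {
    text: string;
    x: number;
    y: number;
}

// Group words whose y-coordinates fall within a tolerance of the first word
// in the current line. The tolerance is given at 72 DPI and scaled to the
// document's actual DPI (an assumed interpretation of "at 72 DPI").
function groupIntoLines(words: Word[], toleranceAt72Dpi: number, dpi: number): Word[][] {
    const tolerance = toleranceAt72Dpi * (dpi / 72);
    const sorted = [...words].sort((a, b) => a.y - b.y);
    const lines: { y: number; words: Word[] }[] = [];

    for (const word of sorted) {
        const last = lines[lines.length - 1];
        if (last && Math.abs(word.y - last.y) <= tolerance) {
            last.words.push(word); // close enough vertically: same line
        } else {
            lines.push({ y: word.y, words: [word] }); // start a new line
        }
    }
    return lines.map((line) => line.words);
}

const words: Word[] = [
    { text: 'a', x: 10, y: 100 },
    { text: 'b', x: 60, y: 106 },
    { text: 'c', x: 10, y: 130 },
];

// A tolerance of 5 at 72 DPI becomes 10 pixels at 144 DPI.
const grouped = groupIntoLines(words, 5, 144);
console.log(grouped.map((line) => line.map((w) => w.text).join(' '))); // [ 'a b', 'c' ]
```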
Important Notes
Tatweel Character: The tatweel character (ـ) is automatically removed during word count calculations to ensure accurate text analysis.
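The effect of the tatweel note can be sketched as follows. The `countWords` helper is a hypothetical stand-in written for this example, not a Kokokor export; it strips U+0640 (ARABIC TATWEEL) before splitting on whitespace.

```typescript
// U+0640 is the Arabic tatweel (kashida), a purely decorative elongation mark.
const TATWEEL_PATTERN = /\u0640/g;

// Hypothetical helper: count whitespace-separated words after removing tatweel.
function countWords(text: string): number {
    const cleaned = text.replace(TATWEEL_PATTERN, '').trim();
    return cleaned === '' ? 0 : cleaned.split(/\s+/).length;
}

// Two words, the first written with decorative elongation.
const stretched = '\u0628\u0633\u0640\u0640\u0645 \u0627\u0644\u0644\u0647';
console.log(countWords(stretched)); // 2

// A standalone run of tatweel does not count as a word.
console.log(countWords('\u0640\u0640\u0640')); // 0
```

Without the removal step, a decorative run of tatweel separated by spaces would be counted as an extra word.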
Advanced: Low-Level API
For more control, use the low-level API.
See Also
Poetry Documents: Arabic poetry detection with hemistichs
Simple OCR: Basic LTR text processing