data.table and integrates seamlessly with pandas, numpy, and pyarrow.
Quick Start
Install datatable and run your first data operations in minutes.
Core Concepts
Understand Frames, f-expressions, and the DT[i, j, by] syntax.
Working with Data
Filter, transform, group, join, and read/write data.
API Reference
Full API documentation for all classes and functions.
Get started in 3 steps
Why datatable?
Blazing fast
Native C++ implementation with multi-threaded processing. Sort, group, and join hundreds of millions of rows in seconds.
Big data support
Memory-mapped datasets let you work on files larger than RAM transparently, without loading everything into memory.
Expressive syntax
The
DT[i, j, by(...)] query syntax is concise and powerful — filter rows, select columns, and aggregate in a single expression.Pandas compatible
Convert to/from pandas DataFrames, numpy arrays, and pyarrow tables. Drop-in complement for your existing workflow.
Fast I/O
fread() automatically detects CSV format, handles compressed files, reads URLs, and is orders of magnitude faster than pandas read_csv.Built-in ML models
Includes FTRL online learning and LinearModel for classification and regression — no additional libraries required.
datatable requires Python 3.6+ (64-bit) and pip 20.3+. Pre-built wheels are available for macOS, Linux, and Windows.