
Overview

This guide covers practical techniques to optimize bun-scikit performance for your specific use cases. From choosing the right backend to data preparation strategies, these tips will help you get the most out of native Zig acceleration.

Choose the Right Backend

Tree Models: Zig vs JavaScript

bun-scikit offers both native Zig and optimized JavaScript backends for tree-based models.
Native Zig backend
Best for:
  • Medium to large datasets (>1000 samples)
  • Deep trees (maxDepth > 5)
  • Random forests with many estimators (nEstimators > 50)
  • Production workloads prioritizing throughput
Performance characteristics:
  • DecisionTree fit: 1.82x faster than JS
  • RandomForest fit: 2.65x faster than JS
  • RandomForest predict: 2.26x faster than JS
Enable (default):
export BUN_SCIKIT_TREE_BACKEND=zig
JavaScript backend
Best for:
  • Small datasets (<500 samples)
  • Shallow trees (maxDepth ≤ 3)
  • Single decision trees (lower overhead)
  • Development without building native code
Performance characteristics:
  • DecisionTree predict: 1.6x faster than Zig on small data
  • Lower FFI overhead for simple models
  • Faster startup time
Enable:
export BUN_SCIKIT_TREE_BACKEND=js
Benchmark both backends with your specific data. The optimal choice depends on dataset size, tree depth, and whether you’re optimizing for training or inference.

Linear Models: Native Required

LinearRegression and LogisticRegression require native Zig kernels:
import { LinearRegression, LogisticRegression } from "bun-scikit";

// Both require native kernels
const linear = new LinearRegression({ solver: "normal" });
const logistic = new LogisticRegression({ solver: "gd" });

linear.fit(X, y);  // Throws if kernels unavailable
If native kernels are missing, these models will throw an error during fit(). Run bun run native:build to compile kernels locally.

Data Preparation

Use Typed Arrays

Native kernels work directly with typed arrays for zero-copy data transfer:
// Good: Native typed arrays
const X = new Float64Array([1.0, 2.0, 3.0, 4.0]);
const y = new Float64Array([2.0, 4.0, 6.0, 8.0]);

// Avoid: Regular arrays (require conversion)
const X = [[1.0], [2.0], [3.0], [4.0]];
const y = [2.0, 4.0, 6.0, 8.0];
bun-scikit automatically converts regular arrays to typed arrays when needed, but pre-converting your data eliminates this overhead.

Contiguous Memory Layout

Ensure data is contiguous for optimal native performance:
// Contiguous row-major layout (optimal)
const X = new Float64Array([
  1.0, 2.0,  // sample 0
  3.0, 4.0,  // sample 1
  5.0, 6.0,  // sample 2
]);

// Access pattern: X[row * n_features + col]
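
If your data starts out as nested arrays, a small helper can flatten it into the row-major layout above. This is a sketch only; `toContiguous` is not part of bun-scikit:

```typescript
// Flatten a nested number[][] into a contiguous row-major Float64Array.
// Hypothetical helper, not a bun-scikit API.
function toContiguous(rows: number[][]): { data: Float64Array; nFeatures: number } {
  const nFeatures = rows.length > 0 ? rows[0].length : 0;
  const data = new Float64Array(rows.length * nFeatures);
  for (let i = 0; i < rows.length; i++) {
    data.set(rows[i], i * nFeatures); // copy one sample per row
  }
  return { data, nFeatures };
}

const { data: X, nFeatures } = toContiguous([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]);
// X[1 * nFeatures + 0] === 3.0 — sample 1, feature 0
```

Doing this once up front keeps every subsequent fit and predict call on the zero-copy path.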

Batch Predictions

Predict on multiple samples at once to amortize overhead:
import { RandomForestClassifier } from "bun-scikit";

const forest = new RandomForestClassifier();
forest.fit(XTrain, yTrain);

// Good: Batch prediction
const predictions = forest.predict(XTest); // All test samples

// Avoid: Individual predictions in loop
const predictions = [];
for (let i = 0; i < XTest.length; i++) {
  predictions.push(forest.predict([XTest[i]])); // Repeated overhead
}

Model Configuration

Random Forests: Tune Estimators

Balance accuracy and speed by adjusting nEstimators:
Small data:
const forest = new RandomForestClassifier({
  nEstimators: 50,  // Sufficient for small data
  maxDepth: 8,
  randomState: 42,
});
  • 50 estimators provide good accuracy
  • Training time: ~15-30ms
  • Diminishing returns beyond 100 estimators
Medium data:
const forest = new RandomForestClassifier({
  nEstimators: 100,  // Better accuracy, acceptable speed
  maxDepth: 10,
  randomState: 42,
});
  • 100 estimators balance accuracy and speed
  • Training time: ~50-200ms
  • Consider parallel processing for larger values
Large data:
const forest = new RandomForestClassifier({
  nEstimators: 200,  // Maximum accuracy
  maxDepth: 15,
  randomState: 42,
  nJobs: -1,  // Use all cores
});
  • 200+ estimators for maximum accuracy
  • Native Zig backend essential (6.4x speedup)
  • Training time: varies widely by data size

Decision Trees: Limit Depth

Control tree complexity to prevent overfitting and improve speed:
const tree = new DecisionTreeClassifier({
  maxDepth: 8,  // Reasonable default
  minSamplesSplit: 10,  // Prevent tiny splits
  minSamplesLeaf: 5,    // Ensure leaf stability
});
Depth vs Performance:
  • maxDepth: 5 - Very fast, risk underfitting
  • maxDepth: 8 - Good balance (recommended)
  • maxDepth: 15 - High accuracy, slower, risk overfitting
  • maxDepth: null - Unlimited (use with caution)

Logistic Regression: Solver Selection

Choose the appropriate solver for your data:
// Gradient descent (native Zig, good for large data)
const logistic = new LogisticRegression({
  solver: "gd",
  learningRate: 0.8,
  maxIter: 100,
  tolerance: 1e-5,
});

// L-BFGS (good for small to medium data)
const logistic = new LogisticRegression({
  solver: "lbfgs",
  maxIter: 100,
});
Native Zig gradient descent (solver: "gd") provides a 2.5x speedup over Python scikit-learn. Use it for datasets with >1000 samples.

Preprocessing Optimization

Reuse Scalers

Fit scalers once, transform multiple times:
import { StandardScaler } from "bun-scikit";

// Fit once on training data
const scaler = new StandardScaler();
scaler.fit(XTrain);

// Transform multiple datasets
const XTrainScaled = scaler.transform(XTrain);
const XTestScaled = scaler.transform(XTest);
const XNewScaled = scaler.transform(XNew);

// Avoid: Refitting scaler repeatedly
// const XTestScaled = new StandardScaler().fitTransform(XTest); // Wrong!

Pipeline for Efficiency

Combine preprocessing and modeling in pipelines:
import { Pipeline, StandardScaler, LogisticRegression } from "bun-scikit";

const pipeline = new Pipeline([
  ["scaler", new StandardScaler()],
  ["classifier", new LogisticRegression({ solver: "gd" })],
]);

// Single fit call handles both steps
pipeline.fit(XTrain, yTrain);
const predictions = pipeline.predict(XTest);
Benefits:
  • Automatic data flow between steps
  • No intermediate array allocations
  • Cleaner code, fewer errors

Memory Management

Avoid Memory Leaks

Models with native handles are cleaned up automatically, but be aware of their lifecycle:
import { LinearRegression } from "bun-scikit";

function trainModel(X, y) {
  const model = new LinearRegression({ solver: "normal" });
  model.fit(X, y);
  return model;
}  // Model handle still valid

const model = trainModel(X, y);
// Use model...
// When done, model is garbage collected and native handle destroyed
Don’t manually manage model handles. bun-scikit automatically calls *_model_destroy() when models are garbage collected.

Large Dataset Strategies

For datasets that strain memory:
import { SGDClassifier } from "bun-scikit";

const model = new SGDClassifier();

// Process data in chunks
for (const [XChunk, yChunk] of dataBatches) {
  model.partialFit(XChunk, yChunk);
}
Models supporting partialFit:
  • SGDClassifier
  • SGDRegressor
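
The `dataBatches` iterable in the example above is left to you. One hedged sketch, using zero-copy `subarray` views over contiguous typed arrays (`dataBatches` is not shipped by bun-scikit):

```typescript
// Yield [XChunk, yChunk] pairs as zero-copy views into contiguous arrays.
// Sketch only — define this yourself to feed partialFit.
function* dataBatches(
  X: Float64Array,
  y: Float64Array,
  nFeatures: number,
  batchSize: number,
): Generator<[Float64Array, Float64Array]> {
  const nSamples = y.length;
  for (let start = 0; start < nSamples; start += batchSize) {
    const end = Math.min(start + batchSize, nSamples);
    yield [
      X.subarray(start * nFeatures, end * nFeatures), // one batch of rows
      y.subarray(start, end),                         // matching targets
    ];
  }
}
```

Because `subarray` shares the underlying buffer, batching adds no copies on top of the original arrays.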
import { trainTestSplit } from "bun-scikit";

// Work with subset during development
const { XSample, ySample } = trainTestSplit(X, y, {
  testSize: 0.9,  // Keep only 10% for quick iteration
  randomState: 42,
});

model.fit(XSample, ySample);

Benchmarking Your Code

Measure What Matters

Use precise timing for optimization decisions:
const start = performance.now();
model.fit(XTrain, yTrain);
const fitTime = performance.now() - start;

console.log(`Fit time: ${fitTime.toFixed(2)}ms`);

const predStart = performance.now();
const predictions = model.predict(XTest);
const predTime = performance.now() - predStart;

console.log(`Predict time: ${predTime.toFixed(2)}ms`);
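
The pattern above can be wrapped in a small reusable helper. A sketch, not a bun-scikit API:

```typescript
// Time a synchronous call, log elapsed milliseconds, and return its result.
function timeIt<T>(label: string, fn: () => T): T {
  const start = performance.now();
  const result = fn();
  const elapsed = performance.now() - start;
  console.log(`${label}: ${elapsed.toFixed(2)}ms`);
  return result;
}

// timeIt("fit", () => model.fit(XTrain, yTrain));
// const predictions = timeIt("predict", () => model.predict(XTest));
```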

Compare Backends

Test both Zig and JS backends with your data:
import { RandomForestClassifier } from "bun-scikit";

// Test Zig backend
process.env.BUN_SCIKIT_TREE_BACKEND = "zig";
const forestZig = new RandomForestClassifier({ nEstimators: 80, maxDepth: 8 });
const zigStart = performance.now();
forestZig.fit(XTrain, yTrain);
const zigTime = performance.now() - zigStart;

// Test JS backend
process.env.BUN_SCIKIT_TREE_BACKEND = "js";
const forestJS = new RandomForestClassifier({ nEstimators: 80, maxDepth: 8 });
const jsStart = performance.now();
forestJS.fit(XTrain, yTrain);
const jsTime = performance.now() - jsStart;

console.log(`Zig: ${zigTime.toFixed(2)}ms, JS: ${jsTime.toFixed(2)}ms`);
console.log(`Speedup: ${(jsTime / zigTime).toFixed(2)}x`);

Run Official Benchmarks

Compare your results against CI benchmarks:
# Full benchmark suite
bun run bench

# Hot-path synthetic benchmarks
bun run bench:hotpaths

# Against Python scikit-learn
bun run bench:ci

Production Deployment

Verify Native Acceleration

Confirm native kernels are active in production:
import { DecisionTreeClassifier } from "bun-scikit";

const tree = new DecisionTreeClassifier();
tree.fit(X, y);

if (tree.fitBackend_ !== "zig") {
  console.warn("Native acceleration not active!");
  console.warn("Expected: zig, Got:", tree.fitBackend_);
}

console.log("Native library:", tree.fitBackendLibrary_);
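
If you would rather fail hard at startup than log a warning, the check can be factored into a guard. A sketch that relies only on the `fitBackend_` field shown above:

```typescript
// Throw if a fitted model did not use the native Zig backend.
// Hypothetical guard, not part of bun-scikit.
function assertZigBackend(model: { fitBackend_?: string }): void {
  if (model.fitBackend_ !== "zig") {
    throw new Error(
      `Native acceleration not active: expected "zig", got "${model.fitBackend_}"`,
    );
  }
}

// assertZigBackend(tree); // call once after the first fit in production
```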

Environment Configuration

Optimal production settings:
# Use Zig backend (default, but explicit is better)
export BUN_SCIKIT_TREE_BACKEND=zig

# Let runtime choose best bridge (default)
# export BUN_SCIKIT_NATIVE_BRIDGE=node-api  # Only if needed

# Don't disable Zig in production
# export BUN_SCIKIT_ENABLE_ZIG=0  # Development only

Model Serialization

Save trained models to avoid refitting:
import { RandomForestClassifier } from "bun-scikit";
import { writeFileSync, readFileSync } from "fs";

// Train once
const forest = new RandomForestClassifier();
forest.fit(XTrain, yTrain);

// Save model (serialize parameters)
const modelData = {
  params: forest.getParams(),
  // Save internal state as needed
};
writeFileSync("model.json", JSON.stringify(modelData));

// Load model later
const loaded = JSON.parse(readFileSync("model.json", "utf-8"));
const restoredForest = new RandomForestClassifier(loaded.params);
// Restore state...
Model serialization APIs are under development; the current approach requires manual state management.
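
Until a first-class serialization API lands, a quick sanity check that your params survive a JSON round trip can catch non-serializable values early. This sketch assumes `getParams()` returns a plain object, as in the example above; the helper itself is hypothetical:

```typescript
// Verify a params object survives JSON.stringify/JSON.parse without loss.
// Keys holding functions or undefined are silently dropped by JSON, so
// compare key sets as well as values.
function jsonRoundTripsCleanly(params: Record<string, unknown>): boolean {
  const restored = JSON.parse(JSON.stringify(params));
  const keys = Object.keys(params);
  return (
    keys.length === Object.keys(restored).length &&
    keys.every((k) => JSON.stringify(restored[k]) === JSON.stringify(params[k]))
  );
}

// jsonRoundTripsCleanly({ nEstimators: 100, maxDepth: 10 }); // → true
// jsonRoundTripsCleanly({ rng: () => 0.5 });                 // → false
```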

Performance Checklist

  • Use Float64Array for features and targets
  • Ensure contiguous memory layout
  • Batch predictions instead of looping
  • Reuse fitted scalers for multiple transforms
  • Consider pipelines for multi-step workflows
  • Choose appropriate nEstimators for dataset size
  • Set reasonable maxDepth to prevent overfitting
  • Use native Zig backend for medium/large data
  • Select optimal solver for linear models
  • Tune hyperparameters with cross-validation
  • Verify native kernels are active (fitBackend_)
  • Use prebuilt binaries (Linux/Windows)
  • Build locally on macOS (bun run native:build)
  • Set BUN_SCIKIT_TREE_BACKEND=zig explicitly
  • Benchmark both backends with your data
  • Confirm native acceleration in production logs
  • Use appropriate environment variables
  • Monitor fit/predict times
  • Cache trained models when possible
  • Test performance regression in CI

Next Steps

Benchmarks

Review detailed performance comparisons

Native Runtime

Deep dive into Zig acceleration
