TWML (Twitter Machine Learning)

TWML is one of X’s machine learning frameworks, built on TensorFlow v1. While largely deprecated, it remains in active use for training the Earlybird light ranking models that power X’s search-based candidate retrieval.
Legacy Framework: TWML is no longer under active development. Much of the codebase is out of date and unused. It is maintained specifically for light ranker model training.

Overview

TWML (Twitter Machine Learning) was X’s original machine learning framework, providing abstractions on top of TensorFlow to simplify model training and deployment. While most ML efforts have migrated to newer frameworks, TWML continues to serve a critical role in the recommendation pipeline.

Current Usage

TWML is exclusively used for training Earlybird light ranking models:

Light Ranker Training

Located in: src/python/twitter/deepbird/projects/timelines/scripts/models/earlybird/

The light ranker is a critical component that pre-filters candidates from the search index before heavy ranking.

Core Component: DataRecordTrainer

The DataRecordTrainer class contains the core training logic for TWML models:
# Simplified example from train.py
import tensorflow as tf

class DataRecordTrainer:
    """Core trainer for TWML models using DataRecord format"""
    
    def __init__(self, params):
        self.params = params
        self.model = self._build_model()
        self.optimizer = self._build_optimizer()
        
    def _build_model(self):
        """Constructs the TensorFlow graph for the model"""
        # Define input features from DataRecord
        features = self._parse_data_record()
        
        # Build model layers
        hidden = tf.layers.dense(
            features, 
            units=self.params.hidden_size,
            activation=tf.nn.relu
        )
        
        # Output layer for ranking score
        logits = tf.layers.dense(hidden, units=1)
        
        return logits
        
    def train(self, train_data, eval_data):
        """Training loop for the model"""
        for epoch in range(self.params.num_epochs):
            # Training step
            loss = self._train_epoch(train_data)
            
            # Evaluation
            metrics = self._evaluate(eval_data)
            
            # Checkpoint saving
            self._save_checkpoint(epoch, metrics)

Light Ranker Training Pipeline

The light ranker training process follows these steps:

1. Data Collection

Gather user engagement signals from production logs:
  • Tweet clicks
  • Video watch time
  • Favorites (likes)
  • Retweets
  • Quote tweets
  • Replies
These signals are stored in the DataRecord format.
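As a rough illustration of how these engagement signals might be collapsed into supervised labels for a DataRecord, consider the sketch below. The event names and label keys are hypothetical, not the actual production schema:

```python
# Hypothetical sketch: collapsing raw engagement events into per-example
# binary labels. Names are illustrative only.

ENGAGEMENT_LABELS = ("click", "favorite", "retweet", "reply")

def build_labels(events):
    """Map a list of observed engagement event names to binary labels."""
    observed = set(events)
    return {name: float(name in observed) for name in ENGAGEMENT_LABELS}
```

For example, `build_labels(["click", "favorite"])` marks the click and favorite labels as 1.0 and leaves retweet and reply at 0.0.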

2. Feature Engineering

Extract features from DataRecords:
# Example features for light ranker
features = {
    # User features
    'user_followers_count': user.followers,
    'user_reputation': user.tweepcred_score,
    
    # Tweet features  
    'tweet_age_seconds': now - tweet.created_at,
    'tweet_has_media': int(tweet.has_media),
    'tweet_has_url': int(tweet.has_url),
    
    # Engagement features
    'author_engagement_rate': author.avg_engagement,
    'tweet_early_engagement': tweet.engagement_1hr,
}

3. Model Training

Train using the DataRecordTrainer:
# Training command (simplified)
python train.py \
  --train_data_path=/path/to/train/data \
  --eval_data_path=/path/to/eval/data \
  --model_dir=/path/to/model/output \
  --num_epochs=10 \
  --learning_rate=0.001
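The flags in the command above could be wired up along these lines. This is a minimal argparse sketch with the flag names taken from the command, not the actual flag handling in train.py:

```python
import argparse

def parse_train_args(argv=None):
    """Parse the light ranker training flags shown above (illustrative)."""
    parser = argparse.ArgumentParser(description="Light ranker training")
    parser.add_argument("--train_data_path", required=True)
    parser.add_argument("--eval_data_path", required=True)
    parser.add_argument("--model_dir", required=True)
    parser.add_argument("--num_epochs", type=int, default=10)
    parser.add_argument("--learning_rate", type=float, default=0.001)
    return parser.parse_args(argv)
```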

4. Model Export

Export the trained model for serving in the Earlybird search index.

5. Deployment

Deploy the model to production Earlybird instances for real-time scoring.

DataRecord Format

TWML uses a custom DataRecord format for training data:
# DataRecord structure (conceptual)
from typing import Dict, List, Set

class DataRecord:
    """Container for training examples"""
    
    # Sparse binary features (feature ID present/absent)
    binary_features: Set[int]
    
    # Sparse continuous features (feature ID -> value)
    continuous_features: Dict[int, float]
    
    # Dense features (fixed-size vector)
    dense_features: List[float]
    
    # Labels for supervised learning
    labels: Dict[str, float]
Key characteristics:
  • Efficient storage of sparse features
  • Support for multi-task learning (multiple labels)
  • Optimized for TensorFlow v1 input pipelines
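To make the sparse layout concrete, here is an illustrative way to flatten a DataRecord's sparse features into a fixed-size dense vector using modulo hashing. This is a sketch, not the real twml API:

```python
# Illustrative only: project sparse DataRecord features into a dense vector.

def densify(binary_features, continuous_features, size):
    """Hash sparse feature IDs into a fixed-size dense vector.

    binary_features: set of feature IDs that are present (contribute 1.0)
    continuous_features: dict of feature ID -> value
    """
    vec = [0.0] * size
    for fid in binary_features:
        vec[fid % size] += 1.0
    for fid, value in continuous_features.items():
        vec[fid % size] += value
    return vec
```

Collisions (two IDs hashing to the same slot) simply sum, which is the usual trade-off of hashed feature spaces.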

Integration with Ranking Pipeline

TWML-trained light ranker models integrate into the recommendation pipeline:
# Train light ranker with TWML
trainer = DataRecordTrainer(params)
model = trainer.train(train_data, eval_data)
model.export_for_serving()

Light Ranker Model Architecture

The light ranker uses a relatively simple architecture optimized for low latency:
# Simplified light ranker architecture
import tensorflow as tf

def build_light_ranker(features, params):
    """
    Lightweight neural network for fast candidate scoring
    """
    # Input layer - sparse and dense features
    sparse_features = embed_sparse_features(
        features.binary_features,
        embedding_size=params.embedding_size
    )
    
    dense_features = features.dense_features
    
    # Concatenate all features
    combined = tf.concat([
        sparse_features,
        dense_features
    ], axis=1)
    
    # Hidden layers (typically 2-3 layers)
    hidden1 = tf.layers.dense(
        combined, 
        units=256, 
        activation=tf.nn.relu
    )
    
    hidden2 = tf.layers.dense(
        hidden1, 
        units=128, 
        activation=tf.nn.relu
    )
    
    # Output layer - ranking score
    logits = tf.layers.dense(hidden2, units=1)
    
    return logits

Training Objectives

The light ranker optimizes for multiple engagement signals:
Tweet Click Prediction

The primary label is whether the user clicked on a tweet:
# Binary classification loss
click_loss = tf.losses.sigmoid_cross_entropy(
    multi_class_labels=labels['click'],
    logits=model_output
)
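Because each DataRecord can carry multiple labels, per-task losses can be combined into one objective. A minimal pure-Python sketch: the numerically stable cross-entropy formula below mirrors what sigmoid cross-entropy with logits computes, while the task names and weights are hypothetical:

```python
import math

def sigmoid_cross_entropy(label, logit):
    """Numerically stable sigmoid cross-entropy for one example.

    Equivalent to -label*log(p) - (1-label)*log(1-p) with p = sigmoid(logit).
    """
    return max(logit, 0.0) - logit * label + math.log1p(math.exp(-abs(logit)))

def multi_task_loss(labels, logits, weights):
    """Weighted sum of per-task sigmoid cross-entropy losses."""
    return sum(
        weights[task] * sigmoid_cross_entropy(labels[task], logits[task])
        for task in weights
    )
```

At logit 0.0 each task contributes ln(2) ≈ 0.693 regardless of label, so the weights directly scale each task's share of the gradient.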

Key Features

DataRecord Format

Efficient sparse feature representation optimized for large-scale training

Multi-Task Learning

Support for multiple training objectives (clicks, engagement, watch time)

TensorFlow v1

Built on TensorFlow 1.x with familiar training APIs

Production Proven

Battle-tested on billions of training examples at X scale

Limitations

Deprecated Technology Stack
  • Built on TensorFlow v1 (no longer supported)
  • Limited model architecture flexibility
  • Lacks modern ML features (dynamic computation graphs, eager execution)
  • No active development or feature additions

Migration Path

For new models, X has migrated to modern frameworks:

1. Heavy Ranker

Uses PyTorch with more sophisticated architectures. See: the-algorithm-ml/projects/home/recap/

2. Other Models

New models use PyTorch, JAX, or TensorFlow 2.x

3. Light Ranker Migration

Eventually, light ranker training will migrate away from TWML

File Locations

# Main training script
src/python/twitter/deepbird/projects/timelines/scripts/models/earlybird/train.py

# Model configuration
src/python/twitter/deepbird/projects/timelines/scripts/models/earlybird/README.md

Learn More

Ranking Systems

Learn how light ranker fits into the overall ranking pipeline

Candidate Generation

Understand the candidate sourcing that feeds the light ranker
