Rate Limit Handling

JDA provides sophisticated rate limit handling to ensure your bot respects Discord’s API rate limits. This guide covers advanced rate limit configuration and custom implementation.

Understanding Rate Limits

Discord uses several types of rate limits:

Per-route rate limits: Apply to specific API endpoints
Global rate limits: Apply to the entire bot token
Cloudflare rate limits: Apply to the IP address

Rate Limit Headers

JDA reads the following headers from Discord’s API responses:

X-RateLimit-Reset-After

string

Total time (in seconds) until the current rate limit bucket resets. Can have decimals for millisecond precision.

X-RateLimit-Reset

string

Epoch time (seconds since January 1, 1970 UTC) at which the rate limit resets.

X-RateLimit-Limit

string

The number of requests that can be made to this endpoint.

X-RateLimit-Remaining

string

The number of remaining requests that can be made.

X-RateLimit-Bucket

string

A unique string denoting the rate limit being encountered (non-inclusive of top-level resources in the path).

X-RateLimit-Scope

string

Returned only on HTTP 429 responses. Value can be user, global, or shared.

X-RateLimit-Global

boolean

Returned only on HTTP 429 responses if the rate limit encountered is the global rate limit.

Retry-After

string

The number of seconds to wait before submitting another request.

Custom Rate Limiter

You can implement a custom rate limiter by implementing the RestRateLimiter interface:

import net.dv8tion.jda.api.requests.RestRateLimiter;
import net.dv8tion.jda.api.requests.Route;
import okhttp3.Response;

public class CustomRateLimiter implements RestRateLimiter {
    private final RateLimitConfig config;
    private volatile boolean stopped = false;
    
    public CustomRateLimiter(RateLimitConfig config) {
        this.config = config;
    }
    
    @Override
    public void enqueue(Work task) {
        if (stopped || task.isSkipped()) {
            return;
        }
        
        // Get the route for bucket identification
        Route.CompiledRoute route = task.getRoute();
        
        // Execute the request
        config.getElastic().execute(() -> {
            Response response = task.execute();
            if (response != null) {
                // Handle rate limit headers
                handleRateLimitHeaders(response, route);
            }
        });
    }
    
    private void handleRateLimitHeaders(Response response, Route.CompiledRoute route) {
        String bucket = response.header(HASH_HEADER);
        String remaining = response.header(REMAINING_HEADER);
        String limit = response.header(LIMIT_HEADER);
        String resetAfter = response.header(RESET_AFTER_HEADER);
        
        // Implement your rate limit tracking logic here
    }
    
    @Override
    public void stop(boolean shutdown, Runnable callback) {
        stopped = true;
        callback.run();
    }
    
    @Override
    public boolean isStopped() {
        return stopped;
    }
    
    @Override
    public int cancelRequests() {
        // Cancel all non-priority requests
        return 0;
    }
}

Rate Limit Configuration

The RateLimitConfig class provides configuration options for the rate limiter:

scheduler

ScheduledExecutorService

required

The scheduler used to schedule rate-limit tasks and delayed executions.

elastic

ExecutorService

required

The elastic pool used to execute blocking HTTP requests. Can scale up and down based on demand.

globalRateLimit

GlobalRateLimit

required

The global rate-limit store, which can be shared between multiple JDA instances.

isRelative

boolean

required

Whether to use X-RateLimit-Reset-After (relative) instead of X-RateLimit-Reset (absolute). Recommended to avoid NTP sync issues.

Example Configuration

import net.dv8tion.jda.api.requests.RestRateLimiter;
import java.util.concurrent.*;

public class RateLimitExample {
    public void configure() {
        // Create custom thread pools
        ScheduledExecutorService scheduler = 
            Executors.newScheduledThreadPool(2);
        ExecutorService elastic = 
            Executors.newCachedThreadPool();
        
        // Create shared global rate limit store
        RestRateLimiter.GlobalRateLimit globalRateLimit = 
            RestRateLimiter.GlobalRateLimit.create();
        
        // Configure rate limiter
        RestRateLimiter.RateLimitConfig config = 
            new RestRateLimiter.RateLimitConfig(
                scheduler,
                elastic,
                globalRateLimit,
                true // Use relative timing
            );
    }
}

Shared Global Rate Limits

When running multiple bot instances (sharding), you can share global rate limit information:

import net.dv8tion.jda.api.JDABuilder;
import net.dv8tion.jda.api.requests.RestRateLimiter;
import net.dv8tion.jda.api.requests.RestConfig;

public class SharedRateLimitExample {
    public void createShards() {
        // Create shared global rate limit store
        RestRateLimiter.GlobalRateLimit sharedGlobal = 
            RestRateLimiter.GlobalRateLimit.create();
        
        // Configure all shards to use the same global rate limit
        for (int i = 0; i < 4; i++) {
            RestConfig restConfig = new RestConfig()
                .setGlobalRateLimit(sharedGlobal);
            
            JDABuilder.createDefault(token)
                .useSharding(i, 4)
                .setRestConfig(restConfig)
                .build();
        }
    }
}

When implementing a custom rate limiter, ensure you properly handle the X-RateLimit-Global header to avoid hitting Discord’s global rate limits, which can affect all your bot’s operations.

Priority Requests

The Work interface includes priority detection:

public void handleWork(RestRateLimiter.Work task) {
    if (task.isPriority()) {
        // Handle priority requests immediately
        // These should not be cancelled
    } else {
        // Regular requests can be queued or cancelled
    }
}

Best Practices

Use relative timing: Set isRelative to true to avoid clock synchronization issues
Share global limits: When sharding, share the GlobalRateLimit instance across all shards
Respect retry headers: Always honor the Retry-After header on 429 responses
Monitor bucket hashes: Use the X-RateLimit-Bucket header to properly identify unique rate limit buckets
Don’t cancel priority requests: Requests marked as priority should never be cancelled

Troubleshooting

Hitting Rate Limits Frequently

If you’re consistently hitting rate limits:

Review your request patterns for inefficient API usage
Implement caching to reduce redundant requests
Use bulk operations where possible (e.g., bulk delete messages)
Consider implementing request queuing with backoff

Clock Synchronization Issues

If you experience issues with absolute timestamps:

Enable relative timing in your RateLimitConfig
Ensure your server’s clock is synchronized with NTP
Use X-RateLimit-Reset-After instead of X-RateLimit-Reset

Getting Started

Core Concepts

Guides

Advanced Topics

Migration & Troubleshooting

Understanding Rate Limits

Rate Limit Headers

Custom Rate Limiter

Rate Limit Configuration

Example Configuration

Shared Global Rate Limits

Priority Requests

Best Practices

Troubleshooting

Hitting Rate Limits Frequently

Clock Synchronization Issues

Build docs developers (and LLMs) love

Getting Started

Core Concepts

Guides

Advanced Topics

Migration & Troubleshooting

Documentation Index

​Understanding Rate Limits

​Rate Limit Headers

​Custom Rate Limiter

​Rate Limit Configuration

​Example Configuration

​Shared Global Rate Limits

​Priority Requests

​Best Practices

​Troubleshooting

​Hitting Rate Limits Frequently

​Clock Synchronization Issues

Build docs developers (and LLMs) love

Understanding Rate Limits

Rate Limit Headers

Custom Rate Limiter

Rate Limit Configuration

Example Configuration

Shared Global Rate Limits

Priority Requests

Best Practices

Troubleshooting

Hitting Rate Limits Frequently

Clock Synchronization Issues