Rate Limiting Algorithms: A Deep Dive

Introduction
Rate limiting is a technique used to control the number of requests a system processes within a specific time frame. It helps prevent abuse, ensures fair usage, protects against DDoS attacks, and maintains system stability.
This blog explores the most commonly used rate-limiting algorithms, their advantages and disadvantages, and how to implement them in Java.
1️⃣ Token Bucket Algorithm
How It Works
- A bucket holds a fixed number of tokens.
- Tokens are added to the bucket at a constant rate.
- Each request consumes a token.
- If the bucket is empty, excess requests are denied until new tokens are added.
Pros & Cons
✅ Allows short bursts of traffic while controlling overall request rate.
✅ Efficient for distributed systems.
❌ If the bucket drains quickly, requests may be blocked until tokens are refilled.
Java Implementation
import java.util.concurrent.Semaphore;
import java.util.concurrent.TimeUnit;

class TokenBucketRateLimiter {
    private final Semaphore tokens;
    private final int capacity;

    public TokenBucketRateLimiter(int capacity, int refillRatePerSecond) {
        this.capacity = capacity;
        this.tokens = new Semaphore(capacity);
        // Refill tokens once per second on a background daemon thread.
        Thread refiller = new Thread(() -> {
            while (true) {
                tokens.release(refillRatePerSecond);
                // Never let the available tokens exceed the bucket's capacity.
                if (tokens.availablePermits() > capacity) {
                    tokens.drainPermits();
                    tokens.release(capacity);
                }
                try {
                    TimeUnit.SECONDS.sleep(1);
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    break; // stop refilling if the thread is interrupted
                }
            }
        });
        refiller.setDaemon(true);
        refiller.start();
    }

    public boolean allowRequest() {
        // Consume one token if available; otherwise reject the request.
        return tokens.tryAcquire();
    }
}
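For example (a quick usage sketch; the class name TokenBucketDemo and the parameter values are illustrative, not from the original post): with a capacity of 10 and a refill rate of 5 tokens per second, the first 10 calls succeed immediately and later calls succeed only as tokens are refilled.
public class TokenBucketDemo {
    public static void main(String[] args) {
        // Allow bursts of up to 10 requests, refilled at 5 tokens per second.
        TokenBucketRateLimiter limiter = new TokenBucketRateLimiter(10, 5);
        for (int i = 0; i < 12; i++) {
            System.out.println("Request " + i + " allowed? " + limiter.allowRequest());
        }
    }
}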
2️⃣ Leaky Bucket Algorithm
How It Works
- Requests enter a queue (bucket).
- Requests are processed at a fixed rate (like water leaking from a bucket).
- If the queue overflows, extra requests are discarded.
Pros & Cons
✅ Ensures a steady flow of requests.
✅ Prevents sudden traffic spikes from overloading the system.
❌ Can introduce delays if the queue is full.
Java Implementation
import java.util.LinkedList;
import java.util.Queue;

class LeakyBucketRateLimiter {
    private final Queue<Long> queue;       // pending requests (the bucket)
    private final int capacity;
    private final long leakIntervalMillis; // time for one request to leak out
    private long lastLeakTime;

    public LeakyBucketRateLimiter(int capacity, int leakRatePerSecond) {
        this.capacity = capacity;
        this.leakIntervalMillis = 1000L / leakRatePerSecond;
        this.queue = new LinkedList<>();
        this.lastLeakTime = System.currentTimeMillis();
    }

    public synchronized boolean allowRequest() {
        long now = System.currentTimeMillis();
        // Leak at the fixed rate: remove one queued request per elapsed leak interval.
        long leaks = (now - lastLeakTime) / leakIntervalMillis;
        for (long i = 0; i < leaks && !queue.isEmpty(); i++) {
            queue.poll();
        }
        if (leaks > 0) {
            lastLeakTime += leaks * leakIntervalMillis;
        }
        if (queue.size() < capacity) {
            queue.add(now); // accept the request into the bucket
            return true;
        }
        return false; // bucket is full, reject
    }
}
3️⃣ Fixed Window Counter Algorithm
How It Works
- A counter tracks the number of requests per fixed time window (e.g., per minute).
- If the count exceeds the allowed limit, further requests are rejected until the next window.
Pros & Cons
✅ Easy to implement.
✅ Works well for predictable traffic patterns.
❌ Can allow bursts of up to twice the limit around window boundaries (requests bunched at the end of one window and the start of the next).
Java Implementation
import java.util.concurrent.atomic.AtomicInteger;

class FixedWindowRateLimiter {
    private final int limit;
    private final long windowSizeMillis;
    private long windowStart;
    private final AtomicInteger requestCount;

    public FixedWindowRateLimiter(int limit, int windowSizeSeconds) {
        this.limit = limit;
        this.windowSizeMillis = windowSizeSeconds * 1000L;
        this.windowStart = System.currentTimeMillis();
        this.requestCount = new AtomicInteger(0);
    }

    public synchronized boolean allowRequest() {
        long now = System.currentTimeMillis();
        // Start a fresh window once the current one has expired.
        if (now - windowStart >= windowSizeMillis) {
            windowStart = now;
            requestCount.set(0);
        }
        return requestCount.incrementAndGet() <= limit;
    }
}
4️⃣ Sliding Window Counter Algorithm
How It Works
- Uses smaller sub-windows within a fixed window.
- Gives a more accurate picture of the request rate and spreads allowed requests more evenly across window boundaries.
Pros & Cons
✅ More accurate than Fixed Window Counter.
✅ Reduces request bursts at window boundaries.
❌ More complex to implement.
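Java Implementation
The original post does not include code for this algorithm, so the following is a minimal sketch of one common variant: it keeps just two sub-windows (the current and the previous fixed window) and weights the previous window's count by how much of it still overlaps the sliding window. The class and field names are illustrative.
class SlidingWindowCounterRateLimiter {
    private final int limit;
    private final long windowSizeMillis;
    private long currentWindowStart;
    private int currentCount;
    private int previousCount;

    public SlidingWindowCounterRateLimiter(int limit, int windowSizeSeconds) {
        this.limit = limit;
        this.windowSizeMillis = windowSizeSeconds * 1000L;
        this.currentWindowStart = System.currentTimeMillis();
    }

    public synchronized boolean allowRequest() {
        long now = System.currentTimeMillis();
        long elapsed = now - currentWindowStart;
        if (elapsed >= windowSizeMillis) {
            // Roll over to a new fixed window; if more than one full window has
            // passed, the previous window's count is effectively zero.
            previousCount = (elapsed < 2 * windowSizeMillis) ? currentCount : 0;
            currentCount = 0;
            currentWindowStart = now - (elapsed % windowSizeMillis);
            elapsed = now - currentWindowStart;
        }
        // Estimate the sliding-window count by weighting the previous window's
        // count by the fraction of it that still overlaps the sliding window.
        double previousWeight = 1.0 - (double) elapsed / windowSizeMillis;
        double estimated = previousCount * previousWeight + currentCount + 1;
        if (estimated <= limit) {
            currentCount++;
            return true;
        }
        return false;
    }
}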
5️⃣ Sliding Window Log Algorithm
How It Works
- Stores timestamps of each request.
- Removes timestamps outside the allowed time window.
Pros & Cons
✅ High accuracy in rate limiting.
❌ High memory usage due to storing request timestamps.
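Java Implementation
A minimal sketch of this approach (names are illustrative, not from the original post): each accepted request's timestamp is appended to a deque, and timestamps that have fallen outside the window are dropped before the request count is checked.
import java.util.ArrayDeque;
import java.util.Deque;

class SlidingWindowLogRateLimiter {
    private final int limit;
    private final long windowSizeMillis;
    private final Deque<Long> requestLog = new ArrayDeque<>();

    public SlidingWindowLogRateLimiter(int limit, int windowSizeSeconds) {
        this.limit = limit;
        this.windowSizeMillis = windowSizeSeconds * 1000L;
    }

    public synchronized boolean allowRequest() {
        long now = System.currentTimeMillis();
        // Drop timestamps that are no longer inside the sliding window.
        while (!requestLog.isEmpty() && requestLog.peekFirst() <= now - windowSizeMillis) {
            requestLog.pollFirst();
        }
        if (requestLog.size() < limit) {
            requestLog.addLast(now); // record this request's timestamp
            return true;
        }
        return false;
    }
}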
6️⃣ Adaptive Rate Limiting
How It Works
- Uses machine learning or heuristics to adjust rate limits dynamically.
- Can consider factors like server load, request patterns, and user behavior.
Pros & Cons
✅ Dynamically adjusts limits for better efficiency.
❌ Complex implementation.
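Java Implementation
The post describes this technique only at a high level, so the sketch below is purely illustrative rather than the author's design: a simple heuristic that shrinks a per-second limit as a caller-supplied load metric (for example, CPU utilization between 0.0 and 1.0) rises. The loadSupplier parameter and the linear scaling rule are assumptions made for the example.
import java.util.function.DoubleSupplier;

class AdaptiveRateLimiter {
    private final int baseLimitPerSecond;
    private final DoubleSupplier loadSupplier; // e.g. current CPU utilization in [0.0, 1.0]
    private long windowStart = System.currentTimeMillis();
    private int requestCount;

    public AdaptiveRateLimiter(int baseLimitPerSecond, DoubleSupplier loadSupplier) {
        this.baseLimitPerSecond = baseLimitPerSecond;
        this.loadSupplier = loadSupplier;
    }

    public synchronized boolean allowRequest() {
        long now = System.currentTimeMillis();
        // Count requests in a simple one-second fixed window.
        if (now - windowStart >= 1000) {
            windowStart = now;
            requestCount = 0;
        }
        // Shrink the effective limit linearly as load approaches 1.0, but never below 1.
        double load = Math.min(1.0, Math.max(0.0, loadSupplier.getAsDouble()));
        int effectiveLimit = Math.max(1, (int) Math.round(baseLimitPerSecond * (1.0 - load)));
        return ++requestCount <= effectiveLimit;
    }
}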