This project implements a distributed, high-performance API rate limiter designed to protect backend services from traffic bursts and ensure fair usage. Built with a modern Java stack, it leverages Redis for high-speed, distributed state management and employs the Token Bucket algorithm for scalable and efficient request throttling.
The system is architected as a set of independent microservices orchestrated with Docker, featuring a central API Gateway that seamlessly integrates the rate-limiting logic.
The architecture consists of four main components that work together to process and rate-limit incoming requests:
- API Gateway: The single entry point for all client requests. It forwards traffic to the Rate Limiter Service for authorization before proxying it to the appropriate backend service.
- Rate Limiter Service: The core of the system. It applies the token bucket algorithm using a Redis-backed Lua script to ensure atomic operations, checking if an incoming request from a specific IP address is within the defined limits.
- Product Service: A mock backend service representing a protected resource. It only receives requests that have been successfully authorized by the Rate Limiter Service.
- Redis: An in-memory data store used to maintain the state of the token buckets for each client IP address in a distributed and highly available manner.
```mermaid
graph TD
    subgraph "Client"
        A[User Request]
    end
    subgraph "System Boundary"
        A --> B(API Gateway);
        B --> C{Rate Limiter Service};
        C -- Allowed --> D[Product Service];
        C -- Denied --> B;
        B -- 429 Too Many Requests --> A;
        D -- 200 OK with Data --> C;
        C --> B;
        B -- 200 OK with Data --> A;
        C <--> E[(Redis)];
    end

    style B fill:#26A69A,stroke:#004D40,stroke-width:2px,color:#fff
    style C fill:#5C6BC0,stroke:#1A237E,stroke-width:2px,color:#fff
    style D fill:#66BB6A,stroke:#1B5E20,stroke-width:2px,color:#fff
    style E fill:#EF5350,stroke:#B71C1C,stroke-width:2px,color:#fff
```
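To make the flow in the diagram concrete, here is a toy end-to-end sketch in plain Java. All class and method names are illustrative (the real components are separate services communicating over HTTP), and a simple per-IP counter stands in for the token bucket for brevity:

```java
import java.util.HashMap;
import java.util.Map;

/** Toy sketch of the request flow: gateway -> rate limiter -> product service.
 *  Names are illustrative; a plain counter stands in for the token bucket. */
public class FlowSketch {
    static final int LIMIT = 10;
    static final Map<String, Integer> used = new HashMap<>();

    // Rate Limiter Service: is this client still within its limit?
    static boolean allow(String clientIp) {
        int count = used.merge(clientIp, 1, Integer::sum);
        return count <= LIMIT;
    }

    // Product Service: the protected backend resource.
    static String products() { return "200 OK [product list]"; }

    // API Gateway: consults the limiter before proxying the request.
    static String handle(String clientIp) {
        return allow(clientIp) ? products() : "429 Too Many Requests";
    }

    public static void main(String[] args) {
        for (int i = 1; i <= 12; i++) {
            System.out.println("request " + i + ": " + handle("203.0.113.7"));
        }
        // requests 1-10 return 200 OK, requests 11-12 return 429
    }
}
```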
- Microservices Architecture: Decoupled services (Gateway, Rate Limiter, Product Service) for independent scaling and development.
- Token Bucket Algorithm: Implemented for efficient and flexible rate limiting that allows for bursts of traffic.
- Distributed & Scalable: Uses Redis as a centralized, high-speed data store, ensuring consistent rate limiting across multiple service instances.
- Atomic Operations: Employs a Redis Lua script to ensure that checking and consuming tokens is an atomic operation, preventing race conditions.
- Resilient & Fault-Tolerant: Integrates Resilience4j for Circuit Breaker and Retry patterns, ensuring the system can handle Redis connection failures gracefully.
- Containerized & Composed: Fully containerized with Docker and orchestrated with Docker Compose for easy, reproducible deployments.
- Automated CI/CD: Features GitHub Actions for continuous integration, automating the build, unit testing, and performance testing for every pull request.
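As a rough illustration of the token bucket algorithm listed above, here is a minimal single-node sketch in plain Java. The real service keeps this state in Redis and performs the check-and-consume step atomically in a Lua script; the class and method names here are illustrative, not taken from the repository:

```java
/** Minimal single-node token bucket sketch. The real service stores this
 *  state in Redis and updates it atomically via a Lua script. */
public class TokenBucket {
    private final long capacity;          // max tokens the bucket can hold
    private final double refillPerSecond; // tokens added per second
    private double tokens;
    private long lastRefillNanos;

    public TokenBucket(long capacity, double refillPerSecond) {
        this.capacity = capacity;
        this.refillPerSecond = refillPerSecond;
        this.tokens = capacity;           // start full, allowing an initial burst
        this.lastRefillNanos = System.nanoTime();
    }

    public synchronized boolean tryConsume() {
        // Lazily refill based on elapsed time since the last check.
        long now = System.nanoTime();
        double elapsedSeconds = (now - lastRefillNanos) / 1_000_000_000.0;
        tokens = Math.min(capacity, tokens + elapsedSeconds * refillPerSecond);
        lastRefillNanos = now;
        if (tokens >= 1.0) {
            tokens -= 1.0;
            return true;  // request allowed
        }
        return false;     // request rejected (maps to HTTP 429)
    }

    public static void main(String[] args) {
        TokenBucket bucket = new TokenBucket(10, 10.0 / 60.0); // 10 req/min
        int allowed = 0;
        for (int i = 0; i < 12; i++) {
            if (bucket.tryConsume()) allowed++;
        }
        System.out.println("allowed=" + allowed); // → allowed=10 (burst drains the bucket)
    }
}
```

Because the bucket starts full, a client can burst up to `capacity` requests at once; refills then trickle in at the configured rate.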
- Backend: Java 21, Spring Boot 3, Project Reactor (WebFlux)
- API Gateway: Spring Cloud Gateway
- Database: Redis (for distributed state management)
- Build Tool: Maven
- Containerization: Docker, Docker Compose
- CI/CD: GitHub Actions for automated building and testing
- Load Testing: k6
The system was load-tested using k6, simulating a ramp-up to 100 virtual users over 30 seconds and sustaining that load for one minute. The results demonstrate the system's high throughput and low latency.
- Average Throughput: 79.40 requests/second
- Average Latency (http_req_duration): 10.27ms
- 95th Percentile Latency: 17.75ms
- Success Rate: 100% (all requests correctly returned either `200 OK` or `429 Too Many Requests`)
These metrics confirm that the rate limiter adds minimal overhead while effectively enforcing usage policies under significant load.
- Git
- Docker & Docker Compose
- Java 21
- Maven
This is the easiest way to get all services up and running.
- Clone the repository:

```bash
git clone https://github.com/sonii-shivansh/api-rate-limiter.git
cd api-rate-limiter
```

- Build and run the services using Docker Compose:

```bash
docker-compose up --build -d
```

This command builds the images for all services and starts them in detached mode.
- Start Redis: You still need Redis running. You can start it easily with Docker:

```bash
docker run --name my-redis -p 6379:6379 -d redis
```

- Run each service: Open three separate terminals and run the following commands in each respective service directory (api-gateway, rate-limiter-service, product-service):

```bash
# Terminal 1: Product Service
cd product-service
./mvnw spring-boot:run

# Terminal 2: Rate Limiter Service
cd rate-limiter-service
./mvnw spring-boot:run

# Terminal 3: API Gateway
cd api-gateway
./mvnw spring-boot:run
```
The rate limit parameters can be configured in the `rate-limiter-service/src/main/resources/application.yml` file:

```yaml
rate-limiter:
  bucket-capacity: 10          # Max tokens in the bucket
  refill-rate-per-minute: 10   # Tokens added per minute
```
With the default settings, each IP address is allowed 10 requests per minute.
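The interplay of the two settings is worth spelling out: the capacity bounds the burst, while the refill rate sets the sustained pace. A quick arithmetic check (values mirror the defaults above):

```java
public class RefillMath {
    public static void main(String[] args) {
        int bucketCapacity = 10;   // mirrors bucket-capacity in application.yml
        int refillPerMinute = 10;  // mirrors refill-rate-per-minute
        double secondsPerToken = 60.0 / refillPerMinute;
        // A client can burst up to bucketCapacity requests at once,
        // then sustain one request every secondsPerToken seconds.
        System.out.println("burst size: " + bucketCapacity);          // → burst size: 10
        System.out.println("seconds per token: " + secondsPerToken);  // → seconds per token: 6.0
    }
}
```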
Once the services are running, you can test the rate limiter by making requests to the protected endpoint through the API Gateway.
- URL: `http://localhost:8080/api/products`
- Method: `GET`
You can use curl to test it. The first 10 requests within a minute should succeed. Subsequent requests will be rejected with a 429 Too Many Requests status code until the token bucket is refilled.
```bash
# This request should succeed (if tokens are available)
curl -i http://localhost:8080/api/products

# Run the request 11 times to watch the rate limit kick in
for i in {1..11}; do
  curl -s -o /dev/null -w "%{http_code}\n" http://localhost:8080/api/products
done
```

On Windows PowerShell, the equivalent loop is `1..11 | ForEach-Object { curl http://localhost:8080/api/products }`.
We welcome contributions from the open-source community! If you're looking to contribute, please read our CONTRIBUTING.md for guidelines on how to get started.
We have a number of "good first issues" that are perfect for new contributors. We're excited to see your pull requests!