12 - Designing A Rate Limiter

A rate limiter is a common component in system design that helps control the rate at which certain operations or requests are allowed to occur. It acts as a mechanism to prevent overloading a system by limiting the number of requests or actions processed within a specified time period.

A rate limiter is employed to regulate the flow of traffic originating from a client or service. Within the realm of HTTP, this mechanism restricts the number of client requests permitted within a designated timeframe. If the count of API requests surpasses the threshold set by the rate limiter, any surplus calls are obstructed.

Algorithms For Rate Limiting

Rate limiting is a crucial technique used in system design to control the rate at which certain operations or requests are allowed to be processed. It helps protect systems from abuse, prevents resource exhaustion, and ensures fair usage among users.

Fixed Window Counter Algorithm

The fixed window counter algorithm is a simple algorithm used for counting occurrences or events within a fixed time window. It allows you to keep track of how many events have occurred within a specific period of time.

In this algorithm, a fixed number of requests or operations are allowed within a specific time window.
For example, if the rate limit is set to 100 requests per minute, no more than 100 requests can be processed within any 1-minute period.
Requests exceeding the limit are either rejected or subjected to a different handling mechanism, such as being placed in a queue or delayed.

Here's a detailed explanation of how the fixed window counter works:

Initialization:
- Define the maximum number of requests allowed within a specific time window (e.g., 1 minute).
- Set the starting time of the window.
Request Counting:
- As requests arrive, increment a counter that tracks the number of requests within the time window.
- Each request is associated with a timestamp indicating when it arrived.
Request Processing:
- When a request arrives, check if the number of requests within the time window exceeds the predefined limit.
- If the request count is below the limit, process the request and increment the counter.
- If the request count exceeds the limit, either reject the request or handle it separately based on the system's configuration.
Time Window Reset:
- After the time window duration has passed, reset the request counter to zero and update the starting time of the window to the current time.
- Any new requests that arrive after the window reset are counted from zero again.

The Fixed Window Algorithm provides a straightforward approach to rate limiting. It divides time into fixed windows and limits the number of requests allowed within each window. This ensures that the rate of requests remains within the defined limit.

However, the Fixed Window Algorithm has some limitations. One major drawback is that it doesn't account for bursts of requests. If a large number of requests arrive within a short duration, they may exhaust the limit for that window, even if the overall rate is within the desired limit. Additionally, the algorithm does not provide a smooth distribution of requests since it resets the count at specific intervals, potentially causing spikes in request processing.

Here's a simplified example to illustrate the algorithm:

Suppose we want to count the number of requests made to a web server within a 5-minute window, and we want to store the count of the last 5 minutes of requests.

Define the parameters: Time window size = 5 minutes.
Initialize the window: Create an array or a data structure of size 5 to store the request counts.
Track the events: Whenever a request occurs, add it to the data structure and remove any requests that occurred more than 5 minutes ago.
Count the events: Count the number of elements in the data structure to determine the request count within the time window.
Slide the window: As each minute passes, remove the oldest request from the data structure and add the newest request to the data structure.
Repeat steps 3-5: Continue tracking requests, counting events within the window, and sliding the window as time progresses.

Advantages:

Simplicity: The Fixed Window Algorithm is relatively simple to understand and implement, making it accessible to developers. It doesn't require complex calculations or intricate logic.
Control: The algorithm provides a straightforward approach to rate limiting by setting a fixed limit on the number of requests allowed within a specific time window. It offers control over the rate of requests being processed.
Resource Protection: By limiting the number of requests within a fixed window, the algorithm helps protect system resources from being overwhelmed by excessive traffic. It ensures resource availability and prevents resource exhaustion.

Disadvantages:

Bursty Traffic Handling: The Fixed Window Algorithm does not handle bursts of requests efficiently. If a large number of requests arrive within a short duration, they may exceed the limit for that specific window, even if the overall request rate is within the desired limit. This can lead to an uneven distribution of requests.
Lack of Granularity: The algorithm operates in fixed time windows, which may result in less granular control over the rate limiting. It doesn't account for variations within the window, potentially leading to spikes in request processing.
Limited Flexibility: The Fixed Window Algorithm doesn't provide flexibility in adjusting the rate limit dynamically based on varying traffic patterns. It relies on fixed limits and time intervals, which may not be ideal for systems with fluctuating request rates.

Summarizing Up

In summary, rate limiters offer a range of benefits including preventing overload, protecting against attacks, ensuring fair resource allocation, improving scalability, enhancing stability and availability, safeguarding third-party integrations, and enabling usage-based billing. These benefits contribute to the overall reliability, performance, and security of a system.

The Fixed Window Counter Algorithm is a simple rate limiting technique. It defines a maximum number of requests allowed within a specific time window. As requests arrive, a counter is incremented. If the counter exceeds the limit, requests are either rejected or handled separately. At the end of the time window, the counter resets to zero. The algorithm offers a straightforward approach to rate limiting but may struggle with bursty traffic and lacks flexibility in adjusting to varying request rates. It provides basic control over request processing, ensuring resource protection within fixed intervals.

Manas Tech Blogs

Manas Tech Blogs

12 - Designing A Rate Limiter - Part 4

Table of contents

Algorithms For Rate Limiting

Fixed Window Counter Algorithm

Summarizing Up

12 - Designing A Rate Limiter - Part 4

Table of contents

Algorithms For Rate Limiting

Fixed Window Counter Algorithm

Summarizing Up

Did you find this article valuable?