Load Limiting

Types of Load Limiting
1. Proactive Load Limiting
2. Reactive Load Limiting
Best Practices
Policy Comparisons
1. Rate Limiters vs Bulkheads
Preferences

Systems become overloaded when usage exceeds the capacity of resources such as CPU, memory, disk and network IO, thread pools, and so on. Failsafe-go offers several policies that can prevent and sometimes detect system overload, including Adaptive Limiters, Adaptive Throttlers, Circuit Breakers, Bulkheads, Rate Limiters, Timeouts, and Caches. We’ll discuss below how these policies differ, and when you might choose one over another.

Types of Load Limiting

There’s two general approaches to load limiting: proactive, where we estimate when a system might be overloaded and proactively limit it, or reactive, where we react to a signal that indicates a system is actually overloaded and limit it.

Proactive Load Limiting

Bulkheads, Rate Limiters, and Timeouts are proactive load limiters. They must be statically configured to limit load at some point. Ideally, that configuration should be carefully chosen through load testing to start limiting before the system becomes overloaded, but not limit so early that the system’s resources are underutilized. While it’s better to limit sooner than later, choosing configuration for these policies that actually matches a workload and system capacity can be challenging, especially as these change over time.

Reactive Load Limiting

Reactive load limiters take a different approach. Rather than limiting based on a static configuration that might not match a system’s capacity, reactive limiters wait for a signal that a system is actually becoming overloaded before they start to limit. The benefit is that they can be used with any sized system, without requiring carefully chosen configuration.

Adaptive limiters are reactive since they detect indications of overload through changes in latency and throughput. Adaptive throttlers and time based Circuit Breakers are also reactive since they only limit executions when the recent failure rate exceeds a threshold, ex: 10% in the last minute.

Best Practices

For overload prevention, it’s recommended to use a reactive load limiter. Adaptive Limiters are first among those, followed by Adaptive Throttlers and time based Circuit Breakers. For other use cases such as maintaining per-user quotas, Rate Limiters or Bulkheads can be useful.

All load limiters can be used on the client side or the server side inside a system. If your client connects with only a few servers, it may make sense to have limiters on the client side for each server you connect to. If on the other hand you have many servers, it may make sense to place the limiter on the server, so that each client doesn’t need a limiter for each server.

Policy Comparisons

Rate Limiters vs Bulkheads

Rate limiters and Bulkheads are both forms of proactive limiting, and should be configured based on a system’s capacity, but they differ in that Bulkheads are better at handling varied workloads than rate limiters. The reason for this is highlighted by Little’s Law, which states that the average concurrency inside a system relates to the average request rate and response time.

For example: 100 reqs/sec * 1 sec/req = an average concurrency of 100. If we limit the request rate to 100, the concurrency and load on the system is 100 so long as requests take 1 second to process. If more expensive requests arrive that take 2 seconds to process, then concurrency inside the system increases to 200.

Bulkheads avoid this problem since they directly limit concurrency, and therefore load. When the request rate or response times change, the concurrency limit is still the same, allowing a bulkhead to better control workloads for some system capacity. Of course, bulkheads still require static configuration. For something more adaptive to changes in load or capacity, consider an adaptive limiter.

Preferences

For guarding against system overload, prefer reactive limiters, and in particular, prefer adaptive limiters over circuit breakers since they respond more quickly to indications of overload.