Rate limiting in production: the four algorithms and their failure modes
Most services reach for a token bucket and never look further. But rate limiting covers four distinct algorithms, each with its own burst behaviour and failure modes. Here is what each one actually protects against.