Total concurrency
100
Estimate theoretical and safe throughput for APIs using latency and effective concurrency parameters.
Total concurrency
100
Theoretical max throughput
400 RPS
Safe throughput
320 RPS
Requests per minute (safe)
19,200
Real throughput depends on CPU saturation, queueing, GC, IO limits, retries and network conditions.
We treat workers and connections as execution slots. Throughput is slots divided by latency in seconds.
total_concurrency = workers * connections_per_worker
throughput_max = total_concurrency / latency_seconds
throughput_safe = throughput_max * safety_factorUse safe throughput for test planning and alert thresholds.
Theoretical values are upper bounds. Safe throughput leaves room for spikes, noisy neighbors and latency tail behavior.
70% to 80% is a practical default for early planning.
For conservative planning, yes. For average-case projections, use average latency.
No, queue delay should be evaluated separately in load tests.