Skip to main content

On This Page

P99

2 min read
Share

P99 is a way to measure performance that tells you how fast 99% of requests or tasks complete. It stands for the 99th percentile, meaning that 99 out of 100 requests finish within a certain time, and only 1% take longer.

How It Works

If you sort all response times from fastest to slowest, the P99 value is the point where 99% fall below it. For example, if a website has a P99 latency of 200 milliseconds, that means 99% of visitors get their page in 200ms or less, while only the slowest 1% wait longer.

How It Detects Problems

If your system normally responds in 100ms but occasionally has a hiccup that causes some requests to take 500ms, the average might still look fine (maybe 110ms), but your P99 will jump to around 500ms. This makes P99 a sensitive metric that catches performance anomalies that averages would hide.

Practical Example

If 99 out of 100 requests complete in 1 second, but 1 request takes 20 seconds due to a hiccup, the average would be 2.9 seconds (misleadingly high), but P99 would be 20 seconds, clearly showing you have a tail latency problem affecting that unlucky 1%. This helps teams identify and fix the root cause, like optimizing a slow database query or fixing network issues rather than just masking the problem.

P99 is particularly useful for setting alerts because it catches degrading performance trends without being too noisy from single extreme outliers, while still being more sensitive than just monitoring averages.

Continue reading

Next article

Pragmatic Clean Code: The Full Guide to Ownership & Entropy

Related Content