interactive demos

SRE Toolbox

Live calculators and tools for common SRE tasks: SLO budgeting, incident cost analysis, and reliability metrics.

SLO Error Budget Calculator

Error Budget: 43m (2592s total)
Max Downtime: 43m per 30d window
Fast Burn Alert: 52s (2% budget / 1h)
Slow Burn Alert: 2m (5% budget / 6h)
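The figures above can be reproduced with a few lines of arithmetic. This is a minimal sketch, not the calculator's actual code: the 99.9% target is inferred from the 2592s budget over a 30-day window (the page does not state it), and the function name `error_budget_seconds` is made up for illustration.

```python
def error_budget_seconds(slo_target: float, window_days: int = 30) -> float:
    """Allowed downtime in seconds for an availability target over a window."""
    window_seconds = window_days * 24 * 60 * 60
    return (1.0 - slo_target) * window_seconds


# Assumption: a 99.9% availability target, inferred from the displayed budget.
budget = error_budget_seconds(0.999)

# Downtime that consumes 2% (fast burn) and 5% (slow burn) of the budget.
fast_burn_seconds = 0.02 * budget
slow_burn_seconds = 0.05 * budget

print(f"budget: {budget:.0f}s ({budget / 60:.0f}m)")
print(f"fast burn: {fast_burn_seconds:.0f}s")
print(f"slow burn: {slow_burn_seconds / 60:.0f}m")
```

Under that assumed target the output rounds to the same 2592s, 43m, 52s, and 2m shown by the calculator.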
# Prometheus burn rate alert (fast burn)
(sum(rate(http_requests_total{code=~"5.."}[1h])) / sum(rate(http_requests_total[1h]))) > 0.0144
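The 0.0144 threshold in the query is consistent with the common burn-rate formulation: spending 2% of a 30-day budget in 1 hour means burning 14.4 times faster than the sustainable rate, and 14.4 times a 0.1% allowed error ratio is 0.0144. The sketch below works that out; it assumes a 99.9% target (inferred from the budget figures, not stated on the page), and `burn_rate_threshold` is a hypothetical helper name.

```python
def burn_rate_threshold(
    slo_target: float,
    budget_fraction: float,
    alert_window_hours: float,
    slo_window_days: int = 30,
) -> float:
    """Error-ratio threshold at which `budget_fraction` of the error budget
    is consumed within `alert_window_hours`."""
    slo_window_hours = slo_window_days * 24
    # Burn rate: how many times faster than "exactly on budget" we are burning.
    burn_rate = budget_fraction * slo_window_hours / alert_window_hours
    # Convert the burn rate into an error ratio to compare against.
    return burn_rate * (1.0 - slo_target)


fast = burn_rate_threshold(0.999, budget_fraction=0.02, alert_window_hours=1)
slow = burn_rate_threshold(0.999, budget_fraction=0.05, alert_window_hours=6)
print(f"fast burn threshold: {fast:.4f}")
print(f"slow burn threshold: {slow:.4f}")
```

The slow-burn value is derived here under the same assumptions; the page only shows the fast-burn query, so a matching slow-burn rule would also need its range selectors changed from `[1h]` to `[6h]`.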

Incident Cost Estimator

Revenue Loss: $10
Engineer Time: $260
Total Cost: $270
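The estimator's inputs and formula are not visible on this page, so the following is only one plausible way to arrive at such a breakdown: lost revenue plus engineer hours, both scaled by incident duration. Every input value below (30 minutes, $20/hour revenue, 4 engineers at $130/hour) is hypothetical and was chosen only so the output lines up with the figures shown; `incident_cost` is an illustrative name.

```python
def incident_cost(
    duration_minutes: float,
    revenue_per_hour: float,
    engineers: int,
    hourly_rate: float,
) -> tuple[float, float, float]:
    """Return (revenue_loss, engineer_time_cost, total_cost) in dollars."""
    hours = duration_minutes / 60
    revenue_loss = revenue_per_hour * hours
    engineer_time = engineers * hourly_rate * hours
    return revenue_loss, engineer_time, revenue_loss + engineer_time


# Hypothetical inputs, not taken from the tool.
revenue_loss, engineer_time, total = incident_cost(
    duration_minutes=30, revenue_per_hour=20, engineers=4, hourly_rate=130
)
print(f"revenue loss: ${revenue_loss:.0f}")
print(f"engineer time: ${engineer_time:.0f}")
print(f"total cost: ${total:.0f}")
```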

MTTR / MTBF Tracker

Incident durations: 30m, 45m, 20m, 60m, 15m
MTTR: 34m (mean time to recover)
MTBF: 144h (mean time between failures)
Availability: 99.606% (estimated from data)
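These three numbers are consistent with a simple calculation over the five incident durations. The sketch below assumes a 30-day (720h) observation window, takes MTBF as the window divided by the incident count, and takes availability as one minus total downtime over the window. The window length and the function name `reliability_metrics` are assumptions; the tracker's actual method is not shown.

```python
from statistics import mean


def reliability_metrics(
    durations_minutes: list[float], window_hours: float = 720.0
) -> tuple[float, float, float]:
    """Return (mttr_minutes, mtbf_hours, availability) for a list of incidents."""
    mttr_minutes = mean(durations_minutes)
    # Assumption: MTBF is approximated as window length / number of incidents.
    mtbf_hours = window_hours / len(durations_minutes)
    downtime_hours = sum(durations_minutes) / 60
    availability = 1.0 - downtime_hours / window_hours
    return mttr_minutes, mtbf_hours, availability


mttr, mtbf, availability = reliability_metrics([30, 45, 20, 60, 15])
print(f"MTTR: {mttr:.0f}m")
print(f"MTBF: {mtbf:.0f}h")
print(f"Availability: {availability * 100:.3f}%")
```

Other definitions of MTBF count only uptime between failures, which would give slightly different values on the same data.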
More tools coming

I'm building more interactive SRE tools: a Kubernetes resource rightsizing calculator, an on-call rotation health checker, and a capacity planning estimator. Stay tuned.