/

post-mortem

hacktoberfest
debugging
monitoring
security
incident-management
on-call
dev-ops
site-reliability-engineering
chaos-engineering
incident-response
reliability
sre-culture
observability
infrastructure
devops
sre
alerting

danluu/post-mortems
502日前10.9k

A collection of postmortems. Sorry for the delay in merging PRs!

upgundecha/howtheysre
501日前8.8k

A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)