/

chaos-engineering

monitoring
security
incident-management
on-call
dev-ops
site-reliability-engineering
incident-response
post-mortem
reliability
sre-culture
observability
infrastructure
devops
sre
alerting

upgundecha/howtheysre
501日前8.8k

A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)