Links: incident response program @ honeycomb
- How We Manage Incident Response at Honeycomb: Good ideas in here to think about. As a system grows it becomes harder for operators to know fully how it works. Make alerts actionable to avoid some cognitive burden. Coordination is important. (eg Say you’re going to something before you do it.) Psychological safety is important in letting others feel like they can share their experience without reprisal.