|
| 1 | +--- |
| 2 | +title: "Incident Postmortem" |
| 3 | +description: "Template for conducting blameless postmortems after incidents or outages" |
| 4 | +sections: |
| 5 | + - title: "Incident Summary" |
| 6 | + description: "What happened and when" |
| 7 | + - title: "Timeline of Events" |
| 8 | + description: "Chronological sequence of events" |
| 9 | + - title: "Root Cause Analysis" |
| 10 | + description: "What caused the incident" |
| 11 | + - title: "Impact Assessment" |
| 12 | + description: "Effect on users and business" |
| 13 | + - title: "Response & Resolution" |
| 14 | + description: "How the incident was resolved" |
| 15 | + - title: "Lessons Learned" |
| 16 | + description: "Key takeaways and insights" |
| 17 | + - title: "Action Items" |
| 18 | + description: "Preventive measures and improvements" |
| 19 | +--- |
| 20 | + |
| 21 | +## Incident Summary |
| 22 | + |
| 23 | +Brief description of the incident, affected systems, and duration. |
| 24 | + |
| 25 | +## Timeline of Events |
| 26 | + |
| 27 | +Chronological sequence of what happened, from detection to resolution. |
| 28 | + |
| 29 | +## Root Cause Analysis |
| 30 | + |
| 31 | +Technical root cause, contributing factors, and why detection/prevention failed. |
| 32 | + |
| 33 | +## Impact Assessment |
| 34 | + |
| 35 | +User impact, business impact, affected customers, and financial implications. |
| 36 | + |
| 37 | +## Response & Resolution |
| 38 | + |
| 39 | +How the incident was detected, escalated, debugged, and ultimately resolved. |
| 40 | + |
| 41 | +## Lessons Learned |
| 42 | + |
| 43 | +What went well, what didn't, and key insights from the incident. |
| 44 | + |
| 45 | +## Action Items |
| 46 | + |
| 47 | +Specific improvements to prevent recurrence, improve detection, or speed up response. |
0 commit comments