SaaS Post-Mortem
π Incident Summary
A one-sentence overview of the incident:
e.g., βOn July 17, the payment service failed in production, affecting all APAC users for 45 minutes.β
π Incident Details
Field
Description
Date & Time
When it occurred (include time zone)
Location
System / Region / Team affected
Detected How
Monitoring alert, user report, etc.
Severity Level
P1 / P2 / P3 (or custom scale)
Status
Ongoing / Resolved / Monitoring
π Root Cause Analysis
Primary Root Cause:
Contributing Factors:
Detection Issues: (Monitoring gaps, alerting failures, etc.)
π§ Resolution Details
Immediate Fixes Implemented:
Full Service Restored At:
Time to Detection (TTD):
Time to Resolution (TTR):
β
Actions
Action 1:
Action 2:
Action 3:
π― Learnings
Lesson 1:
Lesson 2:
Lesson 3:
π Recommendations
Recommendation 1:
Recommendation 2:
Recommendation 3:
π Post-Mortem Follow-Up
Owner Assigned:
Due Date:
Next Review Date: