
Incident Postmortem Report

Capture incident facts quickly, then turn them into clear root cause analysis and follow-up actions.

Tags: software, saas, incident-review, postmortem, reliability
Use this after service incidents when the team needs a clean record of what happened, its impact, and what changes next. Voice-first prompt suggestion: "Create a postmortem for incident INC-2026-0215. Checkout latency spiked after deploy. Severity SEV-2. Incident ran from 18:04 to 19:12 UTC. Customer impact: 18% delayed checkouts and 4% failures. Timeline: alert, incident start, rollback, cache tuning, recovery. Root cause: retry policy increased downstream load. Improvements: better pre-release load testing and tighter alerts. Action items with owners and due dates included."

Form preview

Overview
Incident Title*
Text
Incident ID*
Text
Severity*
Text
Options: SEV-1, SEV-2, SEV-3, SEV-4
Incident Window*
Text
Start and end time
Summary*
Text Area
Customer Impact*
Text Area
Analysis
Timeline
List
Columns: Time, Event, Owner
Root Cause*
Text Area
Contributing Factors
Text Area
Follow-up
What Went Well
Text Area
What To Improve
Text Area
Action Items
List
Columns: Action, Owner, Priority, Due Date, Status
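For teams that want to store or integrate these reports, the fields above map naturally onto a structured record. Below is a minimal TypeScript sketch, assuming a JSON-style export; the interface names and the Priority/Status value sets are illustrative assumptions, not the template's actual schema.

// Hypothetical shape for a completed postmortem report.
// Field names mirror the form above; everything else is an assumption.

interface TimelineEntry {
  time: string;   // e.g. "18:04" (UTC)
  event: string;
  owner: string;
}

interface ActionItem {
  action: string;
  owner: string;
  priority: "High" | "Medium" | "Low";     // assumed value set
  dueDate: string;                         // ISO date, e.g. "2026-02-20"
  status: "Open" | "In Progress" | "Done"; // assumed value set
}

interface PostmortemReport {
  incidentTitle: string;
  incidentId: string;
  severity: "SEV-1" | "SEV-2" | "SEV-3" | "SEV-4";
  incidentWindow: string;        // start and end time
  summary: string;
  customerImpact: string;
  timeline: TimelineEntry[];
  rootCause: string;
  contributingFactors?: string;  // optional in the form
  whatWentWell?: string;
  whatToImprove?: string;
  actionItems: ActionItem[];
}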
Example filled report
Sample data, showing what you'll get after recording.

Overview
Incident Title
Checkout API latency spike during evening traffic
Incident ID
INC-2026-0215
Severity
SEV-2
Incident Window
2026-02-15 18:04 UTC to 2026-02-15 19:12 UTC
Summary
Latency increased sharply after a deployment, causing retries and intermittent checkout failures until rollback and cache tuning stabilized response time.
Customer Impact
Around 18% of checkout attempts were delayed and about 4% failed during the peak hour.
Analysis
Timeline
Time  | Event                                    | Owner
18:04 | Latency alert triggered                  | On-call SRE
18:11 | Incident declared and war room started   | Incident commander
18:29 | Rollback to previous release             | Backend engineer
18:47 | Cache configuration tuned                | Platform engineer
19:12 | Metrics normalized and incident resolved | On-call SRE
Root Cause
A new retry policy increased downstream load under peak traffic, which drove up cache misses and elevated response times.
Contributing Factors
The alert threshold was too loose for evening load, and the dashboard lacked queue-depth visibility.
Follow-up
What Went Well
Fast incident declaration and clear ownership reduced time to stabilization.
What To Improve
Run load tests for retry policy changes and tighten latency alert thresholds.
Action Items
Action                                              | Owner         | Priority | Due Date   | Status
Add queue depth panel to checkout dashboard         | Platform team | High     | 2026-02-20 | Open
Create pre-release load test gate for retry changes | Backend team  | High     | 2026-02-22 | Open
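Encoded against the sketch after the form preview, the same example would look roughly like this (again, the export format itself is an assumption):

// The filled example above as a hypothetical JSON-style record.
const exampleReport: PostmortemReport = {
  incidentTitle: "Checkout API latency spike during evening traffic",
  incidentId: "INC-2026-0215",
  severity: "SEV-2",
  incidentWindow: "2026-02-15 18:04 UTC to 2026-02-15 19:12 UTC",
  summary:
    "Latency increased sharply after a deployment, causing retries and " +
    "intermittent checkout failures until rollback and cache tuning " +
    "stabilized response time.",
  customerImpact:
    "Around 18% of checkout attempts were delayed and about 4% failed " +
    "during the peak hour.",
  timeline: [
    { time: "18:04", event: "Latency alert triggered", owner: "On-call SRE" },
    { time: "18:11", event: "Incident declared and war room started", owner: "Incident commander" },
    { time: "18:29", event: "Rollback to previous release", owner: "Backend engineer" },
    { time: "18:47", event: "Cache configuration tuned", owner: "Platform engineer" },
    { time: "19:12", event: "Metrics normalized and incident resolved", owner: "On-call SRE" },
  ],
  rootCause:
    "A new retry policy increased downstream load under peak traffic, " +
    "which drove up cache misses and elevated response times.",
  contributingFactors:
    "The alert threshold was too loose for evening load, and the dashboard " +
    "lacked queue-depth visibility.",
  whatWentWell: "Fast incident declaration and clear ownership reduced time to stabilization.",
  whatToImprove: "Run load tests for retry policy changes and tighten latency alert thresholds.",
  actionItems: [
    { action: "Add queue depth panel to checkout dashboard", owner: "Platform team", priority: "High", dueDate: "2026-02-20", status: "Open" },
    { action: "Create pre-release load test gate for retry changes", owner: "Backend team", priority: "High", dueDate: "2026-02-22", status: "Open" },
  ],
};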