← Blog

How to read a Statuspage postmortem — what each section actually tells you

· 6 min read
incidentspostmortemsstatuspagereading

Every Anthropic incident on the public Statuspage moves through up to five states, in this order: InvestigatingIdentifiedMonitoringResolvedPostmortem. Each state’s update is written for a slightly different audience, with a slightly different purpose. Reading them as a sequence — instead of just glancing at the most recent — extracts more information than most people realize is there.

This is a working reader’s guide.

The five states, in order

Investigating — first acknowledgment

Posted within minutes of an on-call engineer being paged. The text is intentionally minimal: “We are investigating reports of [symptom].” The audience is users who are currently broken; the goal is to acknowledge the problem exists and let them stop wondering whether it is them.

What you can extract:

What you cannot extract:

Investigating is most useful as a confirmation signal. If you arrive at the page and the latest update is Investigating from 4 minutes ago, you have just learned that yes, the problem you are seeing is real, and someone is working on it. That is the entire information content.

Identified — they know what is happening

Usually posted 5–30 minutes after Investigating, sometimes longer for genuinely novel failures. The post identifies the cause at a coarse level — not a precise root cause, but a category: “we have identified an issue with our model serving capacity,” “we have identified a regression in our recent deployment,” “we have identified a third-party dependency issue.”

What you can extract:

What you still cannot extract:

Monitoring — fix deployed, watching

The post says some variation of “we have implemented a fix and are monitoring.” This is the inflection point. From a user’s perspective, the symptom should now be gone or subsiding. The provider is in the verification window.

What you can extract:

What you cannot extract:

Resolved — full recovery confirmed

Posted when monitoring has confirmed that the symptom is fully gone for everyone. Often includes a sentence like “All systems are operating normally.”

What you can extract:

What you may want to wait for:

Postmortem — the writeup

Not every incident gets a postmortem state on the public Statuspage. Anthropic posts them for the more impactful events, typically several days to a couple weeks after Resolved. The text is significantly longer — sometimes a few paragraphs — and explains what happened, why, what was done to fix it, and what is being done to prevent recurrence.

What you can extract:

What you should not over-read:

Reading them in sequence, not just the latest

The most common mistake is to glance at the current state of an incident and stop there. The sequence is more informative than any single state.

A useful reading pattern, in order:

  1. What is the current state? Tells you whether to act now or wait.
  2. How long has it been in the current state? A long time in Investigating means the cause is non-obvious. A long time in Monitoring means the fix is being watched carefully — possibly because the engineer is not fully confident.
  3. How long was the previous state? Investigation that took 2 hours implies a hard problem. Identification that came 5 minutes after investigation implies a familiar shape.
  4. What does the gap between Investigating and now look like in your own metrics? The official timeline and your error metric should roughly agree. If they do not, your error metric may be measuring something the incident does not cover (a different region, a different component).

What postmortems do not include

Two things to be aware of, because their absence is sometimes mistaken for omission.

No customer impact numbers. Postmortems will say “users experienced elevated error rates” but will not say “X% of requests failed for Y minutes.” Internal numbers are not published. If you need impact numbers for your own analysis, you will have to derive them from your own metrics.

No personnel detail. Postmortems do not name engineers, do not describe paging chains, and do not share details of internal coordination. These are intentional editorial choices, common across the industry.

If you are building your own incident-response practice and want to model on Anthropic’s, the public postmortems are a good template for tone but a thin source of operational detail. Cross-reference with public engineering blogs (when they exist) for the engineering-side context.

Why postmortems are scarce on minor incidents

Most minor impact incidents resolve at Resolved and never get a Postmortem state. This is a reasonable editorial choice: a 20-minute degraded performance event with no broad impact does not warrant a multi-paragraph public writeup. Postmortems are reserved for major and critical events.

For users tracking the system’s evolution: this means the public record of postmortems is biased toward the worst events. The day-to-day operational picture — how Anthropic handles routine minor events — is mostly invisible from the outside. Most providers operate this way.

Where to read postmortems alongside the dashboard

Anthropic’s Statuspage incident archive is the canonical archive. You can also follow our RSS feed — when an incident moves to Postmortem, the RSS item description picks up the new update text automatically. Subscribing to the feed is a reasonable way to passively collect postmortems as they are published, without having to remember to check the page.

Read the sequence. Each state is doing different work. The sum is more useful than any single update.

Share this post