Skill reference: playbook
Skill reference: playbook
Section titled “Skill reference: playbook”The playbook skill authors one document genre: a strategic operational
playbook — the higher-altitude coordination of a class of situations. This
reference describes what that document type is, how the skill produces one, when
it earns its place, and the provenance and sources behind it.
| Property | Value |
|---|---|
| Authors | A strategic playbook coordinating a class of incidents |
| Purpose group | Operations |
MIF conceptType | procedural |
| Target MIF level | 3 |
| Primary source | Google SRE Book — Table of Contents |
What this document type is
Section titled “What this document type is”A playbook operates one level above a runbook. Its subject is not a single alert but a class of situations — a Sev1 outage, a data-integrity incident, a security event — and its job is to coordinate the people and decisions that resolve it. The Google SRE practice of incident management (the chapter on Managing Incidents, extended in the SRE Workbook) is the model: a clear command structure with separated roles — incident commander, operations lead, communications — clear decision points, and named phases from detection through mitigation to recovery and hand-off. A playbook captures that structure so that when a major incident hits, who-does-what and when-to-escalate are already decided.
A playbook is not a tactical procedure for one alert — fixing a specific firing condition step-by-step is the job of a sre-runbook, which a playbook will often invoke. It is the strategic complement: the runbook restores a service, the playbook coordinates the response when many things are moving at once and humans must be organised.
How the skill produces one
Section titled “How the skill produces one”playbook is a genre skill: it carries the incident-coordination pattern as
durable instructions plus exemplars, and writes the artifact over a MIF floor so
the result is at once an operational coordination guide and a machine-conformant
unit.
- Pattern, made operational. The skill encodes the strategic shape — roles
and their responsibilities, decision points, phased response across a class of
situations — and refuses anti-triggered work (fixing one specific alert belongs
in an
sre-runbook). - Roles and phases, not just steps. Because the subject is a class of incidents, the content is organised around who decides what and in which phase, rather than a single linear remediation, which is why it targets the higher MIF Level 3.
- Exemplars set the bar. Like every genre in the suite it ships
good-l1.md(the MIF Level-1 floor),good.md(the target level — Level 3 here),bad.md(a counter-example), andevals/evals.json. Thecheck-exemplarsgate provesgood-l1.mdvalidates at L1 andgood.mdat its target level. - MIF projection. The document is authored with MIF frontmatter (via the
shared
mif-frontmattersubstrate) and aconceptTypeofprocedural, reflecting that a playbook coordinates a sequence of performed actions across roles.mif-validateproves the Markdown to JSON-LD round-trip is lossless before the document is considered done.
When it is beneficial
Section titled “When it is beneficial”Reach for playbook when an incident is bigger than one alert — when
resolving it means coordinating several people, making escalation and
communication decisions, and moving through phases. Its value is that it removes
improvisation from the chaotic part of an incident: roles and decision rights are
settled in advance, so the team executes rather than debates.
Do not use it to remediate a single firing alert — that is a leaner sre-runbook, which the playbook references for the tactical fix. The cost of a playbook is realism: a coordination plan that has never been exercised tends to fail under real pressure, so it should be rehearsed and revised through incident retrospectives.
Example
Section titled “Example”A playbook titled “Sev1 customer-facing outage” defines the incident-commander, operations-lead, and communications roles and their hand-offs, then phases the response — declare and assemble, stabilise and mitigate (invoking the relevant service runbooks), communicate to stakeholders on a cadence, then recover and schedule the postmortem — with explicit decision points for when to escalate severity or page additional teams.
Provenance & citations
Section titled “Provenance & citations”- Genre source — Google SRE incident management: the playbook genre follows the incident-command and coordination practice in the Google SRE Book, https://sre.google/sre-book/table-of-contents/ (see the chapter on Managing Incidents), extended in the SRE Workbook, https://sre.google/workbook/table-of-contents/.
- Skill provenance: authored by the
playbookskill in the mif-docs plugin, https://github.com/modeled-information-format/mif-docs-plugin; the skill’s exemplars andevals/define and verify the pattern. - MIF conformance: the document projects to canonical JSON-LD under the MIF
specification, https://mif-spec.dev, and is proven lossless by
mif-validate. - Index: this skill is one entry in the skills by purpose catalog; its sibling operations genre is the tactical sre-runbook.