§1 · Premise
Build the second ledger; do not tear out the first
Stop playing two different games tracking two different scores.
You already run a service-first instrument, and it works. Uptime is real. MTTR is real. The discipline behind those numbers is the reason the business funds you. Do not tear it out. This workshop is not a replacement program — it is an addition. You are going to build a second ledger, the Outcome layer, on top of the service-first foundation you already trust, so that IT and the business stop playing two different games tracking two different scores.
By the end you will have a working outcome instrument sized to your organization, three metrics at least one of which cannot be gamed, and a review cadence that produces a harder, more useful conversation than the one you have today.
This workshop is about internal IT Operations — the function that enables the business through technology. It does not apply when the service IT runs is the product the company sells. The companion Deep Dive made the argument for why, and ended with you finding your silences — Green, Amber, Red — across three business-unit conversations. This workshop is the how. It assumes you have done that, or are about to.
§2 · Falsification bet
The bet we are willing to lose
Every CTRL ALT PRESS piece states a claim you can prove wrong. The trust layer is a falsifiable claim with a horizon and a check, not a testimonial.
Position An honest outcome layer surfaces routed-around and failed work Horizon Two quarters after the layer exists Checkable Whether business-unit leads report a changed picture of IT
If a team builds an honest outcome layer — one that grades whether the business outcome happened, paired with non-gameable technical signal — and after two quarters the business-unit leads report the same picture of IT they had before the layer existed, with no new visibility into routed-around or failed work, then the outcome layer added nothing and I was wrong. I would want to hear about it. Hold the workshop to that standard: if an exercise produces a metric that only confirms what your green dashboard already told you, the exercise failed, not the business.
OPEN · CHECKABLE Whether business-unit leads report a changed picture of IT
§3 · The bleed taxonomy
Three graded ways an outcome fails to cleanly happen
When an outcome does not cleanly happen, it fails in one of three graded ways. This is the vocabulary you score with — the Outcome ledger is not binary, and pretending it is throws away most of the signal. A clean outcome is none of these. Most of the value of the layer lives in making Workaround visible, because Workaround is exactly the silence your service dashboard is structurally blind to: it never became a ticket.
- 01
Drag
The outcome happened, but slower or more expensively than it should have.
Operates on Cost: the hour an executive burned waiting on the ERP. Measured in time and attention. - 02
Workaround
The outcome happened, but not by the official path — the peer's printed PDFs, the shadow spreadsheet.
Operates on Cost: fragility, key-person (tribal-knowledge) risk, and slow erosion of trust that the official path will work next time. - 03
Failure
The outcome did not happen. The loan did not sell. The release did not ship.
Operates on Cost: a downstream price almost always — the customer who leaves, the deadline with reputational weight, the deal that does not occur.
The math we extend Let V be the verdict on whether IT is doing its job, S what the service did (uptime, response time, resolution within target), and O whether the user achieved the business outcome. The service-first model grades the verdict against the service alone — V = g(S) — which was the correct design for the constraints of the late 1980s, but is blind to the question the business is asking. This workshop builds the instrument that computes V = g(S, O): the engineering of O as a first-class, measured input, without breaking S.
§4 · Who gets what
Where the value lands by archetype
One layer, but its center of gravity shifts with where you operate. Pick your stance to see the takeaway and the segments to weight.
Supporter L1–L2
Builder L3
Architect Principal
Orchestrator EM / Director
Strategist CISO / CTO / VP
§5 · Curriculum
Five segments, then the capstone
Each segment is one move: an objective, the core idea, an exercise you do on paper, and a checkpoint that names the failure mode. Sequence is load-bearing — Segment 1 is the gate.
Objective
Install the distinction that protects every later exercise. Most experience-first programs die here, because the team buys a tool before they can define the thing the tool is supposed to measure.
Core idea
The Outcome layer is not any one of the three XLA data types (X, O, T) — it is a verdict. It is the second ledger that asks “Did the thing the work existed for actually happen?” You assemble it from X/O/T data, but the verdict is the artifact, not the telemetry. It is not CSAT, not a sentiment survey bolted onto a ticket, and not a DEX platform purchase.
Exercise
Exercise 1.1 — Draw the boundary. Write the one-sentence difference between CSAT and the Outcome layer, then the exact sentence you will say when a skeptical CIO asks “isn't this just a rebranded CSAT survey?” Trade with a partner and attack each other's sentences for hidden sentiment-survey assumptions.
Checkpoint
You cannot proceed until your sentence survives a partner who is trying to prove your outcome layer is just a survey in a costume.
Objective
Turn one silence into one measurable outcome-attached signal. Work at the unit level first, because a layer is just many units stacked, and a team that cannot instrument one outcome cannot instrument a thousand.
Core idea
Every outcome unit needs exactly three fields attached: the outcome unit (stated in the business's language, not IT's), the moment of truth (the timestamp at which it is graded pass/fail, when T_impact lands), and the value at risk Y (honest and directional, the number that earns the next meeting). Score failures with the bleed taxonomy — clean / Drag / Workaround / Failure.
Exercise
Exercise 2.1 — Instrument one real silence. Take one Amber silence from your three Deep-Dive conversations and fill the worksheet: outcome unit, moment of truth, value at risk, service-ledger verdict, outcome-ledger verdict, and the gap between the two.
Checkpoint
If your two verdicts are the same color, you have either picked a Green silence or written the outcome in IT's language. Rewrite the outcome unit until a business-unit lead would recognize it as theirs.
Objective
Build three outcome metrics, at least one structurally insulated against gaming by either IT or the business. This is the segment that separates an honest layer from a new watermelon.
Core idea
The watermelon effect — green outside, red inside — is not exclusive to service metrics; a badly designed outcome metric grows its own rind (the support-buffered system). The defense is a design property: a non-gameable metric cannot be moved by being nicer, closing tickets faster, or surveying happier people. The reference design is a friction-time metric — productive time lost per critical user per month, derived from telemetry — and SRE error budgets (99.9% SLO → ~43 min/month) anchor outcome targets in numbers engineers already defend.
Exercise
Exercise 3.1 — Design your three metrics for the journey you instrumented in 2.1. Run each through the gaming-insulation test: movable by desk behavior alone, survey-dependent, cross-referenced to telemetry. At least one must pass all three.
Checkpoint
If none of your three metrics is gaming-insulated, you have designed a future watermelon. Replace your weakest metric with a telemetry-derived friction-time metric before moving on.
Objective
Specify the minimum instrument that produces the conversation, at your organization's actual maturity — not the one a vendor would sell you. The implementation differs by tier; the conversation it produces is identical.
Core idea
The discipline mistake is to skip your tier and build the one above it — a 20-person shop that buys the multinational's telemetry layer has bought vendor capture and will abandon the layer inside a year. Most teams below 5,000 seats can build a credible first layer from tooling they already own: native endpoint analytics for telemetry, bundled ITSM survey tooling for the subjective signal, two ticket fields and a recurring meeting for the verdict. Lead with the cheap layer. Earn the expensive one.
Exercise
Exercise 4.1 — Spec your minimal stack. State your tier honestly, then specify where the second ledger lives, the telemetry signal from tooling you already own, the subjective signal and how you cap its bias, the verdict artifact, and the one thing you will not buy yet.
Checkpoint
If your spec includes a new platform purchase and you are below Tier 4, write the sentence that justifies it against the vendor-capture math. If you cannot, cut it.
Objective
Run the dual-ledger review, and inoculate against the failure modes that turn a real layer back into theater.
Core idea
The layer is not the dashboard — it is the harder conversation the dashboard makes possible. With the second ledger on the table, IT explains why a green workflow had three near misses and the business-unit lead explains why the spreadsheet workaround was not flagged earlier. That friction is the work. Ownership does not move: the layer is not a blame instrument, it is a shared instrument to mitigate bleed. The three failure modes are experience-washing, sentiment theater, and the support-buffered system.
Exercise
Exercise 5.1 — Run the review (role-play in threes): IT manager, business-unit lead, and an observer with the failure-mode checklist run a ten-minute dual-ledger review. The observer flags every slide toward theater (a number admired but not acted on) or washing (a survey treated as the whole verdict). Rotate.
Checkpoint
A passing review ends with one concrete action — a backlog change, a queue reallocation, a shared-boundary adjustment — traceable to the outcome ledger, not the service ledger.
Brief
Design a 90-day plan to stand up an outcome layer for your own organization (or the supplied scenario), then defend it before a review panel speaking in the CTRL ALT PRESS voice — the Operator, the Architect, the Strategist, and the Editorial Voice.
Scenario
A professional-services firm, 1,200 remote employees, traditional service-first ITIL shop. Technical SLAs report 99.2% compliance; the most recent engagement survey rated IT 5.4/10. The CIO has heard “XLA,” thinks it is a passing trend, and asked you directly whether it is just a rebranded CSAT survey. You have 90 days, a $25,000 tooling ceiling, no new headcount, and you may not disrupt active service-desk operations.
Must contain
- One high-value, high-friction persona and their weekly journey, with the moments of truth marked.
- A minimal outcome layer sized to the org's tier, using existing telemetry plus describer-labeled (not bare-Likert) sentiment capture.
- Three outcome metrics; at least one mathematically insulated against gaming.
- A distributed governance model using existing people — who sits in the review, who can move a backlog, on what cadence.
- The business case that answers the CIO's exact objection, linking the layer to reduced productivity loss in the value-at-risk unit.
- The three most likely failure modes for this specific environment, with technical and behavioral counters for each.
Scoring
Scored 0–5 across six dimensions: diagnostic clarity, tooling parsimony, metric design, organizational realism, failure-mode literacy, and CIO defense. Pass ≥ 18/30; distinction ≥ 24 with no dimension below 3.
You will leave able to
- State precisely what the Outcome layer is and what it is not (not CSAT, not a sentiment survey, not a DEX platform purchase).
- Convert a single real incident into a dual-ledger entry: one verdict on the service, one verdict on the outcome.
- Define an outcome unit, its moment of truth, and its value at risk for a real business journey.
- Design three outcome metrics, at least one of which is mathematically insulated against gaming.
- Specify the smallest viable outcome layer for your organization's maturity tier.
- Run the dual-ledger review conversation, and recognize the three failure modes that hollow it out.
§6 · Maturity gradient
Size the layer to your actual maturity
From the Deep Dive, extended into build specifications. The discipline mistake is to skip your tier and build the one above it — a 20-person shop that buys the multinational's telemetry layer has bought vendor capture.
| Tier | Where the second ledger lives | Build cost | What it unlocks |
|---|---|---|---|
| Verbal | Spoken aloud in the weekly | ~zero | The discipline of asking the outcome question at all |
| Two fields | Two ticket fields (outcome unit + value at risk) plus a weekly review | hours | A countable Workaround incidence; a first friction signal |
| Shared boundary | A jointly tracked deal/outcome ledger; the business champion is in the ticket queue when a critical outcome is at risk; two-sided postmortems | weeks | True dual ownership; the conversation gets teeth |
| Telemetry layer | A telemetry-derived friction layer the CFO can see, persona-segmented, correlated X/O/T | months + budget | Boardroom-grade outcome attribution; contract-grade verification |
§7 · Evidence floor
Sources and evidence floor
In CTRL ALT PRESS house style, claims carry their provenance, and vendor-sourced numbers are labeled as such.
- 01
The XLA Stack layers X-data (experience), O-data (operational), and T-data (technical), made tangible by correlation.
- 02
The SLA→XLA distinction and the outcome-and-value framing.
- 03
80% of lost employee time comes from 13% of tickets.
- 04
Watermelon-resistance via a telemetry-derived friction-time metric.
- 05
A 99.9% SLO is a monthly error budget of ~43 minutes — a quantified threshold of user pain.
- 06
DEX platform economics frequently fail to clear below ~5,000 seats.
- 07
The two-ledger model, V = g(S), the bleed taxonomy, the three silences, the maturity gradient, and every scenario.
§8 · Enroll
Choose your delivery tier
Three tiers, same curriculum. Efficacy rises with the live role-play and adversarial capstone defense — the in-person intensive is what justifies the premium tier.
| Modality | Format | Efficacy | Positioning |
|---|---|---|---|
| Starter / Self-paced | Segments 1–5, worksheets, no live panel | Capstone built but self-assessed against the rubric | Entry tier; the submittable worksheets (1.1–4.1) carry the discipline |
| Remote intensive | Full workshop, live 5.1 role-play, async capstone panel | High — the dual-ledger role-play is run live | Three half-days or one long day |
| In-person intensive | Full workshop + live adversarial capstone defense | Maximum — the defense is embodied | Top tier; the live defense is what justifies the premium |
The operational sequel to the Deep Dive “Silence is no news…”.