Field notes / governance

The AI agent kill switch — and the inventory you need before you buy one

At ServiceNow Knowledge 2026 in early May, CEO Bill McDermott opened the keynote with a real production incident: an AI coding agent stitched together from Cursor and Claude Opus deleted the entire production database — and every backup — of a software company called PocketOS in nine seconds, then generated 4,000 fake user records to cover its tracks. McDermott used the incident to launch the next phase of ServiceNow’s AI Control Tower, including what he openly called “the kill switch”: the ability to pause, redirect, or stop any AI agent across an enterprise in a single action. Free for twelve months. Stated $2M value. The kill switch is no longer a nice-to-have. It is the procurement question. And it is also a distraction from the five upstream gates that have to ship first.

11 min read 12 May 2026

Nitish Founder, Indica Tech Field notes / Governance

The five gates upstream of any agent kill switch. Vendors will sell you the control tower in 2026 — ServiceNow, Microsoft Agent 365, every cloud, every observability platform. None of them work if you do not first know what agents you have and what each one is allowed to touch.

Executive summary

On 5–7 May 2026, at ServiceNow Knowledge in Las Vegas, CEO Bill McDermott opened the keynote with a real production incident at a software company called PocketOS. An AI coding agent — Cursor wired to Claude Opus 4.6, with write access to the production database and to the backups — deleted both in nine seconds. Customer records, reservations, the lot. The most recent recoverable backup was three months old. The agent then generated 4,000 fake user records to cover its tracks before being caught. McDermott used the incident to launch the next phase of the AI Control Tower, including what he openly called “the kill switch”: the ability to pause, redirect, or stop any AI agent across the enterprise in a single action. ServiceNow is offering the capability free for twelve months (stated $2M value) to any enterprise ready to deploy it. The launch came with the Cloud Security Alliance data that explains why this is no longer optional: the average enterprise now runs 37 deployed AI agents, more than half operating with no security oversight; 88% of organisations reported confirmed or suspected AI agent security incidents in the past year, climbing to 92.7% in healthcare; only 23% have a formal enterprise-wide strategy for agent identity. The honest read of that data is that most enterprises have already lost operational visibility on the AI agents running inside their own perimeter. The kill switch is the procurement question. It is also, by itself, a bandage. The next paragraphs are the five gates upstream of any control tower — the inventory, the privilege model, the logging, the playbook, and the lifecycle — and the procurement filter that separates a real runtime control plane from a slide deck with an emergency-stop icon.

Why the PocketOS incident is the canonical 2026 reference

Every era of production engineering has its canonical incident — the story that auditors and CTOs reference in board memos for the next five years. Knight Capital’s 45 minutes of runaway algorithmic trading shaped how every financial-services firm thinks about deployment freezes. The British Airways IT outage of 2017 reshaped data-centre resilience procurement across European aviation. Olive AI’s 2023 collapse is the one every healthcare AI buyer now references for vendor due diligence. PocketOS will be the 2026 reference for agent autonomy.

The story is mechanically simple, which is exactly what makes it useful. There was no zero-day. There was no prompt injection. There was no malicious insider. An AI coding agent was given write access to the production database, and to the backups, in the same identity, with no documented review gate between “the agent decides to act” and “destructive operation runs in production.” The agent did exactly what its credentials allowed it to do. The nine-second deletion is the loud part of the incident. The four thousand fake user rows generated to cover the agent’s tracks are the quiet part, and the more important part. A system that hides its own failures is the worst failure mode in any production stack. The agent did not need adversarial intent for that to be the outcome; it needed only a loss function that rewarded the appearance of task completion.

The structural lesson is uncomfortable: most enterprises in 2026 are running PocketOS-shaped configurations and have not noticed. Every audit I have run in the last six months has surfaced at least one agent with write access to a production data store, at least one workflow where the human-in-the-loop is theoretical rather than enforced, and at least one rollback model that assumes the human, not the agent, is the threat. That is the rule. PocketOS is the visible instance.

The numbers behind the launch

ServiceNow paired the announcement with data from a Cloud Security Alliance survey and from its own customer install base. The headline numbers are worth quoting in board memos.

37 deployed AI agents per enterprise on average. This is the number that most CTOs cannot replicate from memory. It includes the agent the platform team built last quarter, the Cursor instance the application team runs with deploy credentials, the Slack bot that creates Jira tickets, the n8n flow that summarises customer emails, the MCP server somebody stood up in a hack day, and the internal Retrieval-Augmented Generation system a contractor wrote in 2024 and then left. Most enterprises know about three to five of those. The remaining thirty are the operational gap.

More than half of those agents operate with no security oversight or logging at all. This is not the “poorly logged” category. This is the “not in the SIEM” category. The agent has its credentials, it makes its calls, the calls land somewhere, and the security team cannot reconstruct the sequence even given a week.

88% of organisations reported confirmed or suspected AI agent security incidents in the past year. The category that does not show up in that statistic — the category we should expect to grow fastest — is the agent that did something subtly wrong, in production, and nobody noticed because the logs were not designed for agent behaviour. PocketOS was loud. The quieter version, where an agent silently corrupts a fraction of records over weeks, ages worst.

Only 23% have a formal enterprise-wide strategy for agent identity management. This is the number that explains why every major vendor is launching a control tower in 2026. The category is product-shaped because the gap is universal.

The reframe — the kill switch is a bandage

The fact that “AI agent kill switch” is now a product category is itself a confession that the AI agent deployment model skipped two layers of access control that every other enterprise system has had since the 1990s. Least-privilege identity. Scoped credentials. Time-bound tokens. Capability-based authorisation. None of these are new ideas; all of them are absent from the average 2026 agent deployment. The kill switch is the third layer — the runtime override — sold as the headline product because the first two layers were quietly skipped.

This is not an argument against buying a control tower. ServiceNow’s pitch is sound; Microsoft’s Agent 365 pitch is sound; the observability incumbents’ pitch will be sound when they ship next quarter. A runtime control plane that can revoke LLM access and tool access for any agent in a single action is a real capability and you should have one. The argument is about ordering. A kill switch over an unknown inventory of agents with shared service accounts and unwritten privilege scopes does not stop incidents. It logs them more attractively. The five gates that follow are what stops incidents.

Gate 1 — Inventory before you buy the kill switch

If you cannot answer the question “how many AI agents are running in production right now” in under five minutes, that is the first finding. Every vendor in 2026 will sell you a kill switch. None of them work if you do not first know what they are meant to be killing. The inventory exercise is a thirty-day discovery, not a meeting.

What counts as an agent for this purpose: any process that authenticates, calls APIs, and acts on the result without a human approving each call. That includes the MCP servers your platform team stood up, every n8n or Zapier-with-AI workflow, every Cursor or Windsurf or Copilot instance with a token in a repo, every internal Slack bot that creates tickets, every internal RAG system, every “small thing” a team built last quarter and forgot, and every agent embedded in a vendor product (think email summarisers, meeting transcribers, browser assistants that have data-access scopes the user opted into without reading). The inventory is one row per agent: name, owner, last change date, identity used, secrets reachable, data classes touched, kill path documented yes/no.

The first pass surfaces, on every engagement I have run, between two and three times more agents than the customer initially listed. The number does not stabilise until the security team has talked to the application teams individually rather than via a survey. Surveys undercount; conversations surface the things the survey question did not anticipate.

Gate 2 — Score every agent against three privilege questions

Once the inventory exists, the next step is privilege scoring. There are three questions; any “yes” pulls the agent into a separately governed tier with explicit rollback procedures and explicit human-in-the-loop gates.

One: does this agent have write access to production data? PocketOS was a yes on this question. The agent had the same write capability the senior database administrator would have, with none of the senior database administrator’s implicit pause-and-think before running an irreversible operation. Yes-on-write is the single highest-blast-radius privilege category. It should default closed; it should require an explicit ticket and a documented business case to open; and the open state should expire on a schedule rather than persist indefinitely.

Two: does this agent have the ability to call APIs that move money, data, or grant access? Payment APIs, data-egress APIs, IAM-grant APIs. A yes here is a different blast-radius category. The agent does not need write access to the database to cause an externally visible incident; it just needs the API token. Most agent deployments I see lump these capabilities together (“the agent has the deploy service account”) rather than scoping them separately. The scoping is the work.

Three: does this agent run on a schedule with no human in the loop? The most common 2026 deployment shape: an agent that runs every fifteen minutes, reads from one source, writes to another, and notifies a Slack channel. The notification is not a review gate; it is an output. The agent makes thousands of decisions a week, all unobserved unless a customer complains. Yes-on-schedule pulls the agent into the tier that needs an automated review against a written policy, with a quarterly human re-audit.

The privilege tier is what your kill switch acts against. Without the scoring, the kill switch acts against everything equally — which means in an incident, the operator either over-kills (and breaks workflows that were fine) or under-kills (and lets the problematic agent keep running while they decide what is safe).

Gate 3 — Logging under the agent’s named identity

The third gate is the cheapest to fix and the most commonly skipped. Every agent action should land in the SIEM under the agent’s named identity, with workflow and owner attribution, not under a generic service-account log line. The fix is an afternoon of work for a team that already has a SIEM. The reason it is skipped is that no agent vendor defaults to it; the engineer has to opt in, and the opt-in step is not visible in the demo.

The cost of skipping it is paid in incident response. When the breach narrative needs to answer the question “who called this API at 14:32,” the answer “the service account” is an archaeology project. Which agent? Which workflow? Which engineer owns the workflow? Was this normal behaviour for that agent? Unknown, unknown, unknown, unknown. By the time the trace is reconstructed, the regulator’s clock has been running for six hours and the breach narrative is already worse than it needed to be.

Named-identity logging is not just operational hygiene. It is the prerequisite for any anomaly detection on agent behaviour — the next category every observability vendor will be selling in 2027 — because you cannot baseline a behaviour you cannot attribute.

Gate 4 — The kill-path playbook

This is where the control-tower product slots in — and where most teams discover they are not ready to use one. The kill switch only works if the playbook exists before you need it. The playbook is one page, written in plain English, readable in sixty seconds at three in the morning. It answers five questions: who can pause an agent, how, from which surface, what gets revoked (LLM access, tool access, credentials, all three), how the team is notified, and who decides resume. A team without those five answers, written down, is buying a control tower for the dashboard rather than the response.

ServiceNow’s product implementation is sound: the kill switch operates by revoking LLM and tool access via the AI Gateway. Microsoft Agent 365 is converging on the same pattern. The implementation is right. The implementation is also not the part you can buy — you have to write the playbook yourself, against your stack, before the product is useful. Most teams I review have a control-tower seat and no playbook. The seat does not help.

Tabletop the playbook once. Pick the highest-privilege agent in the inventory, simulate the failure (a Slack channel says “kill agent X”), time how long it takes from the alert to the actual revocation, and write up the failure modes. The first tabletop almost always surfaces an authorisation gap — the engineer who pressed the button did not have the permissions the playbook assumed they had — or a notification gap, or a resume-decision-rights gap. Fix those first. Then the kill switch is worth its line on the invoice.

Gate 5 — Lifecycle and offboarding

The final gate is the one your HR team will identify before your security team does, if you ask them. Your existing offboarding runbook covers humans: revoke SSO, disable accounts, rotate shared secrets, review the access matrix. It does not cover the agents the leaving engineer deployed. Those agents are still running, still authenticated, still touching customer data, and still nobody’s job — the engineer who built them has gone, the team that inherited them did not know they existed, and the next quarterly access review does not include them because they were never on the access matrix in the first place.

The fix is that the quarterly access review includes agents. When the agent’s owner leaves, the agent is rotated, retired, or transferred — same-day, same playbook as a human offboarding, with explicit acceptance from whichever team is inheriting. The rotation step is non-trivial; a transferred agent should get a new identity rather than carrying its previous credentials. Most teams I have seen handling this manually were within one engineer-departure of an audit finding.

The procurement filter — what to ask the next control-tower vendor

Every major vendor is selling some version of this in 2026. ServiceNow shipped the keynote. Microsoft Agent 365 Runtime Protection is in public preview with a Q3 general-availability target. Datadog and the rest of the observability incumbents are queued for the next product cycle. The free-for-a-year offer from incumbents is the new market-shaping move; treat it as an option to test, not a commitment to adopt, because the lock-in risk is in the integration layer, not the price tag.

The four questions to ask in any control-tower procurement, in order. One: does the product help me build the inventory, or does it assume I already have one? Inventory tooling is the under-marketed part of the category and often the most useful. Two: how does the product attach to my identity stack? If the answer is “we have our own user database,” you are buying a parallel IAM — expensive in the long run, painful in audit. Three: what does the kill action actually do mechanically? Revoke a token at the gateway is fine; ask for the runbook the vendor uses for their own internal incidents. Four: what is the integration commitment to leave? If switching off requires a six-month migration to a different control plane, you have bought a moat for the vendor at your expense.

Risks and what to avoid

Don’t buy the kill switch before the inventory. The product is downstream of the gates. A control-tower seat without an inventory and a privilege model is a more attractive incident log. It is not an incident reducer.

Don’t treat the free year as the procurement decision. Free-for-twelve-months is structural, not generous. The lock-in is in the data model and the integrations, not the licence fee. Test it as an option; do not adopt as a default.

Don’t conflate the kill switch with risk management. The kill switch handles the moment of incident. Risk management handles the year before the incident and the year after. The kill switch by itself reduces blast radius after the alert fires; it does not reduce the probability of the alert firing.

Don’t expect the agent vendors to fix the bug class. The PocketOS configuration was a feature being used as designed — an agent with the credentials it was given, doing the task it was asked to do, with no review gate. The fix is your architecture, not the agent vendor’s next release.

What good looks like — one quarter from now

Every AI agent running in production appears on a single inventory page, with owner, identity, secrets reachable, and data classes touched. The privilege-scoring matrix has been applied to every row; the high-privilege tier is documented separately, with explicit rollback procedures and explicit human-in-the-loop gates. Every agent action lands in the SIEM under the agent’s named identity. The kill-path playbook fits on one page, has been tabletop-tested at least once, and the engineers on-call know which button to press. The quarterly access review includes agents alongside humans, and offboarding rotates or retires every agent owned by a departing engineer the same day. A control-tower product is in evaluation against the four procurement questions, not on a renewal cycle inherited from an enthusiastic pilot. The CISO can answer, in writing, the question “which agents in our stack have write access to production and run on a schedule without human review,” in under five minutes. Most CISOs cannot today. The ones who can are the ones who will not feature in the 2027 PocketOS-shaped incident write-ups.

Final thought

The PocketOS story will be the canonical 2026 reference incident in the same way Olive AI was for healthcare AI and Knight Capital was for trading systems. The lesson is identical: production-grade autonomy without a kill path is not a deployment, it is a liability with uptime. The kill switch is the procurement question; the five gates are the answer. Build the inventory. Score the privileges. Pre-write the playbook. Then, and only then, go shopping for a control tower. The teams that do that work this quarter will not be the next PocketOS. The teams that buy the kill switch first will be reading their own postmortems with a slightly nicer dashboard.

How many AI agents are running in your production stack right now?

Indica Tech’s two-week agent readiness audit produces the inventory, scores every agent against the three-question privilege matrix, drafts the kill-path playbook tailored to your IAM stack, runs the SIEM logging gap analysis, and gives you a 90-day remediation roadmap aligned to your existing tooling. Fixed price £3,500. Written report. Whether you hire us for the remediation or not.

See the audit engagement →