SKIP TO CONTENT
shaungehring.com
UPTIME 29Y 10M 14DLAT 35.2271°NLON 80.8431°W
SYS ONLINEMODE PUBLIC
/ HOME/ BLOG/ Leadership
#LEADERSHIPJUNE 26, 2026·5 min READPUBLISHED

84% of Your AI Engineers Are Building Bumpers, Not Cars. The Guardrail Tax Is the Real Story..

The headline from Sinch's AI Production Paradox report was the 74% rollback rate. The number buried two pages deeper is the one that matters: 84% of AI engineering teams spend at least half their time on safety infrastructure rather than building features.

SG
Shaun Gehring
PRINCIPAL · AI & SYSTEMS CONSULTING

84% of Your AI Engineers Are Building Bumpers, Not Cars. The Guardrail Tax Is the Real Story.

The headline from Sinch's May 13 AI Production Paradox report was the 74% rollback rate. Fair enough — punchy, sounds dire. But high rollback rates in mature orgs are a smoke detector working, not a system failing.

The number buried two pages deeper is the one I actually want to talk about. 84% of AI engineering teams said they spend at least half their time on safety infrastructure rather than building new features. Enterprises now invest more in trust, security, and compliance for AI (76%) than they invest in AI development itself (63%) — making governance the single largest line item in their AI program.

Strip the marketing language off that. Your AI team is mostly a safety team now. The headcount you fought to hire for "AI product engineering" is mostly writing eval suites, prompt filters, drift monitors, and audit hooks. That's not a deployment problem. That's a talent allocation problem nobody's putting on the agenda.

The Same Five Things, at Every Company in the Country

The pitch deck for hiring senior ML engineers in 2024 was beautiful: you'd build agents, design tools, pick models, shape product behavior, work at the frontier.

The reality in 2026 is you're building the same five things over and over at every company in the country: a prompt-injection filter, an output sampler, a guardrail-rule engine, a fallback handler for when the model returns garbage, and a dashboard that tells leadership whether any of it caught anything. None of these are interesting problems. All are necessary. Most should be commodities, but they aren't yet — so every team shipping an agent pays for a custom version of the same plumbing.

That's the guardrail tax, and it has the shape of every other infrastructure tax in software history. Twenty years ago every web team had a custom auth system. Then Auth0 and Okta showed up and you stopped writing it. Fifteen years ago every team had a custom logging pipeline; then Datadog and Splunk happened. Five years ago everyone rolled their own feature flags; LaunchDarkly happened. Undifferentiated heavy lifting starts as an in-house obligation, becomes a vendor product, then becomes part of the assumed stack. Right now, AI safety infrastructure is in the in-house-obligation phase. And it is eating senior engineers alive.

Count the Cards on Your Sprint Board

If you're the EM, here's the test: count the cards. How many are a new agent behavior, a new tool integration, a new user-facing capability? Now count how many are a new eval, a new filter, a new compliance check, a new escalation path. If the ratio is anywhere near 50/50, congratulations — Sinch found you.

Three things developers feel here that don't show up in the survey numbers.

The work isn't promotable. Eval harnesses don't get demoed at all-hands. Prompt filters don't get a ship announcement. What gets you promoted is the agent the company brags about — not the boring infrastructure that kept it from embarrassing you. Senior engineers read the room, and the ones doing safety work notice the visibility is going to the people who took the feature stories.

The work doesn't accumulate. A logging pipeline you build today still works in three years. A prompt filter for Claude 4.7 might need a full rewrite when Claude 5 ships — different failure modes, different injection surface. You're doing high-skill work on artifacts with a six-month half-life.

The work isn't differentiating. Every team builds the same eval framework, the same LLM-judge content filter, the same drift dashboard. None of it is your product. None of it is a moat. It should be a procurement decision, not a build decision — and the procurement option mostly doesn't exist yet, which is why you're stuck building it.

Reallocate Before the Vendors Arrive, Not After

Here's the take nobody at the leadership level wants to say out loud: the guardrail tax is a signal that AI agents shipped before the infrastructure for running them was ready, and the bill is being paid by engineering payroll, not by vendors. In any other category we'd call this a market failure — the platform layer is missing, so every customer is reinventing the platform privately, in silence.

Watch the next twelve months. A small number of vendors will productize what every AI team is hand-rolling — Guardrails AI, Galileo, LangSmith, Helicone — plus the incumbents folding AI instrumentation into the existing observability stack. When that wave finishes, the 84% number drops. Not because AI got safer, but because the safety work got commoditized and you stopped paying senior engineers to write it.

So the question to run into your one-on-ones: what work is my team doing that some vendor is about to do better, cheaper, and as a checkbox? And then: who should be on the differentiated thing — agent design, eval design, product surface — and who's on the commodity thing knowing it's a tactical role that gets reorganized out in a year? Because here's the trap, and I'm in it too on the platform-strategy side: when the commodity wave lands, the people who spent two years building bespoke filters are holding skills that don't transfer to the work that's left. You can't get good at the work that's left by spending all your time on the work that's leaving. Talent reallocation has to happen before the vendors arrive, not in the panic after.

Name the tax. Then decide whether you're going to keep building bumpers or start buying them. Both are valid answers. Pretending you're not making the choice is the only bad one.


Sources: Sinch research reveals 74% of enterprises have rolled back live AI agents | Sinch · The AI Production Paradox | Sinch · The AI Production Paradox: Findings From 2,500+ AI Leaders | Sinch Blog · AI agents fall short of expectations, Sinch research reveals | Capacity

// CROSS_REFERENCE

Adjacent signals.

← ALL POSTS