Most Companies Are Deploying AI Like a Feature. That's Why the Bills Are Shocking.

Last week, developers and operators across the industry got an expensive lesson in what happens when you treat AI like a plug-in instead of an operational system.

What happened?

Teams testing Claude Code, Anthropic's AI coding assistant, watched their budgets drain in minutes. Automated workflows were silently retrying failed calls and compounding the damage. The culprit was a bug in the caching layer that drove costs 10 to 20 times higher than they should have been. Real money, gone fast.

But honestly, the bug wasn't the story. The story is why so many teams were exposed to it in the first place.

This wasn't a freak event. We've seen this pattern before and I'll be direct about it: it's what happens when you deploy AI like a feature instead of an operational system.


The problem isn't the AI. It's how it's being implemented.

I talk all the time to upstream firms that are in the early stages of deploying AI. The approach is almost always the same: pick a model, wire it into a workflow, ship it. It works great in demos. It holds up in pilots. And then it hits production with real document volumes, real data complexity, real users, and the economics fall apart fast.

Here's what most people don't account for. Modern AI agents don't just generate answers. They scan, search, maintain context across long conversations, call tools, interpret results, and call more tools. A lot of consumption happens before you ever see any output. In a general purpose app that's manageable. In upstream oil and gas, with thousands of well files, multi-system data sources, dense regulatory documents and multi-step engineering workflows, the AI throughput exposure is orders of magnitude higher.

Think about a routine production variance analysis. An agent might touch your GL extract, your prior month reports, your field notes, and your production database before it writes the first line of output. If nobody designed for that, you find out when the invoice shows up.
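As a rough illustration of how much consumption happens before the first line of output, here is a back-of-the-envelope tally for a workflow like the one above. Every number and source name here is hypothetical, not measured:

```python
# Hypothetical token tally for a production variance analysis agent.
# Source names and figures are illustrative only.
pre_output_reads = {
    "gl_extract": 12_000,          # general ledger lines pulled into context
    "prior_month_reports": 8_000,
    "field_notes": 5_000,
    "production_db_rows": 15_000,
}

output_tokens = 1_500  # the analysis the user actually sees

input_total = sum(pre_output_reads.values())
print(f"Input tokens consumed before any output: {input_total}")
print(f"Share of total tokens invisible to the user: "
      f"{input_total / (input_total + output_tokens):.0%}")
```

Even with made-up numbers, the shape of the problem is clear: the visible output is a small fraction of what the workflow actually consumes.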

And increasingly, I'm hearing that the invoice comes straight from the AI consultant, who bills AI throughput costs directly back to the firm as a line item. Every token your workflows consume lands on your bill. That's worth paying close attention to.


There's a better way to implement it. We've had the playbook for decades.

In upstream oil and gas we don't run complex systems on faith. We instrument them. We set authorization controls. We monitor performance in real time. We define what goes in, what comes out, and what happens at every step. We have always understood that throughput has a cost, and that unmanaged throughput is how projects blow their AFE.

The discipline we apply to production systems is exactly the discipline AI deployment requires. We just haven't applied it yet.

That's the reframe. AI is not a software feature you configure and walk away from. It's an operational system that consumes resources, needs monitoring, and rewards deliberate design. The firms that get this spend less, get more consistent outputs, and can actually scale. The ones that don't are making headlines right now.


What this actually looks like in practice

The principles here aren't complicated. They're just being skipped.

Structured inputs matter more than people think. The less you ask an agent to search around, the less it consumes. Scoped, pre-formatted inputs (a clean well header summary, a focused GL extract, a structured markdown spec) reduce AI throughput significantly compared to giving a model broad access and hoping it finds what it needs. Handing an agent a formatted well summary is fundamentally different from dropping a stack of raw AFE PDFs in front of it.
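A minimal sketch of the difference, with hypothetical field names: instead of dropping a full well record into context, pre-format only the fields the task actually needs.

```python
# Hypothetical well record; field names are illustrative.
well = {
    "api_number": "42-123-45678",
    "name": "Smith 1H",
    "county": "Midland",
    "spud_date": "2023-04-12",
    "lateral_length_ft": 9_850,
    "completion_notes": "pages of raw text the task does not need",
}

def scoped_well_summary(w: dict) -> str:
    """Pre-format only the fields a decline analysis needs,
    rather than handing the agent the full record."""
    keep = ("api_number", "name", "lateral_length_ft")
    return "\n".join(f"{k}: {w[k]}" for k in keep)

print(scoped_well_summary(well))
```

The agent gets three lines instead of the whole record, so it has nothing extra to scan, and nothing extra to bill you for.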

Break workflows into steps. I've seen teams try to run complex analysis as one large call and wonder why the costs and errors pile up. Decline curve first. Variance flag second. Recommendation third. Each step is cheaper, easier to audit, and a lot easier to fix when something breaks.
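The staging above might be sketched like this, with `call_model` standing in for whatever model API you actually use (stubbed here so the example is self-contained; all prompts and names are illustrative):

```python
def call_model(prompt: str) -> str:
    """Stub for a model call; swap in your actual API client."""
    return f"[model output for: {prompt[:40]}...]"

def decline_curve_step(production_summary: str) -> str:
    """Step 1: fit a decline curve from a narrow, scoped input."""
    return call_model(f"Fit a decline curve to this summary only:\n{production_summary}")

def variance_flag_step(decline_result: str) -> str:
    """Step 2: flag variances against the fit, nothing else."""
    return call_model(f"Flag variances against this fit only:\n{decline_result}")

def recommendation_step(variance_result: str) -> str:
    """Step 3: recommend actions for the flagged variances."""
    return call_model(f"Recommend actions for these flags only:\n{variance_result}")

# Each step is a separate, auditable call with a narrow input,
# rather than one large prompt carrying everything at once.
fit = decline_curve_step("Well Smith 1H: monthly oil volumes, Jan-Jun")
flags = variance_flag_step(fit)
actions = recommendation_step(flags)
```

When step two starts misbehaving, you rerun step two. With one monolithic call, you rerun, and pay for, everything.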

Scope your prompts explicitly. Simple instructions like "use only the provided production summary, do not query additional records unless instructed" have a real measurable effect on both cost and output consistency. Small instructions do serious work.
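A scoping instruction like that is just a line in the prompt; the point is to attach it to every call rather than hoping each prompt author remembers it. A sketch, with hypothetical names throughout:

```python
SCOPE_RULES = (
    "Use only the provided production summary. "
    "Do not query additional records unless explicitly instructed."
)

def build_prompt(task: str, production_summary: str) -> str:
    """Prepend the scope rules so every call carries the same constraint."""
    return (f"{SCOPE_RULES}\n\n"
            f"Task: {task}\n\n"
            f"Production summary:\n{production_summary}")

prompt = build_prompt(
    "Explain the month-over-month variance.",
    "Smith 1H: Jan 4,100 bbl; Feb 3,650 bbl",
)
print(prompt)
```

One small function, and the constraint becomes a property of the system instead of a habit of the person writing prompts.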

Track usage like you track production variances. AI throughput should be monitored continuously. When you see a spike, trace it to the workflow and fix it before it compounds. This isn't a technical detail. It's operational discipline, the same kind we apply everywhere else in this industry.
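Tracking usage the way you track production variances can start as simply as a per-workflow tally checked against a baseline. A minimal sketch, with hypothetical workflow names and thresholds:

```python
from collections import defaultdict

class TokenLedger:
    """Tally token usage per workflow and flag spikes against a baseline."""

    def __init__(self, baseline: dict[str, int], spike_factor: float = 2.0):
        self.baseline = baseline          # expected tokens per period
        self.spike_factor = spike_factor  # how far over baseline counts as a spike
        self.usage = defaultdict(int)

    def record(self, workflow: str, tokens: int) -> None:
        self.usage[workflow] += tokens

    def spikes(self) -> list[str]:
        """Workflows consuming more than spike_factor times their baseline."""
        return [w for w, used in self.usage.items()
                if used > self.spike_factor * self.baseline.get(w, 0)]

ledger = TokenLedger(baseline={"variance_analysis": 50_000})
ledger.record("variance_analysis", 40_000)
ledger.record("variance_analysis", 85_000)  # a silent retry loop, say
print(ledger.spikes())  # → ['variance_analysis']
```

The real version would feed from your API provider's usage metadata, but the discipline is the same either way: baseline, monitor, trace the spike to the workflow, fix it before it compounds.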


AI is not a panacea

I say this to every firm I talk to. AI is a powerful tool when deployed correctly for the right tasks. It is not magic. It doesn't replace domain expertise. And it will absolutely generate surprising costs if you hand it broad access to your systems without any controls around it.

The firms getting hurt right now aren't victims of bad AI. They deployed AI the way you'd deploy a SaaS subscription: set it up and assume it will handle itself.

In upstream oil and gas we've always known that complex systems don't manage themselves. That's just as true for AI as it is for a production facility.

The firms that win here won't necessarily be running the most powerful models. They'll be the ones running those models with the same operational discipline we've applied to everything else in this industry for decades. That playbook has been sitting right here the whole time. We just need to apply it.


Tauris-AI is an AI orchestration platform built for upstream oil and gas firms. If you're thinking through AI deployment for your operations, I'm happy to talk. Reach out at tauris.ai.