Beyond the Token: Google's Per-Minute Pricing and the Disruption of Real-Time AI Economics

The token has been the unit of account for AI inference since the first public OpenAI APIs launched in 2020. Every pricing page, every cost model, every engineering estimate in the industry has been denominated in tokens per million. In 2026, Google disrupted that convention with the Gemini Live API, priced not at the token level but at $0.005 per minute of audio interaction. This is not a minor pricing variant — it is a structural challenge to the assumptions that underpin every real-time AI application budget. Understanding when per-minute pricing is economically superior to per-token pricing, and when it is not, is now a required competency for any engineering leader deploying AI at scale.

Read More

The Infinite Spend Bug: Recursive Agent Loops and the Metered Future of Agentic AI

A software bug that causes infinite recursion terminates with a stack overflow. A token bug — an agentic AI loop that recurses without a termination condition — terminates with a billing invoice. In 2026, as autonomous agents displaced simple chat completions as the primary AI interaction pattern, organisations discovered that the economics of agentic systems are fundamentally different from those of single-shot inference. Anthropic’s decision to move Claude agents onto metered billing across all subscription tiers was not a product update. It was a signal that the industry has reached an inflection point where agent economics require the same governance discipline as cloud infrastructure.

Read More

Open Source in Regulated Environments: Migration Summary and Checklist

This series connected a simple strategic idea—reducing lock‑in and increasing control—to how modern EU rules ask you to prove security, resilience, data respect, and product discipline. Open source is a means, not a certificate. The outcome regulators care about is operational: defensible governance, tested recovery, traceable software, and honest data handling.

Read More

The Context Tax: Quadratic Cost Scaling and the $6M Healthcare RAG Overrun

The price sheet for a frontier language model lists a flat rate per million tokens. It does not list the compounding cost effect that emerges when those tokens are assembled into long-context requests. This distinction — between the unit price of a token and the system cost of a context window — is where the most dangerous budget surprises of 2026 are occurring. A healthcare organisation recently discovered this gap at the scale of a $6 million overrun in a production retrieval-augmented generation (RAG) pipeline. Understanding what happened requires reasoning about the physics of context before you can reason about the economics.

Read More

EU Data Act: Switching, Lock-In, and Why Open Architectures Help

The EU Data Act introduces rules on access to and use of data, contractual fairness, switching between data processing services, and interoperability in several contexts (including Internet of Things and B2B data sharing scenarios per its scope). It is explicitly aimed at reducing lock‑in and unfair contractual imbalance—not identical to GDPR, but complementary when you negotiate cloud and platform contracts.

Read More

The Great Token Burn: How Uber Exhausted Its 2026 AI Budget by May

In early 2026, Uber’s CTO Praveen Neppalli Naga disclosed something that should have landed harder than it did: his engineering organisation had burned through its entire annual AI budget before the calendar reached May. The culprit was not a rogue data science project or an experimental LLM fine-tune. It was a coding assistant — specifically, Anthropic’s Claude Code — running at roughly $2,000 per engineer per month across a team large enough to make that number catastrophic at scale. This is the first instalment of an eight-part series on the 2026 AI Token Economy: what it costs, why costs escape, and how to govern them before your own budget evaporates.

Read More

EU Cyber Resilience Act: Products, Updates, and Open Components

The Cyber Resilience Act (CRA) targets products with digital elements placed on the EU market: secure by design, vulnerability handling, and transparency obligations that flow through manufacturers, importers, and distributors. If you ship hardware or software as a product—or embed connectivity in what you sell—CRA logic eventually touches how you build, update, and disclose flaws.

Read More

EU AI Act: Open Models, Governance, and Deployment Reality

The EU AI Act regulates how AI systems are placed on the market and used, with escalating obligations for high‑risk categories, transparency for certain general‑purpose and consumer‑facing cases, and governance expectations that land on deployers as well as providers. Open‑source weights or code do not automatically exempt a real‑world deployment from duties once the system is part of a product or business process.

Read More