Blog

Writing about AI agent systems, gaming technology, and building products at the intersection of AI and entertainment. Drawn from 25+ years of founding companies, shipping games, and working with emerging tech.

16 June 2026 · 19 min read

Frontier AI Performance at Home, Without the API Bill

Maestro fuses the model CLIs already on your machine into one answer that beats any of them alone. It is the same mixture-of-agents method OpenRouter's Fusion API benchmarked at frontier level for about half the cost, except it runs on the flat-rate subscriptions you already pay for instead of a metered API bill.

AI agents Maestro fusion cost local-first

3 June 2026 · 37 min read

Why I Stopped Using Multi-Agent Frameworks (And What Replaced Them)

After 18 months of building with AutoGen, CrewAI, and LangGraph, I replaced all of them with explicit decomposition and deterministic orchestration. Here is why, and what to use instead.

AI agents multi-agent frameworks architecture opinion

20 May 2026 · 33 min read

Prompt Engineering Is Dead. Context Engineering Is the Job Now.

Clever prompt wording stopped being the bottleneck once models got good at following instructions. The real work moved to engineering what goes into the context window.

AI LLM context-engineering prompt-engineering agents research

6 May 2026 · 31 min read

What 18 Months of Production AI Agents Actually Taught Me

Five things that broke, three that worked unexpectedly, and the cost arc that surprised me. A first-person retrospective from running AI agents in production for 18 months.

AI agents production lessons retrospective research orchestration

22 April 2026 · 32 min read

The Automation Paradox: Why AI Agents Create More Work Before They Create Less

Teams deploying AI agents see a productivity dip before gains materialize. The supervision tax, the correction loop, and the trust calibration period are predictable, manageable phases, not signs of failure.

AI agents automation productivity research management

15 April 2026 · 13 min read

AI Is Running Out of Power, Data, and Quality All at Once

AI data centers now consume more electricity than Japan. Human training data may already be exhausted. Models are measurably degrading. These three crises are connected, and fixing one makes the others worse. Here's the research, and what might actually help.

AI infrastructure energy data centers model collapse scaling LLM research

8 April 2026 · 15 min read

1-Bit LLMs Could Make GPUs Obsolete (And Why Nobody Is Betting on It Yet)

Microsoft's BitNet matches full-precision LLMs while using one-twelfth the energy, running ternary weights on a CPU. If 1-bit LLMs scale, NVIDIA's $3 trillion valuation is built on an assumption that may not hold. Here's why AMD and Intel aren't rushing in.

AI LLM hardware NVIDIA AMD Intel research 1-bit BitNet inference

3 April 2026 · 15 min read

Why Your Multi-Agent AI System Keeps Failing (And What the Research Actually Says)

Multi-agent LLM systems fail 41-87% of the time, and 79% of those failures come from coordination, not capability. Here is what the research says about building systems that actually work.

AI multi-agent systems LLM orchestration agents research open source

13 March 2026 · 9 min read

AI Agents Need Governance, Not Guardrails

For practitioners and engineering leads evaluating governance approaches: SDK wrappers are in-process guardrails agents can bypass. Infrastructure-level governance, where agents never hold API keys, is the only enforceable approach. Here's why.

AI technology agents governance LLM infrastructure

28 January 2026 · 7 min read

We Were Live Streaming Before Twitch Existed

In 2008, Nihilum partnered with Xfire and Dyyno to live stream World of Warcraft raids to 2,800 concurrent viewers, years before Twitch launched. Here's what happened.

streaming history gaming World of Warcraft esports