AI EngineerMay 30, 202610m

Why (Senior) Engineers Struggle to Build AI Agents — Philipp Schmid, Google DeepMind

TL;DR

Agents change the job from control to delegation — Schmid says traditional software engineers acted like traffic controllers, but with agents you’re a dispatcher who sets the destination and lets the model choose the route, even when the path looks strange.
Text is the new state layer — Instead of rigid booleans and flags, agents work on semantic context, like approving a deep research plan while adding “focus on the US market and ignore California” in one step.
Errors should be fed back into the loop, not restart it — If a 5- or 15-minute agent run fails, starting over wastes compute and loses context, so errors need to be treated like inputs the model can recover from.
Unit tests aren’t enough; reliability comes from evals — Because the same prompt can produce different outputs, the real question is not whether input A always yields C, but how often the agent succeeds well enough to trust in production.
APIs built for humans often fail agents — A method like delete item may feel obvious to backend engineers, but agents only see function schemas and docstrings, so tools need explicit, self-documenting semantic interfaces.
Build for replacement, not permanence — Schmid closes with the “bitter lesson” that agent software is disposable and will be rebuilt repeatedly as better models and agent patterns arrive.

The Breakdown

Senior engineers trip over AI agents because the mental model is backwards: you’re no longer a traffic controller scripting every move, you’re a dispatcher setting goals and recovering when the model takes weird-but-effective paths. Philipp Schmid from Google DeepMind lays out five concrete shifts—from treating text as state to replacing unit tests with evals—that explain why traditional software habits break down.