
Playbook
Tasteful Skills
“Tasteful Skills” argues that the best agent skills are not documentation or best-practice lists.
The real AI gatekeepers aren’t model companies — Nate argues that Cloudflare, Stripe, Okta, Oso, Snowflake, Datadog, AWS, and Vercel increasingly decide whether an agent can actually reach production because they control runtime, identity, data, payments, and observability.
Compute gets agents online, but control infrastructure gets them deployed — GPUs and data centers matter for serving models, but the harder production questions are operational: where an agent runs, what it remembers, who it acts for, what it can spend, and who can stop it.
Runtime is becoming a strategic control surface — Cloudflare’s Agents SDK uses durable objects with SQL, websockets, and scheduling; AWS Bedrock AgentCore bundles runtime, memory, identity, and observability; and Vercel’s AI Gateway turns routing, budgets, and monitoring into part of the stack.
Agent identity is the make-or-break problem inside enterprises — Nate highlights Oso, Okta, WorkOS, Microsoft Entra, and AWS AgentCore Identity as critical because agents operate with delegated authority across systems like Google, Slack, GitHub, and Salesforce, where fuzzy permissions become dangerous fast.
If an agent touches business data or money, governance is the product — Snowflake Cortex, Databricks Mosaic AI, and BigQuery/Gemini are trying to keep agents inside the company’s semantic layer, while Stripe, Visa, Mastercard, and Amex are racing to define trusted rails for agentic commerce.
Observability and kill switches are not optional extras — Datadog, LangSmith, Braintrust, Langfuse, and AWS are converging on ways to track agent work end-to-end, and Nate stresses that a real kill switch must exist across runtime, identity, gateway, framework, and payments—not just as a prompt that says “stop.”
Nate opens by flipping the usual assumption: the companies deciding whether your agent ships are often not OpenAI or Anthropic, but operators like Cloudflare, Stripe, Okta, Oso, Datadog, and Snowflake. His core point is simple but sharp: models are only one piece of the agent economy, while infrastructure companies increasingly control where agents run, what they know, what they can spend, and who can shut them down.
He quickly acknowledges the familiar AI infrastructure story—GPUs, power, data centers, networking, capex—but says that only answers whether AI can be served at scale. Once agents start doing real work, the bottleneck becomes governability: memory, approvals, authority, budgets, and intervention all have to be handled by infrastructure, not by the model itself.
Nate spends real time on the idea that agents need more than stateless prompt-response loops; they need memory, scheduled execution, recovery, streaming, and tool coordination. That’s why he spotlights Cloudflare’s durable objects, AWS Bedrock AgentCore, and Vercel AI Gateway as different versions of the same thesis: runtime itself is becoming a strategic place to control agent behavior.
This is where he brings in Oso, Okta, WorkOS, Entra, and AWS identity. A user logging into an app is one thing; an agent acting across Google, Slack, GitHub, Salesforce, and a RAG pipeline is another entirely, especially when approvals happen asynchronously and secrets can’t just be handed over forever. His phrase is memorable: the dangerous agent isn’t necessarily the most capable one, but the one with “very fuzzy authority.”
Nate then moves into Snowflake and Databricks, arguing that agents are only as good as the governed meaning of the data they touch. He lists the classic failure modes—wrong joins, stale docs, mistaken metrics, unauthorized retrieval—and frames Snowflake Cortex and Databricks Mosaic AI as attempts to keep agents inside the semantic layer where business truth about ARR, customers, churn, and forecasts actually lives.
Once agents can issue refunds, make purchases, or move money, the control problem becomes much more serious. Nate says Stripe is uniquely well positioned because it already sits in the middle of payments, fraud, disputes, billing, issuing, treasury, and merchant onboarding, while Visa, Mastercard, and Amex are making a different bet: proving that agent transactions can run through the same institutional trust chain as card payments.
He argues observability is badly underrated because agent failures often look valid on the surface: correct syntax, wrong tool, authorized data, wrong conclusion, or loops that quietly burn tokens. That’s where Datadog, LangSmith, Braintrust, Langfuse, and AWS come in, all trying in different ways to create one operational view that ties together traces, costs, tool calls, policies, and evals.
Near the end, Nate turns practical: pick one workflow—support refunds, claims, usage checks—and answer seven questions about runtime, identity, data, tools, payments, observability, and who can stop the system. He lands the point with a story from a data-team leader describing agents “hacking around” human permission structures, which captures the whole thesis: agents do not respect org charts, so your governance model has to compensate.
Share
Keep Reading
The Weekly Echo. The inbox-shaped summary of what mattered.
New editorials announced here.

Playbook
“Tasteful Skills” argues that the best agent skills are not documentation or best-practice lists.

Playbook
Learn how tasteful prompting helps you move beyond generic AI output by shaping context, style, and judgment from the start.

Playbook
OpenAI shipped /goal for the Codex CLI. It turns a prompt into a persisted, self-continuing contract.