
Playbook
Tasteful Skills
“Tasteful Skills” argues that the best agent skills are not documentation or best-practice lists.
Pinecone basically admitted vector search is no longer enough — Nate’s headline point is that Pinecone’s new Nexus + NoQL product reframes retrieval around “operating context,” not just semantic similarity, because agents need bundles like customer record + policy + history, not three vaguely relevant chunks.
The real bottleneck is agent memory, not model intelligence — He says agents waste huge amounts of compute rediscovering what they should already know, citing Pinecone’s claim that up to 85% of agent compute can go to this kind of repeated retrieval and re-summarization.
Different work needs different memory shapes — The video’s core framework is that FAQs may work with chunks, but contracts need document structure, SAP-style enterprise systems need governed tables and semantic layers, and dependency-style reasoning often needs graphs.
Page Index’s bet is that chunking can destroy meaning — Their tree-based document retrieval avoids embeddings entirely, preserves hierarchy like sections and schedules, and they claim 98.7% on FinanceBench by matching retrieval to the structure of filings instead of flattening them into vectors.
SAP’s €1B+ moves show enterprise AI is really a knowledge infrastructure story — Nate points to SAP’s acquisitions of Data (Dreamio) and Prior Labs, arguing the important enterprise knowledge often lives in ERP, CRM, and governed tables, where “index a PDF and answer from a paragraph” is the wrong abstraction.
His practical advice is to design the retrieval contract before buying infra — Instead of choosing Pinecone, Weaviate, Neo4j, or Chroma first, he recommends defining exactly what bundle an agent must receive, then selecting the primitives — vector search, doc trees, semantic layers, tabular models, graphs — that can reliably deliver it.
Nate opens on the irony: Pinecone, a vector database company, just shipped something that basically says vector search alone is insufficient. He ties that to a broader market signal — SAP spending over €1 billion on AI infrastructure, Google pushing knowledge architecture at Cloud Next, Cloudflare launching memory for agents, and Microsoft leaning into graphs — all signs that the industry now sees memory as the real agent problem.
He draws a clean line between classic RAG and agent workflows. Chatbots can survive on “find me three relevant chunks,” but agents doing support, contract review, or finance work need assembled context: customer record, policy, entitlements, prior tickets, definitions, exceptions, and permissions. Otherwise they burn tokens re-reading, re-summarizing, and re-asking for facts the system already had.
Pinecone’s Nexus and NoQL are presented as an attempt to upgrade retrieval into something richer than similarity search. Nate says the key idea is that retrieval now has to carry intent, filters, provenance, access policy, response shape, confidence, and budget — basically the conditions that make an answer usable for action, not just textually relevant.
Then he moves to Page Index, which makes the sharper claim: many documents should never be chunked at all. His examples are vivid — in filings, a note to financial statements is not the same thing as a narrative summary, and in contracts, a definition 40 pages away can completely change the meaning of a clause that looks relevant in isolation. Their answer is a hierarchical document tree with summaries at each node, no embeddings, and a claimed 98.7% on FinanceBench.
Nate uses SAP to show that enterprise memory often isn’t prose in the first place. He highlights Dreamio’s lakehouse, semantic layer, federation, permissions, and lineage, plus Prior Labs’ tabular foundation models like TabPFN, arguing that ERP, CRM, and governed tables are the real source of truth for many agent actions. His point lands hard: when a procurement agent uses the wrong source, the cost isn’t a bad paragraph — it’s real money out the door.
After prose, long documents, and tables, he adds graphs as the fourth major shape. Microsoft’s GraphRAG is his example: expensive, imperfect, and prone to staleness, but still necessary when the work is inherently relational — suppliers tied to shipments, incidents tied to root causes, customers sharing failure patterns.
He answers the obvious objection directly: no, you can’t just dump everything into a giant context window and move on. Citing Chroma’s research on “context rot,” he says large windows don’t decide authority, preserve hierarchy, enforce permissions, or distinguish user-confirmed memory from model inference; production agents need appropriate context, not maximum context.
Nate closes with a practical framework: first define the contract your agent needs with data, then write down the exact bundle it must receive, then choose the primitives that deliver that bundle. He also warns against overbuilding — a help center bot does not need GraphRAG plus document trees plus semantic layers — and says the cheapest place to learn is your own agent logs: watch retrieval calls, repeat source openings, wasted tokens, and how often the agent asks for data the system already has.
Share
Keep Reading
The Weekly Echo. The inbox-shaped summary of what mattered.
New editorials announced here.

Playbook
“Tasteful Skills” argues that the best agent skills are not documentation or best-practice lists.

Playbook
Learn how tasteful prompting helps you move beyond generic AI output by shaping context, style, and judgment from the start.

Playbook
OpenAI shipped /goal for the Codex CLI. It turns a prompt into a persisted, self-continuing contract.