Agent-Centric Search Redefines AI's Bottleneck As Retrieval

Original Title: Building Search for AI Agents with Exa CEO Will Bryk

The a16z Show · June 06, 2026 · Listen to Original Episode →

The shift from human-centric to agent-centric search isn’t just a technical upgrade--it’s a systemic rewiring of how knowledge flows through society. Will Bryk’s work at Exa reveals that the real bottleneck in AI progress isn’t intelligence, but retrieval: the ability to find, filter, and deliver perfect information at scale. This changes everything. The downstream effects? A world where decisions--from billion-dollar investments to personal connections--are no longer limited by fragmented data but powered by comprehensive, controllable search. Executives, builders, and investors who grasp this shift early gain a critical advantage: they’ll be the ones shaping the infrastructure of the agentic economy, not scrambling to catch up when it overtakes them.

Why the Obvious Fix--Smarter Models--Makes the Problem Worse

The prevailing assumption in AI development is that bigger, smarter models will eventually solve our information problems. But as Will Bryk points out, this is a dangerous illusion. The real issue isn’t intelligence--it’s access to the right information at the right time. When we rely on massive language models to hallucinate answers instead of retrieving verified facts, we’re not solving the problem. We’re outsourcing truth to statistical patterns trained on noisy, outdated, or biased data.

"A lot of knowledge work is actually a search problem, not an intelligence problem."

-- Will Bryk

This reframing flips the script on how we build AI systems. Instead of throwing more compute at larger models, the smarter move is to build better retrieval engines--tools that can surface accurate, comprehensive, and contextually relevant information so that smaller, more efficient models can act on it. This is the core insight behind Exa: that the future of AI doesn’t depend on ever-larger models, but on precise, scalable search.

The system responds accordingly. As more companies deploy AI agents for coding, recruiting, market research, and decision-making, the demand for perfect retrieval explodes. And here’s the kicker: every extra “nine” of reliability in search compounds across workflows. A 99.9% accurate search might miss one critical competitor in a thousand. For a human, that’s acceptable. For an agent making high-stakes investment decisions, it’s catastrophic. Over time, the gap between good-enough search and perfect search becomes a moat--one that separates functional automation from truly transformative intelligence.

How the System Routes Around Human-Centric Design

Google was built for humans. Its entire architecture--from ranking signals to UI--optimizes for clicks, speed, and immediate satisfaction. It assumes the user has limited attention, wants a quick answer, and will refine their query if needed. But agents don’t behave like humans. They have infinite patience, demand comprehensive results, and operate at machine speed. Trying to use a human-optimized search engine for agent workflows is like giving a racecar driver a bicycle: technically possible, but absurdly inefficient.

Bryk draws a sharp distinction:

"The world of agents searching is just completely different from human searching. An agent doesn’t just want 10 pieces of information--it wants everything."

-- Will Bryk

This isn’t a minor tweak. It’s a complete architectural overhaul. Agents need search engines that support complex, semantic queries; allow fine-grained filtering; return thousands or even tens of thousands of results; and do so with customizable latency. They don’t want summaries--they want raw, structured data they can analyze themselves.

The consequence? Google’s decades of click data--the crown jewel of its search dominance--becomes largely irrelevant. That data trained models to predict what humans click on, not what’s factually correct or comprehensive. For agents, relevance isn’t measured by engagement. It’s measured by completeness, accuracy, and utility in downstream tasks.

This creates a rare opening for startups. The moat that protected Google for 20 years--behavioral data--doesn’t apply in the agent era. Instead, the new advantage lies in building from first principles: indexing more of the web, training better retrieval models, and designing APIs that expose full control to agents. Exa’s small team (under 100 people) has been able to outperform Google in specific domains not because they’re smarter, but because they’re unburdened by legacy assumptions.

The 18-Month Payoff Nobody Wants to Wait For

Building a search engine for agents isn’t just technically hard--it’s psychologically difficult. The payoff is delayed, the metrics are unclear, and the path to perfection feels endless. Most teams would rather ship something fast than chase an ideal that may never arrive. But Bryk argues that this discomfort is precisely where advantage is created.

He recounts a thought experiment that became Exa’s founding principle: What if, for every query, you ran GPT-3 over every document on the web and filtered down to the best 10? The result would be better than Google. The cost? $10 billion per query. Impossible today--but it proved that better search was possible. The mission then became optimization: how to deliver that quality at a fraction of the cost.

This long-term bet is paying off. By focusing on retrieval efficiency, Exa helps AI companies reduce token usage by up to 20x. That’s not a marginal improvement--it’s a business-defining cost saving. And it only becomes visible after months of invisible engineering work: refining embeddings, optimizing vector databases, and training models to extract only the most relevant snippets.

The system reinforces this advantage. As more agents adopt Exa, they generate better feedback loops. Retrieval models improve. Costs drop further. Competitors who took the shortcut--relying on off-the-shelf search or generic APIs--find themselves stuck with bloated token bills and inaccurate results. They can’t catch up because they didn’t invest in the hard, unsexy work of perfect retrieval.

And here’s the deeper implication: this isn’t just about cost. It’s about capability. Smaller models, when paired with superior retrieval, can outperform larger ones. That means the future of AI isn’t a race to trillion-parameter models--it’s a race to build the most efficient, accurate, and scalable retrieval layer. The companies that win won’t be the ones with the smartest models. They’ll be the ones with the best search.

Where Immediate Pain Creates Lasting Moats

One of the most counterintuitive insights from the conversation is that the hardest problems in AI aren’t in model training--they’re in data access and indexing. The web is increasingly closed. Publishers fear being scraped. APIs restrict access. And vast amounts of useful information--company databases, private forums, satellite imagery, internal documentation--aren’t on the public web at all.

Bryk sees this not as a limitation, but as a frontier. The next wave of search won’t just index pages--it will “unearth the world’s data.” That means negotiating with content providers, building partnerships, and creating new economic models where value flows back to creators.

His vision? A trillion-dollar agentic economy where instead of $200 billion going to one search giant, $150 billion is redistributed to content providers. This isn’t idealism--it’s systems thinking. If agents are going to rely on high-quality information, the incentives must align to produce it. A search engine that enriches the ecosystem it depends on will outlast one that merely extracts from it.

"The agentic economy is going to be huge... There’s an opportunity for everyone to do amazingly."

-- Will Bryk

This reframes search as coordination infrastructure. It’s not just about finding answers--it’s about organizing human knowledge so that both people and agents can act on it. And in that light, nearly every unsolved problem--from political polarization to loneliness--becomes a search problem in disguise. Misinformation persists because accurate information is hard to find. People feel isolated because they can’t discover others who share their interests. These aren’t cultural failures. They’re retrieval failures.

The long-term payoff? A world where information is not just available, but perfect: comprehensive, controllable, and trustworthy. That’s not a feature. It’s a new foundation for civilization.

Key Action Items

Over the next quarter: Audit your AI workflows to identify where retrieval--not intelligence--is the bottleneck. Look for high token usage, frequent hallucinations, or reliance on outdated knowledge.
Within 6 months: Replace generic search APIs with specialized retrieval tools that support semantic queries, filtering, and large result sets. Prioritize providers that optimize for efficiency, not just relevance.
Flag: Discomfort now, advantage later--Invest in building or integrating a retrieval layer even if it slows initial development. The cost savings and accuracy gains compound over time.
This pays off in 12--18 months: Begin treating search as core infrastructure, not a utility. Design systems where small models delegate to powerful retrieval tools, reducing reliance on expensive LLM calls.
Start now: Rethink data partnerships. Instead of just scraping public sources, explore ways to collaborate with content providers to access deeper, fresher, or proprietary data.
Long-term (2+ years): Align your business model with the agentic economy. If you produce valuable information, position it as a retrieval asset. If you consume it, ensure your tools feed value back into the ecosystem.
Immediate: Test Exa or similar agent-first search tools in high-stakes workflows like coding, recruiting, or market intelligence. Measure improvements in accuracy, speed, and token efficiency.

More from The a16z Show

The Real AI Threat Is Status Collapse, Not Job Loss

Jun 09, 2026

AI won’t cause mass unemployment--it will collapse the career expectations of the upper middle class. The real disruption isn’t technological; it’s social and political, reshaping status, work, and identity for professionals in law, finance, and consulting.

View Episode Notes →

AI Coding Tools Are a Mirage -- Real Value Lies in Workflow Reinvention

Jun 04, 2026

AI’s first breakthrough isn’t in flashy apps--it’s in coding, where agentic tools quietly transformed software development. The real revolution? Not better models, but reimagined workflows that turn every company into a software shop, with AI as the invisible engine reshaping knowledge work from the ground up.

View Episode Notes →

PC AI Reckoning: Shedding Past for Forward-Looking Design

Jun 02, 2026

The PC's future lies in shedding backward compatibility to embrace AI-native design, unlocking efficiency and innovation beyond legacy constraints.

View Episode Notes →