Bridging Intent Ambiguity: Programmable Intelligence Over General Intelligence
The most profound question raised by the current AI landscape isn't how to build more intelligent models, but how we instruct the ones we have. This conversation with Omar Khattab, creator of DSPy, reveals a critical bottleneck: the gap between human intent and machine execution. While the allure of ever-larger models promises AGI, Khattab argues for "Artificial Programmable Intelligence," a paradigm shift that treats AI not as an oracle, but as a reliable, composable infrastructure layer. This insight matters for engineers, product managers, and anyone building with AI, because it offers a path to predictable, maintainable systems rather than unpredictable prompt-and-pray heuristics. Understanding this distinction provides a significant advantage in building robust AI applications that scale beyond mere capability.
The Illusion of Intelligence: Why Bigger Isn't Always Better
The prevailing narrative in AI development often centers on scaling laws: bigger models, more data, and, inevitably, AGI. However, Omar Khattab challenges this assumption, positing that raw intelligence is only one piece of a much larger puzzle. The real problem, he argues, lies in specification: how do we effectively communicate our complex desires to these increasingly powerful, yet fundamentally literal, machines? The "bitter lesson" in AI, as he invokes it, is that hard-coding decisions based on our current, limited understanding of intelligence tends to be regretted when simpler, more general methods that leverage scale emerge.
Instead of chasing an elusive AGI, Khattab advocates for "Artificial Programmable Intelligence." This reframing shifts the focus from what AI can do to how we can reliably direct it. The current trajectory, he observes, already reflects this shift. Frontier AI labs are moving beyond the "old playbook" of just scaling parameters and pre-training data. The massive emphasis on post-training pipelines, retrieval augmentation, tool use, and agent training all signal a recognition that raw capabilities are insufficient. The real challenge is building systems, not just models.
"The idea that scaling model parameters and scaling just pre training data is all you need it exists nowhere anymore nobody thinks that actually people deny they ever thought that at this point and now you see this massively human designed and very carefully constructed pipelines for post training where we really encode a lot of the things we want to do you see massive emphasis on retrieval and web search and tool use and agent training"
This fundamental insight highlights a critical consequence: an over-reliance on model capabilities without a robust specification layer leads to systems that are brittle and unpredictable. The competitive advantage lies not in having the "smartest" model, but in having the most effective way to harness its intelligence for specific, complex tasks. This requires moving beyond the limitations of natural language, which is too ambiguous, and traditional code, which is too rigid, to find a middle ground.
Bridging the Gap: From Assembly to AI's C
The core of Khattab's argument, and the innovation behind DSPy, lies in creating a new abstraction layer for AI intent. He draws a powerful analogy to the evolution of programming: the jump from assembly language to C. Assembly is low-level, powerful, but incredibly difficult to work with. C provided a higher-level abstraction that allowed developers to build complex software more efficiently and portably, without needing to manage every minute detail of the underlying hardware. Similarly, Khattab argues, we need an abstraction that bridges the gap between the raw, often inscrutable output of Large Language Models (LLMs) and the precise intent of human users.
This is where the concept of "signatures" in DSPy becomes critical. Signatures are not just prompts; they are formal declarations of intent, akin to function signatures in programming languages. They define the expected inputs and outputs, allowing for modularity, composability, and a degree of formality that natural language alone cannot provide. This structured approach decouples the user's intent from the specific LLM implementation, making AI systems more maintainable, portable, and understandable.
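To make this concrete, here is a minimal sketch of a DSPy signature and a predictor built from it. The provider and model strings are illustrative assumptions, and configuration details may differ across DSPy versions:

```python
import dspy

# Illustrative model configuration; the provider/model string is an assumption.
dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))

class AnswerQuestion(dspy.Signature):
    """Answer the question using the provided context."""
    context = dspy.InputField(desc="background passages")
    question = dspy.InputField()
    answer = dspy.OutputField(desc="a short, factual answer")

# The signature declares the "what"; the module supplies the "how".
qa = dspy.Predict(AnswerQuestion)
result = qa(context="DSPy was created by Omar Khattab.",
            question="Who created DSPy?")
print(result.answer)
```

Note that nothing in the signature mentions a specific model or prompt format; the same declaration can be reused as the underlying LM changes.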
"The models keep getting better but in any case they keep changing and the thing you want to build changes a lot more slowly there is actually a conceptual separation between what you were trying to build and llms and you know vision language models all that like that space is basically separate and so if what if we could try to capture your intent in some kind of purer form"
The consequence of not adopting such a structured approach is that AI development remains tethered to the whims of model updates and the painstaking art of prompt engineering. This creates a fragile foundation for building critical systems. By contrast, embracing a structured, programmable approach to AI intent allows for the development of AI software engineering practices that mirror the robustness and maintainability we expect from traditional software. This is where delayed payoffs create significant competitive advantage; the initial discomfort of learning a new abstraction pays dividends in the long run through increased reliability and reduced maintenance overhead.
The Systemic Advantage: Beyond Individual Components
Khattab emphasizes that AI development is not just about individual models but about building cohesive systems. This systemic thinking is crucial for understanding the downstream effects of design choices. For instance, treating an LLM as a black box oracle, to be prompted and prayed over, creates a system that is inherently difficult to reason about or compose. This is where conventional wisdom fails: it focuses on the immediate capability of the model, neglecting the long-term implications of how that capability is integrated into a larger workflow.
DSPy, by introducing formal abstractions like signatures, modules, and optimizers, enables a more systematic approach. Signatures define the "what," modules handle specific inference strategies (like chain-of-thought or tool use) in a decoupled manner, and optimizers refine the entire program. This layered approach allows developers to manage complexity, isolate ambiguity at specific "joints," and build AI applications that are as robust and maintainable as traditional software.
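As a rough sketch of how these layers compose (assuming DSPy's documented `Module`, `ChainOfThought`, and `BootstrapFewShot` APIs; the metric and training data here are toy assumptions):

```python
import dspy
from dspy.teleprompt import BootstrapFewShot

class GenerateAnswer(dspy.Signature):
    """Answer questions with short factoid answers."""
    question = dspy.InputField()
    answer = dspy.OutputField()

class QAProgram(dspy.Module):
    def __init__(self):
        super().__init__()
        # The inference strategy is decoupled: swapping ChainOfThought for
        # dspy.Predict or a tool-using module leaves the signature untouched.
        self.generate = dspy.ChainOfThought(GenerateAnswer)

    def forward(self, question):
        return self.generate(question=question)

def exact_match(example, pred, trace=None):
    # Toy metric: does the prediction match the labeled answer?
    return example.answer.lower() == pred.answer.lower()

# A single toy training example; real programs would use a larger set.
trainset = [dspy.Example(question="Who created DSPy?",
                         answer="Omar Khattab").with_inputs("question")]

# The optimizer refines the whole program (e.g., prompts and demonstrations)
# against the metric, rather than hand-tuning each prompt in isolation.
optimizer = BootstrapFewShot(metric=exact_match)
compiled_qa = optimizer.compile(QAProgram(), trainset=trainset)
```

The point of the sketch is the separation of concerns: the signature carries intent, the module carries strategy, and the optimizer improves the assembled program as a whole.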
"The thing i'm trying to say is i want them to express their intent that is less model specific and not worry that they're losing or leaving a lot on the table honestly this is where like you changed the way that i think about this whole thing"
The systemic advantage here is profound. By treating AI interactions as formal software systems, we gain the benefits of modularity, testability, and maintainability. This means that as underlying models improve, the core logic of the application, expressed through DSPy's abstractions, remains stable. This resilience is a significant competitive moat. It allows organizations to iterate on their AI applications with confidence, knowing that their core intent is captured in a way that is less susceptible to the rapid, often unpredictable, evolution of LLM capabilities. The effort invested in building these structured systems now pays off by enabling faster, more reliable deployment of future AI advancements.
Key Action Items
- Embrace Programmable Intent: Shift focus from prompt engineering to defining formal specifications for AI interactions. This involves understanding and adopting structured abstractions.
  - Immediate Action: Explore DSPy or similar frameworks to understand the concept of signatures and modules.
- Decouple Intent from Model: Recognize that your application's core logic should be independent of the specific LLM used. This allows for easier model upgrades and experimentation (see the sketch after this list).
  - This pays off in 6-12 months: As models evolve, your core system logic remains stable, enabling faster adoption of new capabilities.
- Adopt a Systems-Level View: Design AI applications as interconnected systems rather than isolated model calls. This includes considering how different components (retrieval, reasoning, generation) interact.
  - Over the next quarter: Map out the intended system architecture for your next AI feature, identifying distinct functional modules.
- Invest in Formalism: Understand that while natural language is a component, it cannot be the sole interface for complex AI systems. Formal structures are necessary for reliability and maintainability.
  - This pays off in 12-18 months: Building with formal abstractions leads to more robust, debuggable, and scalable AI applications.
- Prioritize Maintainability over Raw Capability: When evaluating AI solutions, consider their long-term maintainability and composability, not just their immediate performance on a benchmark.
  - Immediate Action: When building a new AI feature, ask: "How will we update and maintain this in 18 months?"
- Isolate Ambiguity: Use structured abstractions to isolate areas of inherent ambiguity (e.g., natural language descriptions of complex tasks) from areas requiring formal, deterministic logic.
  - Over the next quarter: Identify a complex task you're automating and break it down into formal components and fuzzy components.
- Build for Evolution, Not Stasis: Recognize that the AI landscape is rapidly changing. Building with abstractions that are designed to evolve alongside model capabilities ensures long-term viability.
  - This pays off in 12-24 months: Organizations that adopt these evolving abstractions will be better positioned to integrate future AI breakthroughs without massive rewrites.
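To make the decoupling point above concrete, here is a minimal sketch using DSPy's string-signature shorthand. The provider and model strings are illustrative assumptions; the key idea is that the program itself never changes when the backend does:

```python
import dspy

# The program's core intent, expressed once in a model-agnostic form.
classify = dspy.Predict("ticket_text -> category")

# Backend A (illustrative model string).
dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))
print(classify(ticket_text="My invoice total is wrong.").category)

# Backend B: upgrade or swap providers; the program is untouched.
dspy.configure(lm=dspy.LM("anthropic/claude-3-5-sonnet-20241022"))
print(classify(ticket_text="My invoice total is wrong.").category)
```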