AI Agents Unlock Potential With Server-Side Code Generation

Original Title: MCP on Code Mode (Interview)

The Changelog: Software Development, Open Source · May 15, 2026 · Listen to Original Episode →

Beyond Tool Calls: Unlocking AI's Potential with Code Mode and Server-Side Execution

This conversation reveals a critical, often overlooked, bottleneck in how we deploy AI agents: the over-reliance on explicit tool-calling and the limitations of context windows. Matt Carey from Cloudflare argues that by shifting from discrete function calls to generative code execution, and by bringing that execution server-side, we can dramatically expand the capabilities of AI agents. This insight is crucial for developers and architects building sophisticated AI systems, offering a path to leverage vast APIs and complex logic without the usual constraints. Reading this offers a strategic advantage in understanding the next wave of AI agent development, moving beyond simple command-and-control to true programmatic interaction.

The Hidden Cost of Discrete Tools

The initial paradigm for AI agents interacting with the world was through function calling. Developers would define discrete functions, essentially tools, that an AI could call with specific parameters. This approach, while functional, quickly hits a wall. As Matt Carey explains, "as you try to get these agents to do more and more things you add more and more tools and then at some point you start filling the initial context window of the model." This is a fundamental architectural problem. Even with models boasting large context windows, their performance degrades significantly beyond certain thresholds. For personal AI or agents designed to automate complex jobs, the sheer number of individual functions required quickly outstrips the available context.

"The models of yesteryear like gpt 4 they had much smaller context windows and so you were filling them very quickly and even now the foundational models the best ones even though they have maybe half a million tokens context window or 200k to a million for the normal ones they they do start losing power around the 50k mark like all of them and this is quite well documented and so you really don't want to be filling the context window too much"

This limitation means that agents are often restricted to a small, cherry-picked set of capabilities, hindering their ability to perform complex, multi-step tasks. The problem isn't just the number of tools, but the cognitive load and the sheer token count required to represent them. Imagine an agent trying to manage your entire cloud infrastructure; a few dozen tools would barely scratch the surface of the available APIs.

Code Mode: Shifting from "Calling" to "Writing"

The innovation presented here is a shift from explicitly calling tools to having the AI generate code that uses those tools. This is the essence of "Code Mode." As Carey notes, the idea is that "models should just write code." AI models are trained on vast amounts of code, so leveraging this strength by having them compose SDK calls or write scripts directly is a more natural and powerful interaction. This approach bypasses the context window limitations of tool definitions. Instead of listing dozens of tools, the model can generate a single piece of code that orchestrates multiple underlying functions. This is akin to a human developer writing a script to automate a task rather than manually executing each command.

The advantage here is profound. A single "code generation" capability can, in theory, access an entire API surface if the underlying SDK is well-defined. This moves beyond the constraints of a predefined toolset and allows the AI to dynamically compose solutions.

Server-Side Execution: Safety and Scale

The true game-changer, however, is executing this generated code server-side, specifically within Cloudflare's dynamic Workers. Traditionally, running AI-generated code locally on a developer's machine or even within the agent's environment presented significant security risks. Carey highlights this: "Traditionally people got very very scared when you say give me code and I'm going to execute it on my machine because there are so many different ways that you can mess someone up by doing that."

Cloudflare's dynamic Worker loader provides a highly secure, sandboxed environment. This V8 isolate-based execution allows for the safe, instantaneous execution of code generated by an AI. This means that an AI can write and execute complex logic against Cloudflare's entire API surface--all 2,500+ endpoints--within a single, thousand-token context.

"all of that code is executed super safely securely on dynamic workers on our server side and so your agent that calls the tool can just just write code and um yeah we'll get executed on server"

This server-side execution model offers several critical advantages:
* Reduced Context Window Strain: The AI doesn't need to be aware of every individual API endpoint. Instead, it interacts with a simplified interface, often just two tools: search and execute. The complexity of mapping these to the vast API surface is handled server-side.
* Enhanced Security: By executing code in a sandboxed environment, the risk of malicious or erroneous code execution is drastically reduced. Fetch requests can be strictly controlled, preventing unintended side effects.
* Scalability: Dynamic Workers are designed for massive scale, allowing for the execution of AI-generated code at global levels without the typical infrastructure overhead.
* Simplified Agent Development: Developers don't need to worry about shipping code execution environments with their agents. The complexity is managed by the server-side infrastructure.

This approach fundamentally redefines how agents can interact with complex systems. Instead of being limited by a curated list of tools, an agent can now, in essence, write code to interact with an entire platform. The implication is that the AI can become a more sophisticated programmer, capable of composing novel solutions and automating tasks that were previously out of reach due to complexity or context limitations.

The Advantage of Server-Side Code Generation

The shift to server-side code execution for AI agents is not just an incremental improvement; it’s a paradigm shift. By abstracting the complexity of tool interaction into code generation and executing that code in a secure, scalable environment, we unlock new possibilities. The "Code Mode" approach, when combined with server-side execution, allows for a much more natural and powerful interaction with AI. It moves us closer to a future where AI agents can truly programmatically interact with and manage complex systems, not just by calling pre-defined functions, but by writing the code that orchestrates those functions. This offers a significant competitive advantage for organizations that can master this paradigm, enabling them to build more capable, secure, and scalable AI-powered applications.

Key Action Items

Explore Code Mode: Begin by reading the Cloudflare blog post on Code Mode and experimenting with its capabilities in a coding agent.
- Immediate Action: Dump the blog post content into your preferred coding agent to understand its output.
Rethink Tooling Strategy: Evaluate your current AI agent's interaction model. Are you relying heavily on discrete tool definitions? Consider how you could abstract these into an SDK or API that the AI can generate code against.
- Immediate Action: Map out the top 5-10 most frequently used tools by your agents and identify opportunities for SDK abstraction.
Investigate Server-Side Execution: Understand the benefits of executing AI-generated code in a sandboxed, server-side environment.
- Immediate Action: Research Cloudflare's dynamic Workers or similar serverless execution environments.
Prioritize API Surface Area: For platforms with extensive APIs, consider how Code Mode and server-side execution can expose the entire surface area to agents, rather than a curated subset.
- Longer-Term Investment (6-12 months): Develop an internal SDK or a simplified API gateway for your core services that can be consumed by AI agents via generated code.
Embrace Code Generation: Encourage AI models to write code rather than just call functions. This requires training and fine-tuning models or carefully crafting prompts to elicit code generation.
- Immediate Action: Experiment with prompts that explicitly ask the AI to "write a script" or "generate code" to accomplish a task, rather than "call tool X."
Focus on Developer Experience: While agents can write code, ensuring that code is secure, deployable, and maintainable is critical. This involves robust sandboxing and clear error handling.
- Immediate Action: Define basic security guardrails for any AI-generated code that will be executed.
Embrace the "Uncomfortable" Complexity: The shift to code generation and server-side execution introduces new complexities, but these are the complexities that will unlock future capabilities.
- Discomfort Now, Advantage Later: Invest time in understanding these new paradigms, even if they feel more challenging than simple tool-calling initially. This effort will pay off in more powerful and versatile AI agents.

Related Episodes

AI-Powered Parallelization Creates New Software Development Bottlenecks

Mar 06, 2026 Latent Space: The AI Engineer Podcast

AI in coding shifts focus from individual speed to massive parallelization, creating new bottlenecks in code review and deployment. Embrace tools amplifying collective output for a significant advantage.

View Episode Notes →

AI Code Generation's True Challenge: Validation and System Oversight

Feb 11, 2026 The Changelog: Software Development, Open Source

AI generates code at scale, but the real challenge is managing its output. Developers must shift from coding to curating and verifying AI work to build the systems that build the machines.

View Episode Notes →

AI Transforms Developer Workflows: Agents, Databases, and Web Interaction

Dec 05, 2025 The Changelog: Software Development, Open Source

AI agents are revolutionizing development, shifting focus to oversight and AI-directed output. Discover how this evolution demands new tools and redefines developer productivity.

View Episode Notes →

AI Agents Accelerate Development, Shifting Bottlenecks to Orchestration

Nov 24, 2025 Data Engineering Podcast

AI accelerates software development 2-10x, shifting bottlenecks to review and coordination. Master "context as code" and multiplayer agent environments to gain a significant velocity advantage.

View Episode Notes →

AI Agents Democratize Software Development Via CLI Interaction

Jan 07, 2026 The Startup Ideas Podcast

Non-technical individuals can now build and ship software by guiding AI agents, transforming learning from syntax mastery to prompt engineering and democratizing innovation.

View Episode Notes →

AI Agents Shift Bottleneck From Model to Human Interaction

Dec 14, 2025 Lenny's Podcast: Product | Career | Growth

AI coding agents accelerate development by transforming engineers into "super-powered" teammates, reducing complex projects from weeks to days and enabling rapid iteration.

View Episode Notes →