Multi-Agent Systems Require New Infrastructure for Identity, Discovery, and Observability

TL;DR

Multi-agent systems enable enterprises to tap into the service market by automating repetitive tasks, potentially increasing job interest and unlocking new value beyond traditional software.
Specialized agents, provided by best-in-class vendors or developed internally, are crucial for enterprise trust, necessitating an ecosystem of collaborating agents from diverse sources.
A new infrastructure layer beyond cloud-native is required for multi-agent systems, as agents possess unique properties of humans and workloads operating at machine speed.
Agent identity management is complex, requiring a blend of workload and business identity that adapts to scale and user context, necessitating new proxy solutions.
Decentralized discovery mechanisms like the proposed "Oasis" framework are essential for an "internet of agents," moving beyond simple registries to enable interoperability.
Observability for multi-agent systems must extend beyond workload metrics to include syntactic and semantic layers, reconstructing call graphs and assessing behavioral consistency.
The "lift and shift" approach to agentic systems, while a starting point, misses the full benefits of microservices and elastic, stateless architectures.

Deep Dive

Multi-agent systems, conceptualized as microservices, represent a significant advancement beyond single-agent AI, offering the potential to unlock vast service market value by automating repetitive tasks and enhancing human roles. This paradigm shift necessitates a new infrastructure layer, distinct from cloud-native architectures, to manage the unique properties of agents that combine human-like attributes with machine-speed execution and evolving identities. The development of these decentralized, interoperable agent ecosystems is crucial for enterprises to trust and leverage AI, moving beyond isolated, universal agents towards specialized, collaborating entities.

The core challenge lies in building a robust and trustworthy infrastructure for these multi-agent systems. Unlike traditional workloads, agent identity is fluid, shifting rapidly based on user context and task, demanding sophisticated access control and authorization mechanisms that go beyond standard workload identities. This complexity is further amplified by the need for effective agent discovery and communication protocols. While current approaches like registries and direct URL-based discovery are limited, open-source initiatives such as AGNTCY, with its Oasis framework and decentralized directory mesh, aim to create an "internet of agents" where diverse agents can be discovered and collaborate seamlessly. This decentralized approach, inspired by web3 principles, prioritizes provenance and verifiable agent identity, fostering a more open and interoperable landscape.

Observability in multi-agent systems also requires a significant evolution beyond traditional cloud-native telemetry. Simply monitoring workload health is insufficient; understanding the syntactic and semantic layers of agent communication is critical, especially in self-forming systems where call graphs are dynamic. Extending tools like OpenTelemetry to encompass LLM, single-agent, and multi-agent interactions, coupled with metric computation engines that can reconstruct call graphs and assess system stability, provides deeper insights. These tools enable qualitative analysis, such as identifying critical agents and evaluating the consistency of agent behavior, mirroring the management of human teams. While some complex use cases, like autonomous driving, may require near-perfect AI, many enterprise applications can achieve substantial value with 80% accuracy, accelerating processes like root cause analysis and making AI-driven services increasingly practical and impactful.

Action Items

Create agent identity framework: Define 3-5 distinct logical identities for agents, bridging workload and business context.
Audit agent communication protocols: Evaluate 5-10 communication patterns for reliability and semantic clarity (ref: A2A, ACP).
Implement agent discovery system: Deploy 3-5 decentralized directory nodes for agent discoverability (ref: Oasis, DHT).
Measure multi-agent system consistency: Track 5-10 tasks across 3-5 contexts to quantify behavioral stability.
Extend observability to agent semantics: Integrate Layer 8/9 analysis into 3-5 core agent interactions.

Related Episodes

Specialized AI Inference: Infrastructure, Performance, and Developer Experience

Nov 18, 2025 Gradient Dissent: Conversations on AI

Master AI inference infrastructure for custom models, optimizing performance and developer experience beyond basic serving to unlock true innovation and competitive advantage in a rapidly evolving market.

View Episode Notes →

AI Adoption Drives Market Shift Amidst Geopolitical and Energy Challenges

Dec 09, 2025 Bloomberg Surveillance

AI adoption drives efficiency and growth, reshaping markets by 2026, while energy demand surges and geopolitical trade dynamics shift.

View Episode Notes →

AI Cares: Organic Alignment Over Control

Nov 17, 2025 The a16z Show

Current AI alignment approaches are flawed; true alignment requires AI that genuinely cares and can say "no" to harmful requests, mirroring organic human relationship development.

View Episode Notes →

Vercel's Durable Serverless for AI Agents

Oct 31, 2025 Latent Space: The AI Engineer Podcast

## Episode Synopsis This episode centers on Vercel's strategic embrace of the AI engineering movement, focusing on making AI development more...

View Episode Notes →

Superhuman's AI Agentic Framework Accelerates Productivity and Engineering

Dec 11, 2025 Latent Space: The AI Engineer Podcast

Superhuman transforms your inbox into an AI agent, delivering proactive assistance without latency and accelerating engineers by 50% while widening the gap between skilled and unskilled developers.

View Episode Notes →

Trump's Autocratic Foreign Policy and Shifting Cultural Dynamics

Jan 06, 2026 Pivot

Trump's foreign policy assertions, driven by oil interests and personal gain, risk global instability and distract from domestic crises, while niche content now disrupts traditional media.

View Episode Notes →

Key Quotes

"we were of course like everyone you know witnessing you know the fast progress of llms and bots and the first agents and everyone was amazed by how these agents and these llms initially were capable of managing you know access to knowledge through this perfectly fluid natural language interface and the beauty of generative ai and so that was fascinating but something which intrigued us even more which got a little bit unnoticed is that as the world were getting acquainted with these strange entities which are llms we realized down the road that they had reasoning capabilities"

Guillaume De Saint Marc explains that while the initial fascination with LLMs centered on their ability to access knowledge through natural language, a more profound discovery was their emergent reasoning capabilities. This realization, he argues, is as significant as the initial chatbot breakthroughs and points towards a more complex future for AI.

"so we realized that to be trusted by an enterprise an agent needs to be really super specialized really really specialized and probably provided by the best vendor in this domain whatever the task is and so we rapidly convinced ourselves that agents will have to be really really specialized to be trusted by enterprise and then the minute you say okay enterprise will get the best agent from the best vendor or they might actually develop it themselves if this is really specific to their domain and part of their core business activity the minute you have an agent this is like hiring a super expert right so you hire a super expert and then you keep this guy you know locked in a room all day long and collaborating with no one else that's not great"

Guillaume De Saint Marc posits that for enterprises to trust AI agents, these agents must be highly specialized. He likens this to hiring a human expert, suggesting that a single, all-encompassing agent would be less effective and trustworthy than a collection of specialized agents. This specialization, he argues, is key to enterprise adoption.

"we realized the minute you have an agent the second after you wanted to collaborate with another agent specialized in you know the next task or the adjacent task and we're like okay so this is going to be a multi agent we started to realize how this is going to mimic how humans can work as teams and so that's why we were convinced very early that it was going to be important to have these ecosystems of agents capable of working with each other agents coming from different vendors hosted in different platforms they work with different frameworks and this is where we you know started to talk about the internet of agent like how do we bring this new internet where you can have agents from different governance collaborating in this ecosystem"

Guillaume De Saint Marc articulates the natural progression from single agents to multi-agent systems, drawing a parallel to human teamwork. He emphasizes the need for an "internet of agents" where diverse agents, potentially from different vendors and platforms, can collaborate effectively within an ecosystem. This interoperability, he suggests, is crucial for scalability and innovation.

"agents they have attributes of humans but they have also properties of workloads and they work at machine speed and scale and the way they get across the stack is very very new and you need to cater for that and so that's when we convinced ourselves we need a new layer a new low level infrastructure regardless of which multi agent application you're going to develop to build on that"

Guillaume De Saint Marc argues that agents represent a fundamentally new paradigm that requires a distinct infrastructure layer beyond existing cloud-native stacks. He explains that agents possess both human-like attributes and workload properties, operating at machine speed, which necessitates specialized low-level infrastructure to support multi-agent applications effectively.

"so you can launch a sort of a semantic search i'm looking for an agent which have this type of skills and then you get all the answers for that planet scale so we're starting a federation we're starting a test bed with this and with a few players so we hope we can announce that soon and it's going to be the first time we see completely different registries completely different types of agenting cars starting to operate and be able to find each other through this mesh network"

Guillaume De Saint Marc describes the vision for a decentralized agent discovery system, inspired by web3 principles and cybersecurity frameworks. He highlights the proposed "agency directory" as a mesh network where diverse agent descriptions can interoperate, enabling planet-scale semantic searches for agents with specific skills, moving beyond current limited, non-interoperable registries.

"you can't just call it application yes there is a lot of application information shared but you need to like open the message and look at it and it has a syntax and it has a semantic right the syntax is you know this is where we have a to a and this is where you have ways to or mcp or like ways to make sure that the communication the syntax of the communication can flow from one entity to the other but the level above is really like the semantic and you need to get into this type of observability as well if you want to understand what the hell is happening or going on between all these agents"

Guillaume De Saint Marc stresses the importance of semantic observability for multi-agent systems, differentiating it from traditional workload monitoring. He explains that understanding the "syntax" (communication protocols) is insufficient; one must also analyze the "semantics" (meaning) of agent interactions to grasp the overall system behavior, especially in complex, self-forming architectures.

Resources

External Resources

Articles & Papers

"Treating your agents like microservices" (The Stack Overflow Podcast) - Discussed as the title of the episode featuring Guillaume De Saint Marc.

People

Guillaume De Saint Marc - VP of Engineering at Outshift by Cisco, guest on the podcast discussing multi-agent architectures.
Ryan Donovan - Host of The Stack Overflow Podcast, editor of the blog.
Avraam Mavridis - Socratic badge winner for asking well-received questions on 100 separate days.

Organizations & Institutions

Outshift by Cisco - Tech incubator pursuing emerging technologies like agentic AI.
Cisco - Company where Guillaume De Saint Marc is VP of Engineering.
DeepMind - Mentioned in relation to demonstrating reasoning capabilities of LLMs.
Anthropic - Mentioned in relation to reasoning capabilities of LLMs.
OpenAI - Mentioned in relation to reasoning capabilities of LLMs.
Google - Mentioned in relation to A2A (Agent to Agent) initiative.
Microsoft - Mentioned in relation to extending OpenTelemetry.
MIT - Mentioned in relation to the Namba project.
Splunk - Now part of Cisco, mentioned for work on extending OpenTelemetry.

Websites & Online Resources

Outshift (https://outshift.cisco.com/) - Cisco's tech incubator, mentioned for information on multi-agent architecture.
AGNTCY (https://agntcy.org/) - Open-source collective for multi-agent architecture.
Stack Overflow (https://stackoverflow.com/) - Platform where the podcast is hosted and Avraam Mavridis earned a badge.
Linkedin - Platform for connecting with Guillaume De Saint Marc and Ryan Donovan.
Art19 (https://art19.com/privacy) - Mentioned for privacy policy.
Oasis - Proposed system for agent discovery, inspired by OCSf.
OCSF (Open Agentic Description Framework) - Framework for defining agent cards.
Agent Directory - Open-sourced module for agent discovery.
At Protocol - Mentioned as inspiration for decentralized directory mesh.
Blue Sky - Mentioned as inspiration for decentralized directory mesh.

Other Resources

Multi-agent architectures - Primary topic of discussion in the episode.
Microservices - Concept used to describe how agents can be treated.
Agentic AI - Emerging technology pursued by Outshift.
Quantum computing - Emerging technology pursued by Outshift.
Next-gen infrastructure - Emerging technology pursued by Outshift.
LLMs (Large Language Models) - Technology discussed in relation to agents and reasoning capabilities.
Generative AI - Technology discussed in relation to agents.
Cloud native architecture - Architecture discussed in relation to building scalable systems.
A2A (Agent to Agent) - Protocol for agent communication.
ACP (Agent Communication Protocol) - Protocol created in Agency.
Agency - Open-source initiative for agent ecosystems.
SMP (Service Management Protocol) - Mentioned in relation to A2A.
SPI (Service Principal Identity) - Classic workload identity example.
MCP (Multi-cloud Platform) - Server used for agent access, discussed in relation to security and agent-to-agent communication.
OpenTelemetry - Framework for telemetry, discussed in relation to extending it for agents.
Splunk - Mentioned as a tool for telemetry.
MCE (Metric Computation Engine) - Open-source tool for analyzing multi-agent system telemetry.
LLM as a judge - Qualitative analysis method for multi-agent systems.
AGI (Artificial General Intelligence) - Future concept discussed in relation to LLMs.
Layer 8 and 9 - Syntactic and semantic layers of communication, discussed in relation to observability.
DHTs (Distributed Hash Tables) - Technology used for decentralized directories.
Namba project - Project at MIT related to agent discovery.
Socratic badge - Badge awarded on Stack Overflow.