All Episodes
AI Redefines Scientific Method Through Agentic Loops and Scale
AI is fundamentally redefining science, moving beyond speed to unlock unprecedented discovery rates by mastering knowledge structure and nuanced judgment.
AI Integration Accelerates Scientific Discovery Beyond Productivity Gains
AI transforms scientific discovery by embedding directly into workflows, shifting bottlenecks from human effort to experimentation capacity and accelerating progress exponentially.
AI's Rapid Advancement Challenges Existing Models
AI models now solve advanced math problems and automate complex coding, transforming intellectual work and accelerating scientific discovery.
Brex's Three-Pillar AI Strategy Drives 10x Workflows and Business Growth
AI transforms finance software, shifting from dashboards to AI executive assistants coordinating specialist agents for 10x workflows and cost leverage.
Independent AI Benchmarking Reveals Cost Paradoxes and Nuanced Performance
Independent AI benchmarking reveals surprising truths: intelligence is cheap, but complex reasoning is becoming more expensive, and "I don't know" is a valuable metric.
Independent AI Benchmarking Reveals Cost, Transparency, and Performance Trade-offs
Independent AI benchmarking reveals that model labs manipulate results, and that the cost of intelligence is plummeting even as overall AI spending rises because of complex workflows.
Reinforcement Learning Scales With Self-Supervised Representation Learning
Reinforcement learning now scales to 1,000-layer networks by shifting from reward maximization to self-supervised representation learning, unlocking unprecedented performance.
Evolving AI Coding Benchmarks Toward Long-Horizon Development and Collaboration
AI coding agents now face long-horizon development tournaments and diverse tasks, moving beyond simple tests to simulate real-world engineering challenges and optimize performance.
LMArena's AI Evaluation North Star: Integrity, Real-World Feedback, and Vertical Expansion
LMArena drives AI evaluation with real-world conversations and transparent leaderboards, securing $100M to scale inference, expand into specialized verticals, and become the industry's North Star.
Post-Training AI Complexity Hinges on Data Quality and Token Efficiency
AI development pivots from scaling to nuanced post-training optimization, prioritizing data quality and token efficiency over raw compute for superior tool-calling and agent workflows.
Co-Designing AI Products and Models for Specialized RL Application
RL advances AI by integrating economically valuable tasks into model training, shifting from "one model fits all" to specialized, co-designed products for practical, rapid progress.
AI Personalization and Data Infrastructure Drive 2026 Consumerization
AI consumerization unlocks in 2026 via personalization, driven by memory management and continual learning, while real-world data proves superior to synthetic RL environments.
Model Context Protocol Emerges as Standard for Interoperable AI Agents
AI agents now communicate and integrate tools seamlessly via the Model Context Protocol, evolving from a simple experiment to an open industry standard.
AI Agents Redefine Software Engineering: From Code Writing to Orchestration
AI redefines coding: by 2025 the skill to master is AI orchestration, not writing lines of code, as traditional IDEs become outdated and experienced engineers face obsolescence.
AI Coding Agents Evolve to Trusted Collaborative Partners
AI coding agents, trained with "personality" and capable of complex, autonomous tasks, are revolutionizing software development and personal automation, democratizing elite engineering access by 2026.
SAM 3 Unifies Vision Tasks With Concept-Prompted Segmentation, Detection, and Tracking
Unify segmentation, detection, and tracking with natural language prompts. SAM 3 processes images in 30 ms, slashing annotation time and enabling advanced visual reasoning.
AI Security Requires System-Level Defense and Radical Transparency
AI guardrails are security theater, sacrificing capability without enhancing safety. True AI security requires system-level defenses and radical transparency, not model lobotomization.
Roadrunner Rebuilds CPQ for AI-Driven Pricing Complexity
Legacy CPQ systems fail modern sales complexity. Roadrunner's AI-native architecture rebuilds pricing models to automate deal desk functions and boost sales productivity.
Superhuman's AI Agentic Framework Accelerates Productivity and Engineering
Superhuman transforms your inbox into an AI agent, delivering proactive assistance without latency and accelerating engineers by 50% while widening the gap between skilled and unskilled developers.
World Models: Next AI Frontier Beyond LLMs
World models, trained on real-world interactions, are the next AI frontier, surpassing LLMs for spatial intelligence and embodied robotics.
Spatial Intelligence: Beyond LLMs to Generative 3D Worlds
Unlock AI's next frontier: spatial intelligence. Discover how generative world models like Marble move beyond LLMs to create and interact with rich 3D environments, powered by massive compute.
Spatial Intelligence: The Next Frontier Beyond Language AI
Spatial intelligence, the next AI frontier beyond LLMs, unlocks editable 3D worlds from multimodal inputs, offering richer understanding than language alone and enabling novel applications.
AI Engineering's Future: Output-Based Pay Unlocks Millions
Output-based AI engineering unlocks 10x productivity and shifts the bottleneck to human capital, where "long-term selfish" engineers, identified through rigorous interviews, form the elite talent for rapid prototyping and autonomous agent development.
Glean's "Boring" Search Moat Fuels AI Acceleration
Glean's "boring" enterprise search foundation has become a significant moat, while Anthropic's unprecedented growth redefines what "fastest-growing software company" means.