DeepL's Specialized AI: Data, Compute, and Translation Mastery

Gradient Dissent: Conversations on AI · July 08, 2025 · Listen to Original Episode →

Original Title: How DeepL Built a Translation Powerhouse with AI with CEO Jarek Kutylowski

Related Episodes

Mixture-of-Experts Architecture Drives AI Intelligence and Cost Efficiency

Dec 29, 2025 NVIDIA AI Podcast

Unlock massive AI intelligence with fewer active parameters. Discover how Mixture-of-Experts and specialized hardware slash computational costs, making advanced AI more accessible and efficient.

View Episode Notes →

AI Revolution: Unprecedented Acceleration, Market Infancy, and Shifting Economics

Jan 07, 2026 The a16z Show

AI is reshaping markets at an unprecedented rate, driven by collapsing costs and rapid global adoption. The market remains in its early stages, with current AI products poised for dramatic evolution.

View Episode Notes →

AI's Evolution: From Reactive Answers to Proactive Super Assistants

Mar 15, 2026 BG2Pod with Brad Gerstner and Bill Gurley

AI evolves from answering questions to becoming a proactive "super assistant" that unlocks latent human potential by handling complex, long-horizon tasks.

View Episode Notes →

Nvidia-Intel Alliance Reshapes AI Chip Landscape

Jan 06, 2026 AI + a16z

Nvidia invests $5 billion in Intel, forging an unexpected alliance to reshape AI chip manufacturing and challenge competitors like AMD and ARM.

View Episode Notes →

NVIDIA's Extreme Co-Design Fuels AI Infrastructure Revolution

Sep 26, 2025 BG2Pod with Brad Gerstner and Bill Gurley

AI's shift to accelerated computing is an infrastructure revolution, demanding trillions in refreshes and augmenting human intelligence to drive unprecedented GDP growth and national security imperatives.

View Episode Notes →

Unlock Idle GPUs: The Tetris Game of AI Resource Allocation

Nov 25, 2025 The Stack Overflow Podcast

GPU scarcity stems from underutilization, not capacity limits; efficient allocation requires Tetris-like scheduling, not simple distribution, optimizing omnicloud resources for AI's CapEx economics.

View Episode Notes →

Resources

Resources & Recommendations

Books

"transformer" - Mentioned as a foundational architecture that quickly emerged and became relevant for translation models.

Tools & Software

DGX - Generation of GPUs from Nvidia, mentioned in the context of substantial costs for training large-scale models.

People Mentioned

Jarek Kutylowski (CEO of DeepL) - The guest on the podcast, discussing DeepL's translation powerhouse built with AI.

Organizations & Institutions

DeepL - A company specializing in AI-powered translation, particularly for businesses.
OpenAI - Mentioned as a competitor in the AI space, against whom DeepL aims to stay ahead.
Meta - Mentioned in the context of publishing large language models like Llama.
Nvidia - Manufacturer of DGX GPUs, crucial for DeepL's model training.

Websites & Online Resources

Llama (Meta) - A large language model published by Meta, discussed in relation to DeepL's model training strategies.

Other Resources

Neural Machine Translation - The technology that DeepL adopted in 2017, marking a significant shift in translation capabilities.
Transformer architecture - A key development in neural machine translation, mentioned for its rapid emergence.
Parallel corpora - Bilingual text data used for training translation models, discussed in the context of data acquisition.
Monolingual corpora - Single-language text data, also important for training translation models, especially for languages with less bilingual data.
Speech translation - A newer market for DeepL, introduced last year, offering real-time translation for spoken language.