Building Gemini's Coding Capabilities – Google AI: Release Notes – Podcast

Afleveringen

Yossi Matias on the golden age of research
22 jun· Google AI: Release Notes
Yossi Matias, Vice President at Google and Head of Google Research, joins host Logan Kilpatrick for a wide-ranging conversation about what it means to do research at Google scale. Their conversation explores the "magic cycle” of research from breakthrough to real-world impact across flood forecasting, MedGemma, and the just-launched Gemini for Science. Learn more about AI tools for scientific discovery including Co-Scientist and Empirical Research Assistance (ERA), and topics like speculative decoding, generative UI, and quantum computing milestones including the Willow chip.

Watch on YouTube: https://www.youtube.com/watch?v=FPBwadTeph0&t=1s
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Gemini co-leads on project origins and what's next
22 jun· Google AI: Release Notes
To mark the launch of Gemini 3.5 Flash, Logan Kilpatrick sat down at Gradient Canopy with four of the people who built it: Jeff Dean, Koray Kavukcuoglu, Noam Shazeer, and Oriol Vinyals. They talked about the origin of the Gemini project, the bet on a single unified model, why each Flash generation now outperforms the previous Pro, and what's coming next.

Watch on YouTube: https://www.youtube.com/watch?v=8hfpLa5wPGo
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Zijn er afleveringen die ontbreken?

Klik hier om de feed te vernieuwen.
Google I/O 2026 Recap with Logan Kilpatrick, Josh Woodward and Tulsee Doshi
22 jun· Google AI: Release Notes
Josh Woodward from Google Labs, Gemini and AI Studio and Tulsee Doshi from Google DeepMind join host Logan Kilpatrick to break down the biggest launches from Google I/O 2026, from Gemini 3.5 Flash and Gemini Omni to the debut of Gemini Spark. Their conversation digs into everything from agent payments and voice to predictions for what comes next.
Watch on YouTube: https://www.youtube.com/watch?v=RsDSeMXaCak
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Google Maps Leaders Talk About Its Biggest Update in 10 Years
12 mrt· Google AI: Release Notes
Miriam Daniel and David Cronin join host Logan Kilpatrick to unveil the biggest update in Google Maps history. This episode explores the launch of Ask Maps, a conversational experience powered by Gemini 3.0 Pro, and Immersive Navigation, a high-fidelity 3D driving experience. Learn how Gemini is making navigation more intuitive, personalized, and stress-free.
Watch on YouTube: https://www.youtube.com/watch?v=YzZFdHzx-Y4
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Gemini in Workspace: New Ways to Create Faster
12 mrt· Google AI: Release Notes
Chapters:

1:15 - Gemini in Docs
3:17 - Which models power Workspace
3:45 - AI Overviews in Drive
5:22 - Rollout and availability
6:33 - Reimagining every Workspace canvas
8:58 - Gemini in Calendar
9:50 - The future of the side panel
11:16 - A new way to work
13:18 - AI-powered slide generation
15:08 - User data and privacy
19:07 - Balancing AI innovation and user trust
22:04 - The power of Google Vids
24:42 - The journey to deep canvas integration
28:32 - Vibe coding meets Apps Script
32:42 - AI as a tool for thinking
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Gemini in Chrome: Your agentic browsing assistant
12 mrt· Google AI: Release Notes
Chapters:

0:00 - Introduction
2:49 - Evolution from web apps to integrated assistants
4:37 - Chrome as a platform for personal context
6:38 - Navigating the context overload problem
7:52 - Transforming media in-context with Nano Banana
9:10 - Solving tab overload with history recall
13:28 - The browser as an automated workflow system
15:50 - Demo: Nano Banana
17:20 - Demo: Auto browse
22:48 - Demo: Agentic research and guardrails
26:04 - Designing for billions
29:14 - Transitioning to agentic web actuation
30:37 - Standards and security in an AI-driven web
35:18 - Infrastructure and investment strategy
39:23 - Empowering knowledge workers
42:11 - Collaboration within Google
44:18 - Safety and the user alignment critic
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Inside Lyria 3, Google's music generation model
18 feb· Google AI: Release Notes
1:00 - Defining music generation models
1:40 - Lyria as a new instrument
3:05 - Connecting language and creative intent
5:08 - Guest backgrounds and musical journeys
7:57 - Demo: Instrumental funk jam
8:29 - Bridging the gap for non-musicians
12:03 - Demo: Exploring lyrics and vocals
15:07 - The magic of iterative co-creation
15:40 - Meeting users across the expertise spectrum
17:01 - Empowering new musical expressions
18:29 - Emotional and communal impact of music
19:51 - Opportunities for developers and community
21:09 - Real-time vs. song generation models
23:23 - Creating experimental sonic landscapes
25:08 - Demo: Capturing unexpectedness and energy
28:33 - Evaluating music through taste and expertise
31:30 - The diligence of music evaluation
31:52 - The future of Lyria and AI-first workflows
35:07 - Articulating creative vision through languag
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Project Genie: Create and explore worlds
30 jan· Google AI: Release Notes
Chapters:
00:00 - Intro and defining world models and RL roots
01:51 - Demo: Goldfish and shark in underwater world
04:59 - Project Genie gallery
06:31 - Physics, remixing, and UI prompts
11:00 - Demo: Nano Banana mascot “Bob”
13:20 - Constraints, generation limits, and infrastructure
17:04 - Trusted testers and robotics future
28:34 - Frontier prompting and universal simulation
29:27 - Cross-Google collaboration
31:16 - Adoption timelines and impact
34:16 - Model generalization and historical context
38:52 - Hardware limits and the slope of progress
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Gemini 3 and Gen UI in Google Search
18 dec 2025· Google AI: Release Notes
Rhiannon Bell and Robby Stein, Product and Design leads for Google Search, join host Logan Kilpatrick for a deep dive into the integration of Gemini 3 into Search. Their conversation explores the evolution of Generative UI, where models act as designers to create bespoke, interactive simulations on the fly. Learn more about the role of Gemini 3 Flash in delivering speed at scale, the development of Search's new "persona," and how models like Nano Banana are powering next-generation data visualization.
Watch on YouTube: https://www.youtube.com/watch?v=AqyclkRBSe4
Chapters:
0:00 - Introduction
1:24 - What is Generative UI?
2:23 - From static to generative design
6:37 - Interactive simulations
8:47 - Latency and visual QA
10:48 - Gemini 3 Flash in Search
12:08 - Fusing AI Mode and AI Overviews
14:24 - The Search persona
17:12 - Agentic system understanding
18:22 - Visualizing data with Nano Banana
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Sundar Pichai: Gemini 3, Vibe Coding and Google's Full Stack Strategy
26 nov 2025· Google AI: Release Notes
Logan Kilpatrick from Google DeepMind sits down with Sundar Pichai, CEO of Google and Alphabet to discuss the launch of Gemini 3, Nano Banana Pro and Google's overall AI momentum. They talk about Google’s long-term bets on infrastructure, what it’s actually like to ship SOTA models, and the rise of vibe coding. Sundar also shares his personal launch day rituals and thoughts on future moonshots like putting data centers in space.
Watch on YouTube: https://www.youtube.com/watch?v=iFqDyWFuw1c

Chapters:
0:00 - Intro
0:51 - Shipping Gemini 3
2:44 - Google's decade-long investment in AI
4:27 - The full stack advantage
5:43 - Scaling up compute and capacity
7:32 - Sim-shipping Gemini across products
9:35 - Nano Banana Pro
12:13 - Monitoring launch day
14:13 - Future model roadmap
16:05 - Launch day rituals
18:02 - The Blue Micro Kitchen
21:57 - Future moonshots
23:26 - The rise of vibe coding
26:50 - What’s next
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Nano Banana Pro: Hands-on with the World’s Most Powerful Image Model
26 nov 2025· Google AI: Release Notes
Introducing Nano Banana Pro, a powerful model built on Gemini 3 Pro, designed to enhance text rendering, infographics, and structured content generation. Tune in to learn about Nano Banana Pro’s advanced visual reasoning and multi-turn generation capabilities, and how this next-gen tool enables complex image edits and real-world applications. In this episode, we discuss how user feedback and continuous benchmarking drive model improvements, ensuring a superior experience for developers.
Watch on YouTube: https://www.youtube.com/watch?v=hk6gwiZmSWA

Chapters:
00:00 - Introducing Nano Banana Pro
02:00 - Enhanced world understanding
04:59 - Advanced text rendering
05:49 - Gemini 3 Pro's influence
09:30 - Multi-turn & infographics
14:04 - Text rendering comparison
16:26 - Multilingual text support
18:22 - Infographics for learning
24:00 - Multi-image input
26:38 - Resolution & fidelity
30:07 - Advanced editing & style
32:09 - Practical use cases
35:26 - Future outlook & thanks
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Koray Kavukcuoglu: “This Is How We Are Going to Build AGI”
25 nov 2025· Google AI: Release Notes
Join Logan Kilpatrick and Koray Kavukcuoglu, CTO of Google DeepMind and Chief AI Architect of Google, as they discuss Gemini 3 and the state of AI!
Their conversation includes the reception of Gemini 3, the ongoing advancements in AI research, and the role of benchmarks in pushing new frontiers. They explore critical areas for Gemini's focus, emphasizing instruction following, tool calls, and internationalization, alongside Google's collaborative approach to AI development.
Watch on YouTube: https://www.youtube.com/watch?v=fXtna7UrL44
Chapters:
0:00 - Intro
2:00 - Gemini 3 launch reception
4:16 - Continuous progress and innovation
6:47 - Key areas for Gemini improvement
11:45 - Product scaffolding for model improvement
13:56 - Chief AI architect role
17:04 - Engineering mindset and collaboration
18:37 - Future growth areas for Gemini
20:33 - From research to engineering mindset
23:22 - The rise of generative media
27:22 - Nano Banana Pro capabilities
29:31 - Towards unified model checkpoints
36:26 - Organizing for AI success
38:26 - Balancing exploration and scaling
41:40 - DeepMind's collaborative culture
45:21 - Innovating at Google
48:37 - Closing
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Google Antigravity: Hands on with our new agentic development platform
25 nov 2025· Google AI: Release Notes
Explore Antigravity, Google DeepMind’s innovative new AI developer coding product, with Varun Mohan on Release Notes. This episode dives into Antigravity as a powerful agent development platform, integrating a familiar IDE experience with browser verification and Gemini 3.0 capabilities. Discover how developers can orchestrate complex agentic workflows, leverage artifacts for task communication, and balance AI automation with human collaboration. Learn about the philosophy behind building next-gen agentic experiences, the platform's multimodal strengths, and its role in accelerating software development at scale.
Watch on YouTube: https://www.youtube.com/watch?v=uzFOhkORVfk
Chapters
00:00 - Introducing Google Antigravity
04:02 - Evolution of AI in coding
04:53 - Beyond writing code
06:21 - Ideal Google Antigravity user
09:48 - Evolving user personas
11:46 - Agents versus the IDE
14:46 - Human-agent collaboration
16:43 - Local versus server-side
18:50 - Self-improvement and knowledge
21:29 - Generalizing agent capabilities
24:20 - Naming Google Antigravity
27:04 - Integrating Google's AI models
27:59 - Demo: Airbnb for dogs
28:48 - Understanding artifacts
29:51 - Asynchronous user feedback
32:16 - Agent manager workflow
33:17 - Browser actuation demo
34:36 - Browser for research and testing
36:45 - Parallel agent conversations
41:04 - Agent task best practices
42:51 - Future of Google Antigravity
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Gemini 3: Launch day reactions
25 nov 2025· Google AI: Release Notes
Join us for a special episode of Release Notes as we unpack Gemini 3, Google’s latest AI model with key team members. Learn how Gemini 3 empowers developers with enhanced multimodal understanding, agentic capabilities for complex tasks, and generative interfaces that transform prompts into interactive applications. We discuss real-world use cases, the iterative development process driven by user feedback, and the strategic balance between model performance and broad accessibility across various Google platforms.
Watch on YouTube: https://www.youtube.com/watch?v=mci0f2dy7G0
Chapters:
00:00 - Introducing Gemini 3
03:08 - Gemini 3 everywhere
04:13 - The product-model partnership
08:20 - Balancing speed and quality
11:40 - Gemini 3 'wow' moments
27:47 - Generative interfaces and UI
31:44 - Gemini's agentic capabilities
33:55 - Proactive AI and future
34:55 - Managing compute demand
39:32 - The Gemini 3 family
41:45 - Conclusion
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
How a Moonshot Led to Google DeepMind's Veo 3
16 okt 2025· Google AI: Release Notes
Dumi Erhan, co-lead of the Veo project at Google DeepMind, joins host Logan Kilpatrick for a deep dive into the evolution of generative video models. They discuss the journey from early research in 2018 to the launch of state-of-the-art Veo 3 model with native audio generation. Learn about the technical hurdles in evaluating and scaling video models, the challenges of long-duration video coherence and how user feedback is shaping the future of AI-powered video creation.

Chapter:
0:00 - Intro
0:47 - Veo project's beginnings
3:02 - Veo's origins in Google Brain
5:07 - Video prediction and robotics applications
7:45 - Early progress and evaluation challenges
10:30 - Physics-based evaluations and their limitations
12:18 - The launch of the original Veo model
14:06 - Scaling challenges for video models
16:02 - The leap from Veo1 to Veo2
19:40 - Veo 3’s viral audio moment
21:17 - User trends shaping Veo's roadmap
23:49 - Image-to-video vs. text-to-video complexity
26:00 - New prompting methods and user control
27:55 - Coherence in long video generation
31:03 - Genie 3 and world models
35:54 - The steerability challenge
41:59 - Capability transfer and image data's role
47:25 - Closing
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
GDM’s Pushmeet Kohli on solving science's biggest challenges with AI
15 sep 2025· Google AI: Release Notes
Pushmeet Kohli, Head of Science and Strategic Initiatives at Google DeepMind, joins host Logan Kilpatrick to explore the intersection of AI and scientific discovery. Learn how the team's unique problem-solving framework led to innovations like AlphaFold and AlphaEvolve, and how new tools like AI Co-scientist aim to democratize these types of breakthroughs for everyone.
Watch on YouTube: https://www.youtube.com/watch?v=o7mdsL6BHsk
Chapters:
0:00 - Intro
1:04 - Recent Alpha launches
02:15 - Framework for selecting research domains
06:21 - Scientific, commercial and social impact
15:00 - Wielding AGI for breakthroughs
16:48 - Tech transfer and team collaboration
19:46 - IMO Gold Medal
21:42 - Evaluating math proofs
22:55 - From specialized models to Deep Think
24:22 - Do math skills generalize?
25:53 - Generalizing the IMO model
27:43 - Democratizing AI science tools
30:09 - AI Co-scientist
35:17 - An API for science?
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Behind the scenes of Google's state-of-the-art "nano-banana" image model
27 aug 2025· Google AI: Release Notes
Join host Logan Kilpatrick in discussion with some of the minds behind Google's new state-of-the-art image model, Gemini 2.5 Flash. Product and research leads from the Gemini team break down the technology behind its key capabilities, including interleaved generation for complex edits and new approaches to achieving character consistency and pixel-perfect control. With Nicole Brichtova, Kaushik Shivakumar, Mostafa Dehghani and Robert Riachi.

Watch on YouTube:

Chapters:
0:37 - New model introduction
1:21 -Demo - Image Editing
3:44 - Text rendering capabilities
4:44 Beyond human preference evals
6:44 - Text rendering as a proxy for quality
8:38 - Positive transfer between modalities
11:25 - Demo - Multi-turn, context aware image generation
13:54 - Pixel-perfect editing and character consistency
15:51 - Interleaved image generation
17:59 - Specialized vs. native models
19:52 - Understanding nuanced prompts
20:59 - User feedback shaping model development
22:37 - Improvements in character consistency
24:17 - More natural looking images from team collaboration
26:41 - What’s next for image generation models
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Demis Hassabis on shipping momentum, better evals and world models
11 aug 2025· Google AI: Release Notes
Demis Hassabis, CEO of Google DeepMind, sits down with host Logan Kilpatrick. In this episode, learn about the evolution from game-playing AI to today's thinking models, how projects like Genie 3 are building world models to help AI understand reality and why new testing grounds like Kaggle’s Game Arena are needed to evaluate progress on the path to AGI.
Watch on YouTube: https://www.youtube.com/watch?v=njDochQ2zHs

Chapters:
00:00 - Intro
01:16 - Recent GDM momentum
02:07 - Deep Think and agent systems
04:11 - Jagged intelligence
07:02 - Genie 3 and world models
10:21 - Future applications of Genie 3
13:01 - The need for better benchmarks and Kaggle Game Arena
19:03 - Evals beyond games
21:47 - Tool use for expanding AI capabilities
24:52 - Shift from models to systems
27:38 - Roadmap for Genie 3 and the omni model
29:25 - The quadrillion token club
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Building real-time voice applications with Live API
6 aug 2025· Google AI: Release Notes
Shrestha Basu Mallick, one of the product leads for the Gemini API, joins host Logan Kilpatrick for a deep dive of Gemini Live API, Google’s real-time, multimodal interface for developers. Learn about how native audio alongside new capabilities like proactive audio and async function calling unlocks the unique power of audio as an interface.
Watch on YouTube: https://www.youtube.com/watch?v=4xlwlU6h-wM

0:00 - Intro
1:18 - Live API Overview
3:36 - Why audio is a special modality
5:07 - Speed vs. precision in audio
6:17 - Controllable and promptable TTS
8:31 - What developers are building with the Live API
11:14 - URL context and async calling features
15:02 - Proactive audio and affective dialog
16:55 - Addressing developer feedback
21:54 - Live API roadmap
23:49 - The role of long context
24:57 - What’s next for the Live API
26:41 - State of the AI audio market
30:10 - Advice for developers getting started with the Live API
31:16 - Live API demo
38:10 - Demo wrap up and closing
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Building a frontier AI search experience
23 jul 2025· Google AI: Release Notes
Robby Stein, VP of Product for Google Search, joins host Logan Kilpatrick to explore how Search is evolving into a frontier AI product. Their conversation covers the shift from simple keywords to complex, conversational queries, the rise of agentic capabilities that can take action on your behalf, and the vision to help billions of users truly "ask anything." Learn more about the technology behind AI Overviews, AI Mode, Deep Search, and the future of multimodal interaction.

Watch on YouTube: https://youtu.be/zUB5A_ezIOU
Chapters
01:07 Search as a Frontier AI Product
02:38 Reaching 1.5 Billion Users
03:37 What Is AI Mode?
04:17 Understanding Query Fan-Out
05:18 Balancing Latency and performance with Gemini 2.5 Pro
06:51 How Deep Search works
09:08 Fine-tuning models for product experience
11:24 Shifting user behaviors
14:07 The rise of visual search
16:52 Speech and conversational AI in Search
18:36 Comparing Gemini and Search
20:04 Real-time tool use in Search
22:52 Evolving the Search interface
26:03 Making Search more personal
29:15 The agentic future of Search
31:15 Agents beyond booking tickets
37:11 On-the-fly software creation
38:06 Google DeepMind and Search collaboration
40:08 What's next for Search
- Luisteren Nogmaals beluisteren Doorgaan Wordt afgespeeld...
- Later beluisteren Later beluisteren
Laat meer zien

Afleveringen

Yossi Matias on the golden age of research

Gemini co-leads on project origins and what's next

Google I/O 2026 Recap with Logan Kilpatrick, Josh Woodward and Tulsee Doshi

Google Maps Leaders Talk About Its Biggest Update in 10 Years

Gemini in Workspace: New Ways to Create Faster

Gemini in Chrome: Your agentic browsing assistant

Inside Lyria 3, Google's music generation model

Project Genie: Create and explore worlds

Gemini 3 and Gen UI in Google Search

Sundar Pichai: Gemini 3, Vibe Coding and Google's Full Stack Strategy

Nano Banana Pro: Hands-on with the World’s Most Powerful Image Model

Koray Kavukcuoglu: “This Is How We Are Going to Build AGI”

Google Antigravity: Hands on with our new agentic development platform

Gemini 3: Launch day reactions

How a Moonshot Led to Google DeepMind's Veo 3

GDM’s Pushmeet Kohli on solving science's biggest challenges with AI

Behind the scenes of Google's state-of-the-art "nano-banana" image model

Demis Hassabis on shipping momentum, better evals and world models

Building real-time voice applications with Live API

Building a frontier AI search experience