Gemini 3.5 Live Translate brings real-time speech translation to APIs, Meet, and Translate

Google's June 10, 2026 launch brings low-latency speech translation to the Gemini Live API, Google AI Studio, Google Meet, and Google Translate.

Google announced Gemini 3.5 Live Translate on June 10, 2026, bringing near real-time, natural speech translation across several surfaces. Developers can access it in public preview through the Gemini Live API and Google AI Studio, enterprises can test it in a Google Meet private preview starting this month, and consumers will see it roll out through Google Translate on Android and iOS.

The key change is that speech translation is moving from turn-by-turn processing toward streaming interaction. Gemini 3.5 Live Translate processes speech as it arrives, reducing the pause between listening and response. It also handles multilingual input without manual language configuration. Google highlights noise robustness as well, which matters for real meetings, classrooms, broadcasts, and customer-support environments.

Inside Google Meet, the capability expands speech translation to more than 70 languages and over 2,000 language combinations. Just as important, translation is no longer centered only on English as the bridge language. For distributed teams, that can make multilingual meetings feel closer to natural conversation instead of forcing every discussion through an English relay.

For developers, the Gemini Live API is the other major entry point. Real-time speech translation can be embedded beyond meetings into education, support, medical intake, cross-border sales, live events, and community products. Once a model can handle continuous audio, language detection, and natural voice output, translation becomes part of the interaction layer rather than a separate text feature.

Enterprises still need to design the boundaries carefully. Real-time translation runs into domain terminology, accents, background noise, privacy, recording policies, and accountability. For customer communication or internal decisions, teams should keep human confirmation, key summaries, and original-language records rather than treating translated output as the only source of truth.

Overall, Gemini 3.5 Live Translate shows multimodal AI moving from content generation toward communication infrastructure. When the same model capability reaches APIs, meetings, and consumer translation tools, language barriers shift from manual process problems into real-time services that can be embedded in products and workflows.

MODULE.002 //

More insights

Ideas on websites, AI automation, digital marketing, AI news, and VMTS updates.