Gemini Live

Creator

Creator

Seonglae Cho

Created

Created

2024 Dec 22 14:11

Editor

Editor

Seonglae Cho

Edited

Edited

2025 Oct 7 23:19

Refs

Refs

OpenAI Realtime API

live-api-web-console

google-gemini • Updated 2025 Oct 7 12:6

not yet for webrtc

Multimodal Live API | Gemini API | Google AI for Developers

The Multimodal Live API enables low-latency, two-way interactions that use text, audio, and video input, with audio and text output. This facilitates natural, human-like voice conversations with the ability to interrupt the model at any time. The model's video understanding capability expands communication modalities, enabling you to share camera input or screencasts and ask questions about them.

Multimodal Live API | Gemini API | Google AI for Developers

https://ai.google.dev/api/multimodal-live

Multimodal Live API | Gemini API | Google AI for Developers

Gemini Live API

Google AI Studio on Twitter / X

https://t.co/MZz9dI3ws6— Google AI Studio (@GoogleAIStudio) September 23, 2025

https://x.com/GoogleAIStudio/status/1970545734736023564

Backlinks

Recommendations

////////