A protocol-level feature built on top of Vllm Streaming Requests' low latency, inspired by OpenAI's Realtime API as a WebSocket-based interface. WebSocket communication protocol between client and server for bidirectional real-time streaming of audio/text
Vllm Realtime API
Creator
Creator
Seonglae ChoCreated
Created
2026 Mar 24 14:20Editor
Editor
Seonglae ChoEdited
Edited
2026 Mar 24 14:25Refs
Refs
