Conversational AI

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 Nov 5 21:10
Editor
Edited
Edited
2026 Mar 24 14:6

Conversational Speech Model

Conversational AIs
 
 
Conversational AI Notion
 
 
https://www.pnas.org/doi/10.1073/pnas.0903616106
 

Hertz-dev

Introducing hertz-dev - Standard Intelligence
For the last few months, we at Standard Intelligence have been researching scalable cross-modality learning. We're excited to announce that we're open-sourcing current checkpoints of our full-duplex, audio-only base model, hertz-dev, with a total of 8.5 billion parameters and three primary parts:

Full-Duplex-Bench

Paper page - Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities
Join the discussion on this paper page
Paper page - Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities
schema
A Full-duplex Speech Dialogue Scheme Based On Large Language Models
We present a generative dialogue system capable of operating in a full-duplex manner, allowing for seamless interaction. It is based on a large language model (LLM) carefully aligned to be aware...
A Full-duplex Speech Dialogue Scheme Based On Large Language Models

Beyond one-on-one

Beyond one-on-one: Authoring, simulating, and testing dynamic human-AI group conversations
Erzhen Hu, Student Researcher, and Ruofei Du, Interactive Perception & Graphics Lead, Google XR
Beyond one-on-one: Authoring, simulating, and testing dynamic human-AI group conversations

Full Duplex Model

Streaming Requests & Realtime API in vLLM
Large language model inference has traditionally operated on a simple premise: the user submits a complete prompt (request), the model processes it, and returns
Streaming Requests & Realtime API in vLLM
STT를 넘고, Realtime STT, 그리고 곧 다가올 Full Duplex 모델 시대로
제 MBTI가 N이라서 그런지 개인적으로 미래 예측을 좋아하는데요, 다만 제 미래 예측을 믿고 판단 및 행동하는 편은 아닙니다.
STT를 넘고, Realtime STT, 그리고 곧 다가올 Full Duplex 모델 시대로
 
 

Backlinks

LLM

Recommendations