Conversational Speech Model
Conversational AIs
Conversational AI Notion
Hertz-dev
Introducing hertz-dev - Standard Intelligence
For the last few months, we at Standard Intelligence have been researching scalable cross-modality learning. We're excited to announce that we're open-sourcing current checkpoints of our full-duplex, audio-only base model, hertz-dev, with a total of 8.5 billion parameters and three primary parts:
https://si.inc/hertz-dev/
Full-Duplex-Bench
Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities
https://huggingface.co/papers/2503.04721
schema
A Full-duplex Speech Dialogue Scheme Based On Large Language Models
We present a generative dialogue system capable of operating in a full-duplex manner, allowing for seamless interaction. It is based on a large language model (LLM) carefully aligned to be aware...
https://arxiv.org/abs/2405.19487

Beyond one-on-one
Beyond one-on-one: Authoring, simulating, and testing dynamic human-AI group conversations
Erzhen Hu, Student Researcher, and Ruofei Du, Interactive Perception & Graphics Lead, Google XR
https://research.google/blog/beyond-one-on-one-authoring-simulating-and-testing-dynamic-human-ai-group-conversations/


Seonglae Cho