Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/Audio AI/Voice AI/TTS/
Dia
Search

Dia

Creator
Creator
Seonglae Cho
Created
Created
2025 Apr 26 16:6
Editor
Editor
Seonglae Cho
Edited
Edited
2025 Jun 15 22:53
Refs
Refs
Parkeet
SoundStorm
dia
nari-labs • Updated 2025 Jun 15 22:19
, No training script, just TTS model
(laughs), (clears throat), (sighs), (gasps), (coughs), (singing), (sings), (mumbles), (beep), (groans), (sniffs), (claps), (screams), (inhales), (exhales), (applause), (burps), (humming), (sneezes), (chuckle), (whistles)
  • RoPE
  • RMS Normalization
  • Grouped-query Attention
  • Byte Level Tokenizer
  • CFG Scale
  • Delay Pattern
 
 
 
 
 
Nari Labs: Dia Examples | Notion
Comparison between Dia-1.6B (ours), ElevenLabs Studio, and Sesame CSM-1B. Plus fun examples (including audio prompt use).
Nari Labs: Dia Examples | Notion
https://yummy-fir-7a4.notion.site/dia
Nari Labs: Dia Examples | Notion
nari-labs/Dia-1.6B · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
nari-labs/Dia-1.6B · Hugging Face
https://huggingface.co/nari-labs/Dia-1.6B
nari-labs/Dia-1.6B · Hugging Face
Documentation
documentation.md
devnen

Nari Labs

Nari Labs : Free And Open-Source TTS AI Voice Dialogue
Discover Nari Labs, a open-source TTS AI for ultra-realistic dialogue and voice cloning. Build immersive audio experiences with real-time streaming.
Nari Labs : Free And Open-Source TTS AI Voice Dialogue
https://narilabs.org/
Nari Labs : Free And Open-Source TTS AI Voice Dialogue
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/Audio AI/Voice AI/TTS/
Dia
Copyright Seonglae Cho