SeamlessM4T

Multilingual

Speech-to-speech translation (S2ST)

Speech-to-text translation (S2TT)

Text-to-speech translation (T2ST)

Text-to-text translation (T2TT)

Automatic speech recognition (ASR)


import torch
from seamless_communication.models.inference import Translator

# Initialize a Translator object with a multitask model, vocoder on the GPU.
translator = Translator("seamlessM4T_large", vocoder_name_or_card="vocoder_36langs", device=torch.device("cuda:0"))

facebook/seamless-m4t-large · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

https://huggingface.co/facebook/seamless-m4t-large

facebook/seamless-m4t-large · Hugging Face

Introducing SeamlessM4T, a Multimodal AI Model for Speech and Text Translations | Meta

SeamlessM4T allows people to communicate effortlessly through speech and text across different languages.

https://about.fb.com/news/2023/08/seamlessm4t-ai-translation-model/

SeamlessM4T

Multilingual

Recommendations