Multilingual
- Speech-to-speech translation (S2ST)
- Speech-to-text translation (S2TT)
- Text-to-speech translation (T2ST)
- Text-to-text translation (T2TT)
- Automatic speech recognition (ASR)
import torch from seamless_communication.models.inference import Translator # Initialize a Translator object with a multitask model, vocoder on the GPU. translator = Translator("seamlessM4T_large", vocoder_name_or_card="vocoder_36langs", device=torch.device("cuda:0"))
facebook/seamless-m4t-large · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/facebook/seamless-m4t-large
Introducing SeamlessM4T, a Multimodal AI Model for Speech and Text Translations | Meta
SeamlessM4T allows people to communicate effortlessly through speech and text across different languages.
https://about.fb.com/news/2023/08/seamlessm4t-ai-translation-model/


Seonglae Cho