S

Seamless M4t V2 Large

Developed by facebook
SeamlessM4T v2 is a large-scale multilingual multimodal machine translation model released by Facebook, supporting speech and text translation for nearly 100 languages.
Downloads 64.59k
Release Time : 11/29/2023

Model Overview

SeamlessM4T is a comprehensive large-scale multilingual multimodal machine translation model that provides high-quality translation services for speech and text. It supports multiple tasks including speech-to-speech, speech-to-text, text-to-speech, text-to-text translation, and automatic speech recognition.

Model Features

Multilingual support
Supports speech input in 101 languages, text input/output in 96 languages, and speech output in 35 languages.
Multimodal translation
Supports multiple tasks including speech-to-speech, speech-to-text, text-to-speech, text-to-text translation, and automatic speech recognition.
High-quality translation
Adopts the new UnitY2 architecture, outperforming the previous version in both quality and inference speed for speech generation tasks.
Fast inference
Significantly improves inference speed through hierarchical character-to-unit upsampling and non-autoregressive text-to-unit decoding.

Model Capabilities

Speech-to-speech translation
Speech-to-text translation
Text-to-speech translation
Text-to-text translation
Automatic speech recognition

Use Cases

Translation services
Multilingual conference translation
Real-time translation of conference speech into text or speech output in multiple languages.
High-quality multilingual translation, improving conference efficiency and communication effectiveness.
Speech content transcription
Automatically transcribes speech content into text, supporting multiple languages.
Accurate speech recognition and transcription, suitable for scenarios like subtitle generation and meeting minutes.
Education
Language learning assistance
Helps learners with language learning through mutual translation between speech and text.
Provides high-quality multilingual translation to assist language learning.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase