M

M Ctc T Large

Developed by speechbrain
A large-scale multilingual speech recognition model introduced by Meta AI, supporting 60 languages, based on a 1-billion-parameter Transformer encoder architecture.
Downloads 88
Release Time : 5/27/2022

Model Overview

M-CTC-T is a multilingual speech recognition model capable of converting speech to text, supporting multiple languages while preserving punctuation and capitalization.

Model Features

Multilingual Support
Supports speech recognition for 60 languages, covering a wide range of linguistic needs.
Large-scale Training Data
Trained on the Common Voice and VoxPopuli corpora, featuring extensive and diverse datasets.
Character-level Transcription
Uses unnormalized character-level transcription text, preserving punctuation and capitalization.

Model Capabilities

Speech Recognition
Multilingual Transcription
Character-level Text Generation

Use Cases

Speech Transcription
Multilingual Speech-to-Text
Converts speech in multiple languages to text, suitable for international application scenarios.
Character Error Rate (CER) of 21.4-23.3 on the Common Voice test set
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase