M

Mctct Large

Developed by cwkeam
A large-scale multilingual speech recognition model introduced by Meta AI, featuring 1 billion parameters and supporting character-level transcription for 60 languages
Downloads 21
Release Time : 5/5/2022

Model Overview

M-CTC-T is a large-scale multilingual speech recognition model based on a Transformer encoder, equipped with a CTC head and a language identification head. It can process speech input in 60 languages and output character-level transcribed text (preserving punctuation and capitalization).

Model Features

Multilingual Support
Supports speech recognition in 60 languages with language identification capability
Large-scale Training
Based on a Transformer architecture with 1 billion parameters, trained on data from Common Voice and VoxPopuli
Character-level Transcription
Output preserves the original text's punctuation and capitalization format
End-to-End Model
Directly recognizes from 16kHz audio signals using Mel filterbank features

Model Capabilities

Multilingual Speech Recognition
Language Identification
Character-level Text Transcription

Use Cases

Speech-to-Text
Automatic Meeting Transcription
Automatically converts multilingual meeting recordings into text transcripts
Voice Assistants
Supports multilingual voice command recognition
Speech Analysis
Multilingual Content Analysis
Analyzes speech content in different languages
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase