M

Mms Tts Pcm

Developed by facebook
A Nigerian Pidgin text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis
Downloads 47
Release Time : 9/1/2023

Model Overview

This model is part of Meta's Massively Multilingual Speech (MMS) project, specifically designed to convert Nigerian Pidgin text into natural speech. It employs the VITS end-to-end architecture, combining variational inference and adversarial training techniques.

Model Features

End-to-End Speech Synthesis
Based on the VITS architecture, it directly generates high-quality speech waveforms from text without intermediate feature extraction
Multilingual Support
As part of the MMS project, it supports multiple languages including Nigerian Pidgin
Stochastic Duration Prediction
Achieves varied rhythm speech synthesis for the same text through stochastic duration prediction
High-Quality Vocoder
Uses a vocoder structure similar to HiFi-GAN to generate natural and fluent speech

Model Capabilities

Text-to-Speech
Multilingual Speech Synthesis
Speech Waveform Generation

Use Cases

Speech Technology Applications
Voice Assistants
Provides localized voice interaction experiences for Nigerian Pidgin users
Generates natural and fluent voice responses
Audiobooks
Converts Nigerian Pidgin text content into speech
Supports speech output with varied rhythms and intonations
Language Learning
Helps learners acquire standard pronunciation of Nigerian Pidgin
Provides accurate speech demonstrations
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase