MMS TTS Nan

Developed by facebook
A Southern Min text-to-speech model released by Meta, based on the VITS architecture and intended for high-quality speech synthesis
Downloads 861
Release Time: 9/1/2023

Model Overview

This model is part of Meta's Massively Multilingual Speech project, specifically designed for Southern Min (nan) text-to-speech synthesis, utilizing the end-to-end VITS architecture to achieve high-quality speech generation.

Model Features

End-to-end speech synthesis
Uses the VITS architecture to generate speech waveforms directly from text, without a separately trained vocoder stage
Multilingual support
The MMS project provides text-to-speech checkpoints for many languages; this checkpoint covers Southern Min
High-quality speech generation
Combines a conditional variational autoencoder with adversarial training to produce natural, fluent speech
Stochastic duration prediction
A stochastic duration predictor lets the same text be synthesized with varying rhythms, enhancing expressiveness
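
The features above can be tried out with a short sketch. It assumes the checkpoint is published on the Hugging Face Hub under the id facebook/mms-tts-nan and that a recent transformers release (one that ships VitsModel) and torch are installed; neither detail is stated on this page. The function name synthesize and its seed parameter are illustrative, not part of any library. Because duration prediction is random, the sketch fixes a seed so repeated calls give the same audio.

```python
def synthesize(text, model_id="facebook/mms-tts-nan", seed=0):
    """Return (waveform, sampling_rate) for `text`.

    `model_id` is an assumed Hub id; `seed` is a hypothetical convenience
    parameter used only to make the random duration sampling repeatable.
    """
    # Imported lazily so this module can be loaded without torch/transformers.
    import torch
    from transformers import AutoTokenizer, VitsModel

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = VitsModel.from_pretrained(model_id)

    # Fix the random seed before the forward pass for reproducible output.
    torch.manual_seed(seed)

    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        # The model output carries a batch of waveforms; take the first one.
        waveform = model(**inputs).waveform[0]
    return waveform, model.config.sampling_rate
```

Calling synthesize with a Southern Min sentence downloads the checkpoint on first use, so it needs network access and some disk space.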

Model Capabilities

Southern Min text-to-speech
High-quality speech synthesis
Variable rhythm speech generation
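
Variable rhythm can be sketched as a small helper. It assumes the model object exposes speaking_rate and noise_scale_duration attributes, as the VitsModel class in Hugging Face transformers does; the helper name set_rhythm and the default values are illustrative. A stand-in object replaces the real model here so the snippet runs without downloading anything.

```python
from types import SimpleNamespace


def set_rhythm(model, speaking_rate=1.0, noise_scale_duration=0.8):
    """Adjust pacing controls on a VITS-style model object and return it.

    speaking_rate: values above 1.0 speed speech up, below 1.0 slow it down.
    noise_scale_duration: how much randomness the duration predictor uses;
    0.0 removes the variation in rhythm between runs.
    """
    if speaking_rate <= 0:
        raise ValueError("speaking_rate must be positive")
    if noise_scale_duration < 0:
        raise ValueError("noise_scale_duration must be non-negative")
    model.speaking_rate = speaking_rate
    model.noise_scale_duration = noise_scale_duration
    return model


# Stand-in for a loaded model (assumption: a real VitsModel accepts the
# same attribute assignments).
fake_model = set_rhythm(SimpleNamespace(), speaking_rate=1.5,
                        noise_scale_duration=0.5)
```

With a real model, the helper would be called once after loading and before synthesis.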

Use Cases

Voice applications
Southern Min voice assistant
Develop voice interaction applications for Southern Min speakers
Generates natural and fluent Southern Min speech responses
Audiobook production
Convert Southern Min text into speech for audio content creation
Efficiently generates high-quality Southern Min narration audio
Language preservation
Southern Min digital preservation
Convert Southern Min text into speech for cultural preservation
Helps preserve and disseminate Southern Min cultural heritage
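
For use cases such as narration or archiving, the generated audio has to be saved to a file. The sketch below writes mono floating-point samples to a 16-bit PCM WAV file using only the Python standard library. The helper name write_wav is illustrative, and the 16000 Hz default is an assumption; in practice the rate should be read from the loaded model's configuration.

```python
import os
import struct
import tempfile
import wave


def write_wav(path, samples, sampling_rate=16000):
    """Write mono float samples in [-1.0, 1.0] as a 16-bit PCM WAV file."""
    frames = bytearray()
    for s in samples:
        # Clip to the valid range, then scale to signed 16-bit integers.
        s = max(-1.0, min(1.0, float(s)))
        frames += struct.pack("<h", int(round(s * 32767)))
    with wave.open(path, "wb") as f:
        f.setnchannels(1)       # mono
        f.setsampwidth(2)       # 2 bytes per sample = 16-bit
        f.setframerate(sampling_rate)
        f.writeframes(bytes(frames))


# Tiny demo with made-up samples; a real waveform would first be converted
# to a plain sequence of floats.
demo_path = os.path.join(tempfile.gettempdir(), "nan_demo.wav")
write_wav(demo_path, [0.0, 0.5, -0.5, 1.0])
```

Longer texts could be synthesized sentence by sentence and the resulting sample sequences concatenated before writing.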