M

Matxa Tts Cat Multispeaker

Developed by projecte-aina
A Catalan multi-speaker text-to-speech model based on Matcha-TTS architecture, trained with optimal transport conditional flow matching for fast and high-quality speech synthesis
Downloads 21
Release Time : 3/28/2024

Model Overview

Matxa-TTS is a non-autoregressive text-to-speech model specifically designed for Catalan, supporting multi-speaker speech synthesis. It employs an encoder-decoder architecture combined with optimal transport conditional flow matching training, capable of generating high-quality speech output with fewer synthesis steps.

Model Features

Multi-speaker support
Supports speech synthesis for 47 Catalan speakers
Fast high-quality synthesis
Uses optimal transport conditional flow matching training to generate high-quality speech with fewer synthesis steps
Efficient architecture
Transformer-based U-Net decoder structure with 1D CNN to reduce memory consumption and improve synthesis speed
Language-specific optimization
Fine-tuned using Catalan phonemizer and dedicated datasets for optimized native language support

Model Capabilities

Catalan text-to-speech
Multi-speaker speech synthesis
Adjustable speech rate and generation temperature
High-quality speech output

Use Cases

Speech synthesis applications
Voice assistants
Provides natural speech output for Catalan voice assistants
Supports multiple speaker voice options
Audiobooks
Converts Catalan text into natural speech
Allows adjustment of speech rate and intonation as needed
Assistive technology
Offers text-to-speech functionality in Catalan for visually impaired users
Supports multiple voice options to meet personal preferences
Featured Recommended AI Models
ยฉ 2025AIbase