M

Matxa Tts Cat Multiaccent

Developed by projecte-aina
The first neural speech synthesis model supporting multiple speakers and dialects, capable of generating high-quality emotional speech in four Catalan dialects
Downloads 139
Release Time : 4/16/2024

Model Overview

A non-autoregressive speech synthesis model based on Matcha-TTS architecture, working in conjunction with the alVoCat vocoder to support speech synthesis in four Catalan dialects

Model Features

Multidialect Support
Supports speech synthesis in four Catalan dialects: Balearic, Central, Northwestern, and Valencian
Multi-speaker Support
Each dialect includes 2 distinct speakers (one male and one female), offering a total of 8 voice options
Efficient Synthesis
Utilizes Optimal Transport Conditional Flow Matching (OT-CFM) training to generate high-quality output with fewer synthesis steps
Emotional Speech
Capable of generating natural speech with emotional expressiveness

Model Capabilities

Catalan text-to-speech
Multidialect speech synthesis
Multi-speaker speech synthesis
Emotional speech generation

Use Cases

Voice Assistants
Dialect Voice Assistant
Provides dialect-based voice interaction for Catalan users in different regions
Enhances user experience and familiarity
Audio Content Creation
Dialect Audiobooks
Generates audiobook content with regional characteristics
Preserves dialect features and cultural heritage
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase