C

Csm Expressiva 1b

Developed by senstella
An emotional speech model fine-tuned based on the CSM-1b conversational speech model, supporting whisper-style speech synthesis
Downloads 105
Release Time : 4/10/2025

Model Overview

This model fine-tunes the CSM base model through SFT, utilizing whisper-style speech data from the Expresso dataset, validating the LoRA fine-tuning effects of the csm-mlx codebase, capable of generating speech with specific emotional characteristics.

Model Features

Whisper-style speech synthesis
Capable of generating emotional speech with specific whisper-style characteristics
LoRA fine-tuning optimization
Uses Low-Rank Adaptation (LoRA) technology for efficient fine-tuning, adding new features while preserving the base model's capabilities
Lightweight training
Can be trained on a MacBook Air with 16GB memory, suitable for resource-limited environments
Improved stability
Significantly reduces typical base model failures (such as infinite silence) through fine-tuning

Model Capabilities

Text-to-Speech
Emotional speech synthesis
Whisper-style generation

Use Cases

Speech synthesis
Emotional voice assistant
Adds whisper and other emotional speech output capabilities to voice assistants
Capable of generating natural emotional speech
Audio content creation
Provides diverse speech styles for audiobooks, podcasts, and other content creation
Can generate speech content with specific styles
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase