ConvTasNet Libri3Mix Sepclean 16k
A ConvTasNet model trained with the Asteroid framework on the Libri3Mix dataset for speech separation. It accepts audio sampled at 16 kHz.
Downloads 48
Release Time: 3/2/2022
Model Overview
This is an audio-to-audio model. It takes mixed audio in which several speakers overlap and outputs a separate clean speech signal for each speaker.
Model Features
Efficient speech separation
Effectively separates speech signals of multiple speakers from mixed audio.
Optimized ConvTasNet architecture
Uses a ConvTasNet architecture with 8 convolutional blocks repeated 3 times, which keeps audio processing efficient.
High-quality separation results
Achieves an SI-SDR improvement of 12.3 dB and an SDR improvement of 12.77 dB on the Libri3Mix test set.
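To give a feel for what the 8-block, 3-repeat configuration means in practice, the sketch below estimates how many encoder frames the separator can "see" at once. It assumes the usual ConvTasNet layout, in which block b of each repeat uses dilation 2**b, and a kernel size of 3. The kernel size is an assumption and is not stated on this page; the function name is hypothetical.

```python
def tcn_receptive_field(n_blocks: int, n_repeats: int, kernel_size: int = 3) -> int:
    """Receptive field, in encoder frames, of stacked dilated 1-D conv blocks.

    Assumes block b of every repeat uses dilation 2**b.
    kernel_size=3 is an assumed value, not one given in the model description.
    """
    field = 1  # a single frame before any convolution is applied
    for _ in range(n_repeats):
        for b in range(n_blocks):
            # each dilated convolution widens the field by (kernel_size - 1) * dilation
            field += (kernel_size - 1) * 2 ** b
    return field


frames = tcn_receptive_field(n_blocks=8, n_repeats=3)
print(frames)
```

Converting this frame count into seconds would also require the encoder stride, which this page does not give.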
Model Capabilities
Multi-speaker speech separation
Audio signal enhancement
16kHz audio processing
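Because the model expects 16 kHz input, it can help to check a recording's sample rate before passing it in. The following is a minimal sketch using only the Python standard library; the helper names are hypothetical, and it handles uncompressed PCM WAV files only.

```python
import os
import tempfile
import wave

EXPECTED_RATE = 16000  # sample rate stated for this model


def needs_resampling(path: str) -> bool:
    """Return True if the PCM WAV file at `path` is not sampled at 16 kHz."""
    with wave.open(path, "rb") as wav:
        return wav.getframerate() != EXPECTED_RATE


def write_silence(path: str, rate: int, seconds: float = 0.1) -> None:
    """Write a short mono 16-bit silent WAV file (used here only for the demo)."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(rate * seconds))


# Demo: one file at the expected rate, one at 8 kHz.
with tempfile.TemporaryDirectory() as tmp:
    ok_path = os.path.join(tmp, "ok.wav")
    low_path = os.path.join(tmp, "low.wav")
    write_silence(ok_path, 16000)
    write_silence(low_path, 8000)
    ok_needs = needs_resampling(ok_path)
    low_needs = needs_resampling(low_path)

print(ok_needs, low_needs)
```

A file flagged by this check would need to be resampled to 16 kHz with an audio tool of your choice before separation.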
Use Cases
Speech processing
Meeting recording enhancement
Separates individual speaker voices from multi-speaker meeting recordings to improve speech recognition accuracy.
SI-SDR improvement of 12.3 dB, SDR improvement of 12.77 dB
Speech signal dereverberation
Extracts clean speech signals from noisy environments to improve speech quality.
STOI improvement of 0.255
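The SI-SDR improvement quoted above is the SI-SDR of the separated output minus the SI-SDR of the unprocessed mixture, both measured against the clean reference. The sketch below implements the standard scale-invariant SDR definition in plain Python on toy sine signals. It is an illustration only, not the evaluation code used for this model, and it skips the mean removal that evaluation toolkits usually apply first.

```python
import math


def si_sdr(estimate, reference):
    """Scale-invariant SDR in dB between two equal-length sample sequences."""
    dot = sum(e * r for e, r in zip(estimate, reference))
    ref_energy = sum(r * r for r in reference)
    scale = dot / ref_energy  # projection of the estimate onto the reference
    target = [scale * r for r in reference]
    noise = [e - t for e, t in zip(estimate, target)]
    target_energy = sum(t * t for t in target)
    noise_energy = sum(n * n for n in noise)  # must be non-zero
    return 10 * math.log10(target_energy / noise_energy)


def si_sdr_improvement(estimate, mixture, reference):
    """SI-SDR gained by separation, relative to the raw mixture."""
    return si_sdr(estimate, reference) - si_sdr(mixture, reference)


# Toy data: a target tone, an interfering tone, and an imperfect separation
# that leaves one tenth of the interferer's amplitude in the output.
n = 1600
reference = [math.sin(2 * math.pi * 5 * i / n) for i in range(n)]
interferer = [math.sin(2 * math.pi * 13 * i / n) for i in range(n)]
mixture = [r + x for r, x in zip(reference, interferer)]
estimate = [r + 0.1 * x for r, x in zip(reference, interferer)]

improvement = si_sdr_improvement(estimate, mixture, reference)
print(round(improvement, 2))
```

In this toy case the mixture starts near 0 dB and the improvement is about 20 dB, because the residual interference has one tenth of its original amplitude.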