Convtasnet Libri2Mix Sepclean 8k
ConvTasNet model trained on the Asteroid framework for speech separation tasks at 8kHz sampling rate, trained on the Libri2Mix dataset.
Downloads 179
Release Time : 3/2/2022
Model Overview
This model is used to separate clean speech signals from mixed audio, particularly suitable for dual-speaker scenarios.
Model Features
Efficient speech separation
Effectively separates speakers in mixed speech at 8kHz sampling rate
Optimized ConvTasNet architecture
Uses a deep network structure with 8 blocks and 3 repetitions, featuring 128 skip connection channels
Lightweight training
Efficient training with a batch size of 24 and audio segment length of 3 seconds
Model Capabilities
Dual-speaker speech separation
Audio signal enhancement
Speech signal processing
Use Cases
Speech processing
Meeting recording separation
Separates individual speaker voices from multi-person meeting recordings
SI-SDR improvement of 14.76dB
Speech enhancement
Extracts target speech from background noise or other speakers
STOI improvement of 0.218
Featured Recommended AI Models
Š 2025AIbase