ConvTasNet Libri3Mix Sepclean 16k

Developed by JorisCos
A ConvTasNet speech separation model trained with the Asteroid framework on the Libri3Mix dataset. It expects audio input sampled at 16 kHz.
Downloads 48
Release Time: 3/2/2022

Model Overview

This model is an audio-to-audio conversion model specifically designed to separate clean speech signals from mixed audio.
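As a rough illustration of how such a model might be called, the sketch below loads the checkpoint through Asteroid and runs it on a single mono mixture. The Hub identifier `JorisCos/ConvTasNet_Libri3Mix_sepclean_16k` is an assumption inferred from the model name and author shown on this page, and the helper names `check_sample_rate` and `separate_mixture` are hypothetical. It assumes `asteroid` and `torch` are installed; they are imported lazily so the sample-rate check works without them.

```python
# Assumed Hugging Face Hub id, inferred from the model name and author above.
MODEL_ID = "JorisCos/ConvTasNet_Libri3Mix_sepclean_16k"
EXPECTED_SAMPLE_RATE = 16_000  # the model card states 16 kHz input
N_SOURCES = 3                  # Libri3Mix mixtures contain three speakers


def check_sample_rate(sample_rate: int) -> None:
    """Raise ValueError unless the audio is sampled at 16 kHz."""
    if sample_rate != EXPECTED_SAMPLE_RATE:
        raise ValueError(
            f"expected {EXPECTED_SAMPLE_RATE} Hz audio, got {sample_rate} Hz"
        )


def separate_mixture(mixture, sample_rate: int):
    """Separate a mono mixture (1-D float torch tensor) into source estimates.

    Returns a tensor of shape (n_sources, time). Hypothetical helper, not
    part of Asteroid itself.
    """
    check_sample_rate(sample_rate)

    # Lazy imports: only needed when separation is actually run.
    import torch
    from asteroid.models import ConvTasNet

    model = ConvTasNet.from_pretrained(MODEL_ID)
    model.eval()
    with torch.no_grad():
        # Input shape (batch, time); output shape (batch, n_sources, time).
        estimates = model(mixture.unsqueeze(0))
    return estimates.squeeze(0)
```

Audio recorded at another rate would need to be resampled to 16 kHz before calling `separate_mixture`; the check fails early instead of letting the model produce degraded output silently.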

Model Features

Efficient speech separation
Separates the speech of multiple overlapping speakers (three per mixture in Libri3Mix) from a single mixed recording.
Optimized ConvTasNet architecture
Uses a ConvTasNet architecture configured with 8 convolutional blocks per repeat and 3 repeats.
High-quality separation results
Improves SI-SDR by 12.3 dB and SDR by 12.77 dB on the Libri3Mix test set.
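To give a feel for what "8 blocks and 3 repetitions" means in practice, the following sketch estimates the temporal receptive field of such a stack. It assumes the standard Conv-TasNet layout, in which the dilation doubles with each block inside a repeat (1, 2, 4, ..., 128). The convolution kernel size of 3, encoder kernel size of 32 samples, and encoder stride of 16 samples are assumptions, not values stated on this page; the function names are illustrative.

```python
def tcn_receptive_field_frames(n_blocks: int, n_repeats: int,
                               conv_kernel_size: int) -> int:
    """Receptive field, in encoder frames, of a stack of dilated 1-D convs.

    Each block with dilation d and kernel size k widens the field by
    (k - 1) * d frames; dilations restart at 1 in every repeat.
    """
    dilations = [2 ** b for b in range(n_blocks)]
    per_repeat = sum((conv_kernel_size - 1) * d for d in dilations)
    return 1 + n_repeats * per_repeat


def frames_to_samples(n_frames: int, enc_kernel_size: int,
                      enc_stride: int) -> int:
    """Convert a span of encoder frames into a span of waveform samples."""
    return (n_frames - 1) * enc_stride + enc_kernel_size


# 8 blocks and 3 repeats come from the model card; the rest are assumptions.
frames = tcn_receptive_field_frames(n_blocks=8, n_repeats=3, conv_kernel_size=3)
samples = frames_to_samples(frames, enc_kernel_size=32, enc_stride=16)
seconds = samples / 16_000

print(frames, samples, round(seconds, 3))  # 1531 24512 1.532
```

Under these assumptions each output frame can draw on roughly 1.5 seconds of audio context.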

Model Capabilities

Multi-speaker speech separation
Audio signal enhancement
16kHz audio processing

Use Cases

Speech processing
Meeting recording enhancement
Separates individual speaker voices from multi-speaker meeting recordings to improve speech recognition accuracy.
SI-SDR improvement of 12.3 dB, SDR improvement of 12.77 dB
Speech signal dereverberation
Extracts clean speech signals from noisy environments to improve speech quality.
STOI improvement of 0.255
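For readers unfamiliar with the SI-SDR figures quoted above, here is a minimal pure-Python sketch of the usual scale-invariant SDR definition: the estimate is projected onto the reference, and the energy of that projection is compared with the energy of the residual. The "improvement" is the SI-SDR of the separated estimate minus the SI-SDR of the unprocessed mixture against the same reference. This is an illustrative implementation, not the evaluation code used to produce the numbers on this page, and the function names are hypothetical.

```python
import math


def _zero_mean(signal):
    """Return a copy of the signal with its mean removed."""
    mean = sum(signal) / len(signal)
    return [x - mean for x in signal]


def si_sdr(estimate, reference) -> float:
    """Scale-invariant signal-to-distortion ratio in dB."""
    est = _zero_mean(estimate)
    ref = _zero_mean(reference)
    ref_energy = sum(r * r for r in ref)
    # Optimal scaling of the reference that best explains the estimate.
    alpha = sum(e * r for e, r in zip(est, ref)) / ref_energy
    target = [alpha * r for r in ref]
    error = [e - t for e, t in zip(est, target)]
    target_energy = sum(t * t for t in target)
    error_energy = sum(x * x for x in error)
    if error_energy == 0.0:
        return math.inf  # perfect reconstruction up to scale
    return 10.0 * math.log10(target_energy / error_energy)


def si_sdr_improvement(estimate, mixture, reference) -> float:
    """SI-SDR gain of the estimate over the unprocessed mixture, in dB."""
    return si_sdr(estimate, reference) - si_sdr(mixture, reference)


# Toy signals: the interference is orthogonal to the reference.
reference = [1.0, -1.0, 1.0, -1.0]
interference = [1.0, 1.0, -1.0, -1.0]
mixture = [r + n for r, n in zip(reference, interference)]
estimate = [2.0 * r + 0.2 * n for r, n in zip(reference, interference)]

print(round(si_sdr_improvement(estimate, mixture, reference), 2))
```

In this toy case the mixture scores about 0 dB and the estimate about 20 dB, so the improvement is about 20 dB. Rescaling the estimate leaves the score unchanged, which is the "scale-invariant" part of the metric.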