Dprnntasnet Ks2 WHAM Sepclean
A speech separation model trained on the Asteroid framework using the WHAM! dataset, focusing on clean speech separation tasks.
Downloads 512
Release Time : 3/2/2022
Model Overview
This model adopts the DPRNN architecture, specifically designed to separate clean speech signals from mixed audio, suitable for speech enhancement and separation tasks.
Model Features
Efficient Speech Separation
Utilizes the DPRNN architecture to effectively process long-sequence audio signals, achieving high-quality speech separation.
Low Sample Rate Support
Supports audio input at 8000Hz sampling rate, suitable for various speech processing scenarios.
Lightweight Design
Features a lightweight design with a kernel size of 2 and 64 filters, balancing performance and computational efficiency.
Model Capabilities
Audio Separation
Speech Enhancement
Multi-Speaker Separation
Use Cases
Speech Processing
Conference Recording Separation
Separate clear speech of individual speakers from multi-person conference recordings
SI-SDR improvement of 19.32dB
Speech Enhancement
Extract clear speech from noisy recordings
STOI improvement of 0.24
Featured Recommended AI Models
Š 2025AIbase