D

Dprnntasnet Ks2 WHAM Sepclean

Developed by mpariente
A speech separation model trained on the Asteroid framework using the WHAM! dataset, focusing on clean speech separation tasks.
Downloads 512
Release Time : 3/2/2022

Model Overview

This model adopts the DPRNN architecture, specifically designed to separate clean speech signals from mixed audio, suitable for speech enhancement and separation tasks.

Model Features

Efficient Speech Separation
Utilizes the DPRNN architecture to effectively process long-sequence audio signals, achieving high-quality speech separation.
Low Sample Rate Support
Supports audio input at 8000Hz sampling rate, suitable for various speech processing scenarios.
Lightweight Design
Features a lightweight design with a kernel size of 2 and 64 filters, balancing performance and computational efficiency.

Model Capabilities

Audio Separation
Speech Enhancement
Multi-Speaker Separation

Use Cases

Speech Processing
Conference Recording Separation
Separate clear speech of individual speakers from multi-person conference recordings
SI-SDR improvement of 19.32dB
Speech Enhancement
Extract clear speech from noisy recordings
STOI improvement of 0.24
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase