D

Dprnntasnet Ks16 WHAM Sepclean

Developed by julien-c
This is an audio source separation model trained based on the Asteroid framework, specifically designed to extract clean speech signals from mixed audio.
Downloads 66
Release Time : 3/2/2022

Model Overview

The model utilizes the DPRNNTasNet architecture and is trained on the sep_clean task of the WHAM! dataset, effectively separating speech signals from mixed audio.

Model Features

Efficient audio separation
Adopts the DPRNN architecture to efficiently separate clean speech signals from mixed audio.
Small kernel size
Uses a smaller kernel size (16), aiding in capturing finer audio features.
High separation quality
Performs excellently on the WHAM! dataset, achieving an SI-SDR improvement of 18.23 dB.

Model Capabilities

Audio source separation
Speech signal extraction
Mixed audio processing

Use Cases

Speech processing
Speech enhancement
Extracts clear speech signals from noisy environments
SI-SDR improvement of 18.23 dB
Meeting transcription
Separates audio signals of multiple people speaking simultaneously
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase