S

Sepformer Whamr Enhancement

Developed by speechbrain
This model achieves speech enhancement (denoising + dereverberation) through the SepFormer architecture, pre-trained on the WHAMR! dataset (8kHz), with a test set SI-SNR of 10.59dB.
Downloads 570
Release Time : 3/2/2022

Model Overview

A Transformer-based speech enhancement model specifically designed to process speech signals containing environmental noise and reverberation, capable of simultaneous denoising and dereverberation.

Model Features

Dual-task processing
Capable of handling both speech denoising and dereverberation tasks simultaneously
Transformer architecture
Utilizes the SepFormer architecture with attention mechanisms for efficient speech separation
Low sampling rate optimization
Specifically optimized for 8kHz sampled speech signals

Model Capabilities

Speech denoising
Speech dereverberation
Audio quality enhancement

Use Cases

Speech processing
Call quality enhancement
Improves speech clarity in noisy environments for calls
SI-SNR improvement of 10.59dB, PESQ reaches 2.84
Meeting recording enhancement
Enhances recording quality in highly reverberant environments like conference rooms
Featured Recommended AI Models
ยฉ 2025AIbase