S

Sepformer Libri3mix

Developed by speechbrain
This is an audio source separation model based on the SepFormer architecture, trained on the Libri3Mix dataset, capable of separating mixed speech into multiple independent sound sources.
Downloads 1,511
Release Time : 9/16/2022

Model Overview

This model implements audio source separation using the SepFormer architecture, specifically designed for mixed speech scenarios, enabling the extraction of independent speech sources from mixed audio.

Model Features

High-performance separation capability
Achieves 19.8 dB SI-SNRi separation performance on the Libri3Mix test set.
Transformer-based architecture
Utilizes the advanced SepFormer architecture with self-attention mechanisms for efficient separation.
Multi-speaker separation
Capable of simultaneously separating multiple speaker voices from mixed audio.

Model Capabilities

Audio source separation
Multi-speaker speech separation
8kHz audio processing

Use Cases

Speech processing
Meeting recording separation
Separate multi-person meeting recordings into individual speaker audio tracks.
Clearly separates up to 3 simultaneous speech sources.
Speech enhancement
Extract clear speech from background noise.
Improves speech recognition accuracy.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase