S

Speaker Diarization 2.5

Developed by Willy030125
A speaker diarization model modified based on pyannote/speaker-diarization-3.0, using speechbrain/spkrec-ecapa-voxceleb for speaker embedding, with better performance in certain tests
Downloads 26
Release Time : 3/24/2025

Model Overview

Used for speaker segmentation and change detection in audio, supporting automatic voice activity detection, overlapping speech detection, and automatic speaker counting

Model Features

Automatic speaker counting
No need to manually specify the number of speakers, the model can automatically detect
Improved speaker embedding
Uses speechbrain/spkrec-ecapa-voxceleb for speaker embedding, with better performance in certain scenarios
Fully automatic processing
No manual voice activity detection or hyperparameter tuning required
GPU acceleration support
Supports GPU processing with a real-time factor of about 2.5%

Model Capabilities

Speaker diarization
Speaker change detection
Voice activity detection
Overlapping speech detection
Automatic speaker counting

Use Cases

Meeting transcription
Meeting transcription analysis
Automatically identifies speech segments from different speakers in meetings
DER 12.3% (AISHELL-4 dataset)
Speech transcription
Automatic speech recognition preprocessing
Provides speaker segmentation information for ASR systems
Media analysis
Broadcast program analysis
Analyzes speaking patterns of different hosts and guests in broadcast programs
DER 7.8% (REPERE dataset)
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase