# Speaker diarization
Speaker Diarization 2.5
MIT
A speaker diarization model derived from pyannote/speaker-diarization-3.0 that uses speechbrain/spkrec-ecapa-voxceleb for speaker embeddings and performs better in certain tests.
Speaker Analysis
Willy030125
26
0
Segmentation 3.0
MIT
This is an audio segmentation model capable of detecting speaker changes, voice activity, and overlapping speech, suitable for audio analysis in multi-speaker scenarios.
Audio Processing
fatymatariq
1,228
0
Kotoba Whisper V2.2
Apache-2.0
Japanese automatic speech recognition model based on Whisper, with integrated speaker diarization and punctuation restoration.
Speech Recognition
Transformers Japanese

kotoba-tech
22.80k
47
Speaker Segmentation Fine Tuned Callhome Jpn
MIT
This is a speaker diarization model fine-tuned from the pyannote/segmentation-3.0 base model, specifically optimized for Japanese telephone conversation scenarios.
Speaker Analysis
Transformers

kamilakesbi
18
0
Pyannote Segmentation 30
MIT
This is an audio processing model for speaker diarization, capable of detecting speech activity, overlapping speech, and multiple speakers.
Audio Processing
collinbarnwell
873
0
Speaker Diarization Optimized
MIT
The pyannote.audio speaker diarization pipeline, which automatically detects speaker changes in audio and splits it into per-speaker segments.
Speaker Analysis
G-Root
349
0
Segmentation 3.0
MIT
This is a powerset-encoded speaker diarization model capable of processing 10-second audio clips to identify multiple speakers and their overlapping speech.
Speaker Analysis
pyannote
12.6M
445
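The "powerset-encoded" wording in the entry above can be illustrated with a small sketch. Instead of predicting one independent on/off label per speaker, a powerset model predicts a single class per frame, where each class is one allowed combination of active speakers. The helper names below are hypothetical, and the sizes used (3 speakers per clip, at most 2 speaking at once) are assumptions chosen for illustration; the listing itself does not state them.

```python
from itertools import combinations


def powerset_classes(num_speakers, max_overlap):
    """List every set of at most `max_overlap` simultaneously active speakers.

    Index 0 is always the empty set (no speech).
    """
    classes = []
    for size in range(max_overlap + 1):
        classes.extend(combinations(range(num_speakers), size))
    return classes


def to_multilabel(class_index, classes, num_speakers):
    """Convert one powerset class index back to a per-speaker 0/1 vector."""
    active = classes[class_index]
    return [1 if speaker in active else 0 for speaker in range(num_speakers)]


# Assumed sizes for illustration only: 3 speakers, at most 2 overlapping.
classes = powerset_classes(num_speakers=3, max_overlap=2)
for index, speakers in enumerate(classes):
    print(index, speakers, to_multilabel(index, classes, 3))
```

Treating each combination as its own class turns overlapped speech into ordinary single-label classification, so no per-speaker detection threshold has to be tuned.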
Pyannote Segmentation
MIT
This is an end-to-end speaker diarization model that supports voice activity detection, overlapped speech detection, and resegmentation.
Speaker Analysis
philschmid
427
9
Pyannote Speaker Diarization Endpoint
MIT
Speaker diarization model based on pyannote.audio 2.0 that automatically detects speaker changes and speech activity in audio.
Speaker Analysis
philschmid
51
18
Speaker Diarization
MIT
Speaker diarization model based on pyannote.audio 2.1.1 that automatically detects speaker changes and overlapped speech in audio.
Speaker Analysis
pyannote
910.93k
1,038
Overlapped Speech Detection
MIT
A pre-trained model for detecting overlapped speech in audio, capable of identifying time segments where two or more speakers are active simultaneously.
Speaker Analysis
pyannote
144.68k
35