P

Pyannote Speaker Diarization Endpoint

Developed by philschmid
Speaker diarization model based on pyannote.audio 2.0 for automatic detection of speaker changes and speech activity in audio
Downloads 51
Release Time : 10/7/2022

Model Overview

This model is an end-to-end speaker diarization system capable of automatically detecting speaker changes, speech activity, and overlapping speech in audio, completing speaker diarization tasks without manual intervention.

Model Features

Fully automated processing
Performs segmentation without manual speech activity detection or specifying the number of speakers
Overlapping speech detection
Capable of detecting and handling overlapping speech scenarios
Speaker count adaptation
Automatically determines the number of speakers, also supports manual specification
High performance
Excellent performance across multiple benchmark datasets

Model Capabilities

Speaker diarization
Speech activity detection
Overlapping speech detection
Speaker change detection
Automatic speaker counting

Use Cases

Meeting transcription
Meeting transcription segmentation
Automatically segments different speakers in meeting recordings
Achieves 18.21% DER on the AMI dataset
Call recording analysis
Customer service call analysis
Automatically distinguishes between agent and customer speech segments
Achieves 30.24% DER on the CALLHOME dataset
Media content analysis
Interview program analysis
Automatically identifies hosts and guests in interview programs
Achieves 12.76% DER on the VoxConverse dataset
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase