W

Wespeaker Voxceleb Resnet34 LM

Developed by Wespeaker
A speaker embedding model based on the ResNet34 architecture, fine-tuned with large margin, trained on the VoxCeleb2 dataset, supporting tasks such as speaker recognition and similarity calculation.
Downloads 33
Release Time : 2/26/2024

Model Overview

This model is used to extract speaker feature embeddings and supports tasks such as speaker recognition, similarity calculation, and speech segmentation.

Model Features

Large Margin Fine-tuning
The model is fine-tuned with a large margin, improving the accuracy of speaker recognition.
Efficient Inference
The model has a moderate number of parameters with a computational load of 4.55G, making it suitable for practical deployment.
Multi-functional Support
Supports various tasks such as speaker embedding extraction, similarity calculation, and speech segmentation.

Model Capabilities

Speaker Feature Extraction
Speaker Similarity Calculation
Speech Segmentation
Speaker Recognition
Speaker Registration and Verification

Use Cases

Security Authentication
Voiceprint Recognition System
A voiceprint recognition system for identity verification
Achieved an EER (Equal Error Rate) of 0.723 on the VoxCeleb test set
Speech Analysis
Meeting Speech Segmentation
Automatically segments speech from different speakers in meeting recordings
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase