W

Wav2vec2 Large Superb Sid

Developed by superb
Speaker identification model based on the Wav2Vec2-Large architecture, trained on the VoxCeleb1 dataset for classifying speech by speaker identity
Downloads 27
Release Time : 3/2/2022

Model Overview

This model is an audio classification model for speaker identification, fine-tuned from Facebook's wav2vec2-large-lv60 model, capable of recognizing and classifying speech features from different speakers.

Model Features

High Accuracy
Achieves 86.13% accuracy on the VoxCeleb1 test set
Based on Wav2Vec2 Pre-trained Model
Leverages the powerful speech representation capabilities of wav2vec2-large-lv60 for fine-tuning
16kHz Speech Support
Optimized specifically for 16kHz sampled speech audio

Model Capabilities

Speaker Identification
Speech Classification
Audio Feature Extraction

Use Cases

Security Authentication
Voice Identity Verification
Authenticates user identity through voice recognition for security purposes
Speech Analysis
Meeting Transcript Analysis
Identifies speech segments from different speakers in meeting recordings
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase