W

Wav2vec2 Base Superb Sid

Developed by superb
A speaker identification model fine-tuned on the VoxCeleb1 dataset based on the Wav2Vec2-base pre-trained model, designed for voice classification tasks
Downloads 1,489
Release Time : 3/2/2022

Model Overview

This model is a ported version of S3PRL's Wav2Vec2 for the SUPERB speaker identification task, capable of classifying each speech segment by its speaker identity

Model Features

Based on Wav2Vec2 Pre-trained Model
Uses facebook/wav2vec2-base as the base model, which is pre-trained on 16kHz sampled speech audio
Fine-tuned on VoxCeleb1 Dataset
Fine-tuned on the widely-used VoxCeleb1 dataset, suitable for speaker identification tasks
High Accuracy
Achieves 75.18% accuracy on the test set

Model Capabilities

Speaker Identification
Voice Classification
Audio Feature Extraction

Use Cases

Security Verification
Voiceprint Recognition System
Used for speaker identification in authentication systems
Can identify specific speaker identities
Speech Analysis
Meeting Transcription Analysis
Identifies speech segments from different speakers in meeting recordings
Automatically distinguishes between different speakers
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase