W

Wav2vec2 Xls R Adult Child Cls

Developed by bookbot
An audio classification model based on the XLS-R architecture, designed to distinguish between adult and child speech.
Downloads 20
Release Time : 3/2/2022

Model Overview

This model is a fine-tuned version of wav2vec2-xls-r-300m on a private adult/child speech classification dataset, primarily used for speech classification tasks.

Model Features

High accuracy
Achieves 94.69% accuracy and an F1 score of 0.9508 on the evaluation dataset.
Based on XLS-R architecture
Utilizes the powerful feature extraction capabilities of the XLS-R architecture for speech classification.
Efficient training
Optimizes the training process using gradient accumulation and linear learning rate scheduling.

Model Capabilities

Audio classification
Adult/Child speech differentiation

Use Cases

Speech analysis
Child speech recognition
Used to identify and classify child speech, suitable for educational or child product applications.
94.69% accuracy
Adult speech recognition
Used to identify and classify adult speech, suitable for customer service or voice assistant applications.
F1 score 0.9508
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase