W

Wav2vec2 Large Robust 24 Ft Age Gender

Developed by audeering
This model takes raw audio signals as input and outputs age predictions and gender probabilities (child/female/male), along with the pooled state of the last transformer layer.
Downloads 44.13k
Release Time : 9/4/2023

Model Overview

A voice age and gender recognition model obtained by fine-tuning Wav2Vec2-Large-Robust on multiple datasets, capable of predicting speaker age and gender from raw audio.

Model Features

Multi-dataset training
Trained on multiple datasets including aGender, Mozilla Common Voice, Timit, and Voxceleb 2 to enhance model generalization
End-to-end processing
Directly processes raw audio signals without complex feature engineering
Multi-task output
Simultaneously outputs age predictions, gender probabilities, and transformer pooled states
Strong robustness
Based on the Wav2Vec2-Large-Robust architecture, offering strong robustness against noise and speech variations

Model Capabilities

Voice age recognition
Voice gender classification
Voice feature extraction

Use Cases

Speech analysis
Speaker demographics
Analyze the age and gender distribution of speakers from voice data
Can output age predictions (0-100 years) and gender probabilities
Voice interaction systems
Provide user demographic information for voice assistants to enable personalized interactions
Voice data analysis
Extract speaker age and gender features from large volumes of voice data
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase