V

Voc2vec As Pt

Developed by alkiskoudounas
voc2vec is a foundational model specifically designed for non-linguistic human data, built upon the wav2vec 2.0 framework.
Downloads 31
Release Time : 2/6/2025

Model Overview

This model is used for non-linguistic audio classification tasks, particularly for recognizing non-linguistic vocalizations such as infant cries.

Model Features

Non-linguistic audio processing
Model optimized specifically for non-linguistic human sounds (e.g., infant cries)
Multi-dataset pre-training
Pre-trained on 10 datasets containing approximately 125 hours of non-linguistic audio
Continued training based on AudioSet
Further pre-training from a model initially trained on the AudioSet dataset

Model Capabilities

Non-linguistic audio classification
Infant cry recognition
Audio feature extraction

Use Cases

Healthcare
Infant cry analysis
Used to identify and analyze different types of infant cries
Speech research
Non-linguistic vocalization research
Used to study the characteristics and patterns of human non-linguistic vocalizations
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase