# High-precision Audio Recognition
Wav2vec2 Large Emotion Detection German
Apache-2.0
A German speech emotion detection model based on wav2vec2, trained on the emo-DB dataset, capable of recognizing 7 different emotions.
Audio Classification
Transformers German

W
padmalcom
20
3
Ast Finetuned Audioset 14 14 0.443
Bsd-3-clause
An audio spectrogram transformer fine-tuned on the AudioSet dataset, which converts audio into spectrograms and processes them using a vision transformer architecture, achieving excellent performance in audio classification tasks.
Audio Classification
Transformers

A
MIT
194.20k
5
Ast Finetuned Audioset 12 12 0.447
Bsd-3-clause
An Audio Spectrogram Transformer (AST) fine-tuned on the AudioSet dataset, using ViT architecture to process audio spectrograms, achieving excellent performance on multiple audio classification benchmarks.
Audio Classification
Transformers

A
MIT
25
0
Ast Finetuned Audioset 10 10 0.448
Bsd-3-clause
An Audio Spectrogram Transformer (AST) fine-tuned on the AudioSet dataset, utilizing a vision transformer architecture to process audio spectrograms, achieving excellent performance in audio classification tasks.
Audio Classification
Transformers

A
MIT
326
0
Ast Finetuned Audioset 10 10 0.4593
Bsd-3-clause
The Audio Spectrogram Transformer (AST) is a model fine-tuned on AudioSet, which converts audio into spectrograms and applies a vision transformer for audio classification.
Audio Classification
Transformers

A
MIT
308.88k
311
Distil Wav2vec2 Xls R Adult Child Cls 64m
Apache-2.0
A distilled audio classification model based on XLS-R architecture for distinguishing between adult and child voices
Audio Classification
Transformers English

D
bookbot
15
1
Featured Recommended AI Models