AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Speech transcription

# Speech transcription

Phowhisper Large
Bsd-3-clause
PhoWhisper is a system specifically designed for Vietnamese automatic speech recognition, fine-tuned based on the Whisper model, supporting various Vietnamese accents.
Speech Recognition Transformers Other
P
vinai
2,373
28
Ai Light Dance Singing Ft Wav2vec2 Large Xlsr 53
Apache-2.0
This model is an automatic speech recognition model fine-tuned on the AI_LIGHT_DANCE - ONSET-SINGING dataset based on facebook/wav2vec2-large-xlsr-53, primarily used for singing voice recognition tasks.
Speech Recognition Transformers
A
gary109
23
1
Bp500 Base100k Voxpopuli
Apache-2.0
Speech recognition model optimized for Brazilian Portuguese, trained with 453 hours of audio from 7 public datasets
Speech Recognition Transformers Other
B
lgris
23
1
Wav2vec2 Large Pitch Recognition
Apache-2.0
A speech recognition model fine-tuned on Japanese accent datasets based on facebook/wav2vec2-large-xlsr-53
Speech Recognition Transformers Japanese
W
vumichien
15
2
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase