# Long audio processing

Aero 1 Audio
MIT
Lightweight audio model, excelling in speech recognition, audio understanding, and executing audio instructions among other diverse tasks
Audio-to-Text Transformers English
A
lmms-lab
1,348
74
Whisper Large V3 Vaani Hindi
Apache-2.0
A Hindi speech recognition model fine-tuned based on OpenAI's Whisper-Large-V3, trained on approximately 718 hours of transcribed Hindi speech data
Speech Recognition Safetensors
W
ARTPARK-IISc
15.55k
3
Whisper Large V3 Turbo
MIT
Whisper large-v3-turbo is an automatic speech recognition and speech translation model proposed by OpenAI, trained with large-scale weak supervision and supporting multiple languages.
Speech Recognition Transformers Supports Multiple Languages
W
Daemontatox
26
1
Chunkformer Large Vie
A large-scale Vietnamese automatic speech recognition model based on the ChunkFormer architecture, fine-tuned on approximately 3000 hours of publicly available Vietnamese speech data, with excellent performance.
Speech Recognition PyTorch Other
C
khanhld
1,765
12
Whisper Large V3 Turbo Turkish
MIT
A Turkish speech recognition model fine-tuned on the Common Voice 17.0 dataset based on openai/whisper-large-v3-turbo
Speech Recognition Transformers Other
W
selimc
289
6
Whisper Large V3 Turbo
Apache-2.0
Whisper large-v3-turbo is a distilled version of OpenAI Whisper large-v3, with the decoder layers reduced from 32 to 4, significantly improving speed while slightly reducing quality.
Speech Recognition Supports Multiple Languages
W
deepdml
883
6
Faster Whisper Large V3 Ru Podlodka Int8
Apache-2.0
This is a Russian speech recognition model based on the OpenAI Whisper architecture, optimized for Russian speech-to-text tasks and converted to ctranslate2 format for improved inference efficiency.
Speech Recognition Other
F
bzikst
29
3
Whisper Tiny En
Other
An English speech recognition and translation model optimized for mobile deployment, implemented by Qualcomm.
Speech Recognition PyTorch
W
qualcomm
3,269
7
Nb Whisper Base
Apache-2.0
An automatic speech recognition model developed by the National Library of Norway, based on the OpenAI Whisper architecture, supporting transcription in Norwegian and English.
Speech Recognition Transformers
N
NbAiLab
1,629
2
Nb Whisper Large
Apache-2.0
An automatic Norwegian speech recognition model launched by the National Library of Norway, developed based on OpenAI's Whisper architecture, supporting multiple Norwegian dialects and English.
Speech Recognition Transformers Supports Multiple Languages
N
NbAiLab
5,214
26
Nb Whisper Large
Apache-2.0
An automatic speech recognition model developed by the National Library of Norway, based on the Whisper architecture, supporting speech transcription and translation of Norwegian and English.
Speech Recognition Transformers
N
NbAiLabBeta
776
9
Whisper Large V3
Apache-2.0
Whisper is an advanced automatic speech recognition (ASR) and speech translation model proposed by OpenAI, trained on over 5 million hours of labeled data, with strong cross-dataset and cross-domain generalization capabilities.
Speech Recognition Supports Multiple Languages
W
openai
4.6M
4,321
Whisper Tamil Large V2
Apache-2.0
Tamil speech recognition model fine-tuned based on OpenAI Whisper-large-v2, trained on multiple public Tamil ASR corpora
Speech Recognition Other
W
vasista22
325
7
Wav2vec2 Large Xls R 300m Bg
Apache-2.0
An automatic speech recognition model fine-tuned on the Common Voice 8 Bulgarian dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers Other
W
anuragshas
1,469
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase