# Zero-shot transfer learning
Florence 2 Large DOTA V1.0 Lmmrotate
MIT
LMMRotate is a fine-tuned large multimodal language model designed for rotated (oriented) object detection, and is particularly suited to aerial image analysis.
Image-to-Text
TensorBoard English

Qingyun
17
1
Ssast Small Patch Audioset 16 16
BSD-3-Clause
An audio classification model pre-trained on AudioSet and LibriSpeech that uses a vision transformer architecture to process audio spectrograms.
Audio Classification
Transformers

Simon-Kotchou
2,408
1
Autotrain Vision Tcg 40463105224
A multi-class image classification model trained with AutoTrain; all reported evaluation metrics on the validation set are 1.0.
Image Classification
Transformers

micazevedo
16
0
Mt5 Small
Apache-2.0
mT5 is the multilingual variant of the T5 model, supporting 101 languages, pretrained on the mC4 corpus, suitable for multilingual text generation and understanding tasks.
Large Language Model
Supports Multiple Languages
google
139.42k
149
Wav2vec2 Lv 60 Espeak Cv Ft
Apache-2.0
This model is based on the pre-trained Wav2Vec2-Large-LV60 model and fine-tuned on the CommonVoice dataset for multilingual phoneme recognition.
Speech Recognition
Transformers Other

facebook
18.77k
43
Wav2vec2 Xlsr 53 Espeak Cv Ft
Apache-2.0
A multilingual phoneme recognition model, fine-tuned from the pre-trained wav2vec2-large-xlsr-53 model on the CommonVoice dataset; it recognizes phoneme labels in multiple languages.
Speech Recognition
Transformers

facebook
315.39k
31