# Deep Learning

Scarlet
This model is based on the Transformers library, with no specific functionality clearly stated
Large Language Model Transformers
S
agasta
426
0
Videomae Base Finetuned Kinetics 0408 Final 45sec Org
A video understanding model fine-tuned based on MCG-NJU/videomae-base-finetuned-kinetics, achieving an accuracy of 90.97% on the evaluation set
Video Processing Transformers
V
d2o2ji
26
0
Albert Base V1 Stackoverflow Prediction
This model is based on the transformers library, and its specific functionality and purpose require further information for confirmation.
Large Language Model Transformers
A
notshivain1
77
1
Urdu Text To Speech Tts
This model is based on the transformers library, and its specific purpose and functionality require further information to determine.
Large Language Model Transformers
U
ShigrafS
23
0
Detr Finetuned Rwy Obb
This model is built based on the Transformers library, and its specific functions and uses require further information to be supplemented.
Large Language Model Transformers
D
Spatiallysaying
13
0
Github Samples Tclassifier
This model is based on the Transformers library, but its specific purpose and functionality are not clearly stated.
Large Language Model Transformers
G
h1alexbel
176
2
Iris 7b
Apache-2.0
Iris is a deep learning-based Korean-English sentence translation model that achieves high-quality translation through advanced natural language processing technology.
Machine Translation Transformers Supports Multiple Languages
I
davidkim205
716
13
CLIP ViT B 16 DataComp.XL S13b B90k
This model is based on the Transformers library, and its specific functions and uses require further information for confirmation.
Large Language Model Transformers
C
Solenya-ai
135
0
CREMA D Model
Apache-2.0
A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, achieving 73.22% accuracy on the evaluation set
Audio Classification Transformers
C
jdmartinev
21
0
Vilanocr
Urdu language model based on the transformers library, suitable for natural language processing tasks.
Large Language Model Transformers Other
V
musadac
24
0
Ser Model Adjusted 2023 03 03
Apache-2.0
A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, achieving an accuracy of 75.73% on the evaluation set
Audio Classification Transformers
S
aherzberg
18
0
Detr Resnet 50 CD45RB 1000 Att
Apache-2.0
A fine-tuned model based on facebook/detr-resnet-50 for object detection tasks
Object Detection Transformers
D
polejowska
13
0
Beit Base Patch16 224 Pt22k Ft22k Finetuned FER 5e 05 3
Apache-2.0
A facial expression recognition model fine-tuned based on Microsoft BEiT, achieving 68.6% accuracy on the FER dataset
Image Classification Transformers
B
lixiqi
17
0
Wav2vec2 Base Finetuned Ie
Apache-2.0
A fine-tuned version based on facebook/wav2vec2-base model for specific tasks
Speech Recognition Transformers
W
minoosh
14
0
Beit Base Patch16 224 Pt22k Ft22k Finetuned FER2013CKPlus
Apache-2.0
This model is an image classification model based on the BEiT architecture, fine-tuned on the FER2013CKPlus dataset for facial expression recognition tasks.
Image Classification Transformers
B
Celal11
19
0
Pepe
An image classification model provided by Keras, supporting multiple pre-trained architectures and suitable for common image classification tasks.
Image Classification
P
PeskyAmiable
0
0
Vivit B 16x2
MIT
ViViT is an extension of the Vision Transformer (ViT) for video processing, primarily used for downstream tasks such as video classification.
Video Processing Transformers
V
google
989
11
Fastbook 04 Mnist Basics
An image classification model built on the fastai framework, with unspecified categories
Image Classification
F
fastai
21
2
Resnet34
Apache-2.0
ResNet-34 model pretrained on the ImageNette dataset for image classification tasks.
Image Classification Transformers
R
frgfm
51
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase