Model Selection

Deep Learning

# Deep Learning

This model is based on the Transformers library, with no specific functionality clearly stated

Large Language Model

Videomae Base Finetuned Kinetics 0408 Final 45sec Org

A video understanding model fine-tuned based on MCG-NJU/videomae-base-finetuned-kinetics, achieving an accuracy of 90.97% on the evaluation set

Video Processing

Albert Base V1 Stackoverflow Prediction

This model is based on the transformers library, and its specific functionality and purpose require further information for confirmation.

Large Language Model

Urdu Text To Speech Tts

This model is based on the transformers library, and its specific purpose and functionality require further information to determine.

Large Language Model

Detr Finetuned Rwy Obb

This model is built based on the Transformers library, and its specific functions and uses require further information to be supplemented.

Large Language Model

Spatiallysaying

Github Samples Tclassifier

This model is based on the Transformers library, but its specific purpose and functionality are not clearly stated.

Large Language Model

Iris is a deep learning-based Korean-English sentence translation model that achieves high-quality translation through advanced natural language processing technology.

Machine Translation

Transformers Supports Multiple Languages

CLIP ViT B 16 DataComp.XL S13b B90k

This model is based on the Transformers library, and its specific functions and uses require further information for confirmation.

Large Language Model

A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, achieving 73.22% accuracy on the evaluation set

Audio Classification

Urdu language model based on the transformers library, suitable for natural language processing tasks.

Large Language Model

Transformers Other

Ser Model Adjusted 2023 03 03

A speech emotion recognition model fine-tuned based on facebook/wav2vec2-base, achieving an accuracy of 75.73% on the evaluation set

Audio Classification

Detr Resnet 50 CD45RB 1000 Att

A fine-tuned model based on facebook/detr-resnet-50 for object detection tasks

Object Detection

Beit Base Patch16 224 Pt22k Ft22k Finetuned FER 5e 05 3

A facial expression recognition model fine-tuned based on Microsoft BEiT, achieving 68.6% accuracy on the FER dataset

Image Classification

Wav2vec2 Base Finetuned Ie

A fine-tuned version based on facebook/wav2vec2-base model for specific tasks

Speech Recognition

Beit Base Patch16 224 Pt22k Ft22k Finetuned FER2013CKPlus

This model is an image classification model based on the BEiT architecture, fine-tuned on the FER2013CKPlus dataset for facial expression recognition tasks.

Image Classification

An image classification model provided by Keras, supporting multiple pre-trained architectures and suitable for common image classification tasks.

Image Classification

ViViT is an extension of the Vision Transformer (ViT) for video processing, primarily used for downstream tasks such as video classification.

Video Processing

Fastbook 04 Mnist Basics

An image classification model built on the fastai framework, with unspecified categories

Image Classification

ResNet-34 model pretrained on the ImageNette dataset for image classification tasks.

Image Classification

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase