# Few-shot Fine-tuning

Logoiconemojimoe V0.2 FLUX.1 Dev LoRA
Other
A LoRA adapter based on the FLUX.1-dev model, specifically designed for generating logos, icons, and emojis, supporting various 3D rendering effects including Microsoft FluentUI style.
Image Generation
Borcherding
282
1
Bge Base En V1.5 Course Recommender V5
This is a sentence-transformers model fine-tuned from BAAI/bge-base-en-v1.5, which maps sentences and paragraphs to a 768-dimensional dense vector space.
Text Embedding
datasocietyco
15.87k
1
Test With Sdfvd
A video understanding model fine-tuned from MCG-NJU/videomae-base, reaching 50% accuracy on the evaluation set
Video Processing Transformers
cocovani
16
0
Videomae Base Finetuned 1e 08 Bs4 Ep2
A video understanding model fine-tuned from MCG-NJU/videomae-base on an unspecified dataset
Video Processing Transformers
EloiseInacio
14
0
Finetuning Sentiment Model 3000 Samples
Apache-2.0
A sentiment analysis model fine-tuned from distilbert-base-uncased, achieving 87.67% accuracy on the evaluation set
Text Classification Transformers
mayank15122000
111
1
Nuke X Gemma3 1B Reasoner Testing
Apache-2.0
A reasoning-enhanced model built on Google Gemma-3-1B, with logical reasoning improved through the GRPO algorithm and high-quality datasets
Large Language Model Transformers English
NuclearAi
77
2
Learn Hf Food Not Food Text Classifier Distilbert Base Uncased
Apache-2.0
A DistilBERT-based text classification model for distinguishing between food and non-food texts
Text Classification Transformers
HimanshuGoyal2004
70
1
Finetuned ViT Model
MIT
A hardhat detection model fine-tuned from the DETR-ResNet50 architecture, designed for industrial scenarios
Object Detection Transformers English
bnina-ayoub
21
1
Finetuning Sentiment Model 3000 Samples 1
Apache-2.0
A sentiment analysis model fine-tuned from distilbert-base-uncased, achieving 85.67% accuracy on the evaluation set
Text Classification Transformers
nayaksaroj
23
1
Ddpm Fewshot Anime Face
MIT
A diffusion model based on the DDPM architecture for generating cartoon-style character avatars
Image Generation
xchuan
25
1
Florence 2 DocVQA
A version of Microsoft's Florence-2 model fine-tuned for 1 day on 5% of the Docmatix dataset, suitable for image-text understanding tasks
Image-to-Text Transformers
impactframes
30
1
Paligemma Vqav2
This model is a fine-tuned version of google/paligemma-3b-pt-224 on a subset of the VQAv2 dataset, specializing in visual question answering tasks.
Image-to-Text Transformers
merve
168
13
Finetuned Clothes
Apache-2.0
A clothing classification model fine-tuned based on Google's ViT model, supporting image classification for 7 clothing categories
Image Classification Transformers
samokosik
50
2
Intent Classifier
A Flan-T5-Base fine-tuned intent classification model for categorizing customer queries into predefined categories
Text Classification Transformers
Serj
364
4
Metavoice 1B V0.1
Apache-2.0
MetaVoice-1B is a 1.2 billion parameter text-to-speech (TTS) foundation model trained on 100,000 hours of speech data, specializing in generating emotional English speech with support for voice cloning and long-form synthesis.
Speech Synthesis English
metavoiceio
571
785
Blip Image Captioning Base Test Sagemaker Tops 3
BSD-3-Clause
This model is a fine-tuned version of Salesforce's BLIP image captioning base model on the SageMaker platform, primarily used for image caption generation tasks.
Image-to-Text Transformers
GHonem
13
0
Model3
MIT
A document image understanding model fine-tuned from naver-clova-ix/donut-base-finetuned-cord-v2
Image-to-Text Transformers
sunilsai
13
0
Donut Base Sroie
MIT
A model based on naver-clova-ix/donut-base, fine-tuned on an image folder dataset; no specific use case is stated
Text Recognition Transformers
iamkhadke
13
0
Swinv2 Tiny Patch4 Window8 256 Finetuned THFOOD 50
This model is a vision classification model fine-tuned on the THFOOD-50 Thai food dataset based on the Swin Transformer V2 architecture, specifically designed for Thai food image recognition.
Image Classification Transformers
thean
30
2
All Format
MIT
A model fine-tuned from philschmid/donut-base-sroie, suitable for image processing tasks
Text Recognition Transformers
dreeven
17
0
Platzi Vit Model Julio Test
Apache-2.0
This is an image classification model fine-tuned on a bean dataset based on Google's ViT model, achieving a high accuracy of 99.25% on the validation set.
Image Classification Transformers
platzi
18
0
Swin Tiny Patch4 Window7 224 Finetuned Skin Cancer
Apache-2.0
A fine-tuned model based on the Swin Transformer architecture, specifically designed for skin cancer image classification tasks
Image Classification Transformers
MPSTME
18
0
Swin Tiny Patch4 Window7 224 Finetuned Trash Classification
Apache-2.0
A model based on the Swin Transformer architecture, fine-tuned for garbage classification and achieving 88.27% accuracy
Image Classification Transformers
maixbach
22
2
Swin Small Finetuned Cifar100
Apache-2.0
A small model based on the Swin Transformer architecture, fine-tuned on the CIFAR-100 dataset for image classification tasks
Image Classification Transformers
MazenAmria
37
0
Donut Base Sroie
MIT
A model based on naver-clova-ix/donut-base, fine-tuned on an image folder dataset and suitable for document understanding tasks
Text Recognition Transformers
zahra000
16
0
Convnext Tiny 224 Finetuned Eurosat Vitconfig Test 1
A ConvNeXt-Tiny model fine-tuned on an image folder dataset, suitable for image classification tasks
Image Classification Transformers
polejowska
30
0
Vit Base Patch16 224 In21k Finetuned Cifar10 Test
Apache-2.0
A test version of Google's Vision Transformer (ViT) base model, fine-tuned on the CIFAR-10 dataset
Image Classification Transformers
minhhoque
30
0
Ast Finetuned Audioset 10 10 0.4593 Finetuning ESC 50 Slower LR
BSD-3-Clause
An audio classification model based on the AST architecture, pre-trained on the AudioSet dataset and fine-tuned on the ESC-50 dataset
Audio Classification Transformers
xpariz10
22
0
Vit Base Patch16 224 Finetuned
Apache-2.0
An image classification model fine-tuned from Google's Vision Transformer (ViT) on custom image datasets
Image Classification Transformers
clp
30
0
Donut Base Sroie Fine Tuned
MIT
A version of the naver-clova-ix/donut-base model fine-tuned on an image folder dataset, suitable for document understanding tasks.
Text Recognition Transformers
adrianccy
21
0
Dof Receipts 1
MIT
A model fine-tuned from naver-clova-ix/donut-base for processing image data
Text Recognition Transformers
Sebabrata
31
0
Donut Base Label Studio 200 Invoices
MIT
An invoice recognition model based on the Donut architecture, fine-tuned on a dataset of 200 invoices
Text Recognition Transformers
Prem11100
18
0
Vit Base Patch16 224 Finetuned Imageclassification
Apache-2.0
An image classification model based on Google's ViT model, fine-tuned on an image folder dataset and achieving 95.02% accuracy
Image Classification Transformers
thaonguyen274
13
0
Deit Base Patch16 224 FV Finetuned Memes
Apache-2.0
A meme classification model fine-tuned from facebook/deit-base-patch16-224, achieving 84.85% accuracy on the imagefolder dataset
Image Classification Transformers
jayanta
11
0
My Awesome Eli5 Mlm Model
Apache-2.0
A model fine-tuned from distilroberta-base; its specific purpose is not stated
Large Language Model Transformers
stevhliu
425
1
Bart Base Few Shot K 256 Finetuned Squad Seed 0
Apache-2.0
This model is a fine-tuned version of facebook/bart-base on the SQuAD dataset, suitable for question-answering tasks.
Question Answering System Transformers
anas-awadalla
13
0
Bart Base Few Shot K 64 Finetuned Squad Seed 2
Apache-2.0
A question-answering model fine-tuned on the SQuAD dataset based on facebook/bart-base
Question Answering System Transformers
anas-awadalla
13
0
Vit Base Patch16 384 Wi3
Apache-2.0
A fine-tuned model based on the Google Vision Transformer (ViT) architecture, suitable for image classification tasks
Image Classification Transformers
Imene
21
0
Wav2vec2 Xls R 300m Mrbrown Finetune1
Apache-2.0
A speech recognition model fine-tuned from the pre-trained facebook/wav2vec2-xls-r-300m model on the uob_singlish dataset
Speech Recognition Transformers
RuiqianLi
18
0
Malaya Speech Mrbrown Finetune1
This model is a version of wav2vec2-xls-r-300m-mixed fine-tuned on the uob_singlish dataset, specializing in Singapore English speech recognition.
Speech Recognition Transformers
RuiqianLi
24
0