# Lightweight Fine-tuning

Smoothie Qwen3 8B
Apache-2.0
Smoothie Qwen is a lightweight fine-tuning tool that significantly improves the balance of multilingual generation by smoothing the token probability distribution of Qwen and similar models.
Large Language Model Transformers English
S
dnotitia
267
8
Gemma 2 2b It Tool Think
MIT
Text generation model fine-tuned based on google/gemma-2b-it, supporting tool call reasoning process
Large Language Model Transformers
G
langdai
36
2
Thinkedit Deepseek Qwen 14b
Other
ThinkEdit is a lightweight weight editing method that identifies and edits a small number of attention heads to mitigate the issue of large language models generating overly short reasoning chains in inference tasks, thereby improving reasoning accuracy.
Large Language Model Transformers
T
cesun
46
2
Trained Lumina2 Lora Yarn
Apache-2.0
This is a DreamBooth LoRA weight trained based on Alpha-VLLM/Lumina-Image-2.0, specifically designed for generating yarn art-style images.
Image Generation
T
sayakpaul
17
3
Ltx Video
A LoRA fine-tuned version based on the Lightricks/LTX-Video model, specializing in text-to-video generation tasks
Text-to-Video
L
smktech9
15
1
Qwen2.5 0.5b Test Ft
Apache-2.0
Qwen 2.5 0.5B is a compact yet powerful language model, fine-tuned based on Qwen/Qwen2.5-0.5B-Instruct, supporting multiple languages with performance close to the Llama 3.2 1B model.
Large Language Model Transformers Supports Multiple Languages
Q
KingNish
1,004
11
Phi 3 Mini 4k Instruct Gguf Derived
Apache-2.0
phi3 is an open-source model based on the Apache-2.0 license, supporting English language, primarily used for summarization tasks.
Large Language Model English
P
zhhan
39
0
Mamba 790m Hf
Mamba is an efficient sequence model compatible with transformers, with 790 million parameters, suitable for causal language modeling tasks.
Large Language Model Transformers
M
state-spaces
6,897
4
Mamba 130m Hf
Mamba is a transformer-compatible sequence modeling model with efficient inference capabilities.
Large Language Model Transformers
M
state-spaces
46.83k
56
Mamba 1.4b Hf
Mamba is an efficient language model based on the State Space Model (SSM) architecture, with 1.4B parameters, supporting text generation tasks
Large Language Model Transformers
M
state-spaces
5,431
11
Mamba 2.8b Hf
A 2.8 billion parameter language model based on the Mamba architecture, compatible with HuggingFace Transformers library
Large Language Model Transformers
M
state-spaces
8,731
103
Tinyllama Tarot V1
Apache-2.0
A Tarot card interpretation model fine-tuned based on TinyLlama-1.1B, capable of making predictions and interpretations based on Tarot cards.
Large Language Model TensorBoard
T
barissglc
13.64k
6
Med BLIP 2 QLoRA
BLIP2 is a vision-language model based on OPT-2.7B, focusing on visual question answering tasks. It can understand image content and answer related questions.
Text-to-Image
M
NouRed
16
1
Mythomax L2 Kimiko V2 13b
A large language model fine-tuned based on MythoMax-L2-13b, optimized through LoRA technology merging, suitable for creative text generation and dialogue tasks
Large Language Model Transformers
M
Undi95
33
15
Blip Test
Bsd-3-clause
Image caption generation model fine-tuned based on Salesforce/blip-image-captioning-base
Image-to-Text Transformers
B
mooncakex
15
0
Sentence Similarity Semantic Search
Apache-2.0
This model is a fine-tuned sentence transformer based on a news dataset, specifically designed for semantic search and sentence similarity calculation.
Text Embedding English
S
Sakil
801
25
Simpledataset
Apache-2.0
A model fine-tuned based on distilroberta-base, with specific uses and training data not clearly stated
Large Language Model Transformers
S
DioLiu
174
0
Distilbert Base Turkish Cased Clip
A Turkish text encoder fine-tuned from dbmdz/distilbert-base-turkish-cased, designed to work with CLIP's ViT-B/32 image encoder
Text-to-Image Transformers
D
mys
2,354
1
Electra Small Discriminator Finetuned Ner
Apache-2.0
A named entity recognition model based on ELECTRA-small architecture, fine-tuned on the wikiann dataset
Sequence Labeling Transformers
E
dbsamu
16
1
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase