Model Selection

Lightweight Fine-tuning

# Lightweight Fine-tuning

Smoothie Qwen3 8B

Smoothie Qwen is a lightweight fine-tuning tool that significantly improves the balance of multilingual generation by smoothing the token probability distribution of Qwen and similar models.

Large Language Model

Transformers English

Gemma 2 2b It Tool Think

Text generation model fine-tuned based on google/gemma-2b-it, supporting tool call reasoning process

Large Language Model

Thinkedit Deepseek Qwen 14b

ThinkEdit is a lightweight weight editing method that identifies and edits a small number of attention heads to mitigate the issue of large language models generating overly short reasoning chains in inference tasks, thereby improving reasoning accuracy.

Large Language Model

Trained Lumina2 Lora Yarn

This is a DreamBooth LoRA weight trained based on Alpha-VLLM/Lumina-Image-2.0, specifically designed for generating yarn art-style images.

Image Generation

A LoRA fine-tuned version based on the Lightricks/LTX-Video model, specializing in text-to-video generation tasks

Qwen2.5 0.5b Test Ft

Qwen 2.5 0.5B is a compact yet powerful language model, fine-tuned based on Qwen/Qwen2.5-0.5B-Instruct, supporting multiple languages with performance close to the Llama 3.2 1B model.

Large Language Model

Transformers Supports Multiple Languages

Phi 3 Mini 4k Instruct Gguf Derived

phi3 is an open-source model based on the Apache-2.0 license, supporting English language, primarily used for summarization tasks.

Large Language Model English

Mamba is an efficient sequence model compatible with transformers, with 790 million parameters, suitable for causal language modeling tasks.

Large Language Model

Mamba is a transformer-compatible sequence modeling model with efficient inference capabilities.

Large Language Model

Mamba is an efficient language model based on the State Space Model (SSM) architecture, with 1.4B parameters, supporting text generation tasks

Large Language Model

A 2.8 billion parameter language model based on the Mamba architecture, compatible with HuggingFace Transformers library

Large Language Model

Tinyllama Tarot V1

A Tarot card interpretation model fine-tuned based on TinyLlama-1.1B, capable of making predictions and interpretations based on Tarot cards.

Large Language Model

Med BLIP 2 QLoRA

BLIP2 is a vision-language model based on OPT-2.7B, focusing on visual question answering tasks. It can understand image content and answer related questions.

Mythomax L2 Kimiko V2 13b

A large language model fine-tuned based on MythoMax-L2-13b, optimized through LoRA technology merging, suitable for creative text generation and dialogue tasks

Large Language Model

Image caption generation model fine-tuned based on Salesforce/blip-image-captioning-base

Sentence Similarity Semantic Search

This model is a fine-tuned sentence transformer based on a news dataset, specifically designed for semantic search and sentence similarity calculation.

Text Embedding English

A model fine-tuned based on distilroberta-base, with specific uses and training data not clearly stated

Large Language Model

Distilbert Base Turkish Cased Clip

A Turkish text encoder fine-tuned from dbmdz/distilbert-base-turkish-cased, designed to work with CLIP's ViT-B/32 image encoder

Electra Small Discriminator Finetuned Ner

A named entity recognition model based on ELECTRA-small architecture, fine-tuned on the wikiann dataset

Sequence Labeling

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase