Model Selection

Lightweight Model

# Lightweight Model

Final Complete Malicious Url Model GGUF

This is a quantized model for malicious URL detection, based on the BERT architecture, capable of effectively identifying malicious URLs and phishing attacks.

Text Classification

Transformers English

Deepseek R1 0528 GGUF

DeepSeek-R1 is a large language model focused on foundational mathematics and model reasoning capabilities.

Large Language Model

Transformers English

Ultravox V0 5 Llama 3 2 1b GGUF

Ultravox v0.5 is an audio-to-text model optimized from the Llama-3 2.1B architecture, focusing on efficient speech transcription tasks.

Speech Recognition

Japanese Reranker Tiny V2

This is a very compact and fast Japanese reranking model, suitable for improving the accuracy of RAG systems and can run efficiently on CPUs or edge devices.

Text Embedding Japanese

Japanese Reranker Xsmall V2

This is a very compact and fast Japanese reranking model, suitable for improving the accuracy of RAG systems.

Text Embedding Japanese

Qwen3 0.6B TLDR Lora

Qwen3-0.6B is an open-source language model based on the Transformer architecture, with a parameter scale of 600 million, suitable for natural language processing tasks such as text summarization.

Text Generation

Mlabonne Qwen3 8B Abliterated GGUF

This is the quantized version of the Qwen3-8B-abliterated model, quantized using llama.cpp, suitable for text generation tasks.

Large Language Model

Qwen3 1.7B ONNX

Qwen3-1.7B is a 1.7B-parameter open-source large language model released by Alibaba Cloud, based on the Transformer architecture, supporting various natural language processing tasks.

Large Language Model

Deepthink 1.5B Open PRM Q8 0 GGUF

Deepthink-1.5B-Open-PRM is a 1.5B parameter open-source language model, converted to GGUF format for use with llama.cpp.

Large Language Model English

Qwen2.5 1.5B Sign

A text-to-Chinese Sign Language model developed based on the Qwen2.5 architecture

Text Generation Chinese

Llama OuteTTS 1.0 1B 3bit

This is a 3-bit quantized text-to-speech model in MLX format, supporting multiple languages.

Speech Synthesis Supports Multiple Languages

DeBERTa-v3-small is a lightweight variant of the DeBERTa model released by Microsoft, suitable for text classification tasks.

Text Classification English

T5 Small Title Ft

T5 Small is the compact version of Google's T5 (Text-to-Text Transfer Transformer) model, suitable for various natural language processing tasks.

Text Generation

Transformers English

Slim Orpheus 3b JAPANESE Ft Q8 0 GGUF

This is a GGUF format model converted from the slim-orpheus-3b-JAPANESE-ft model, specifically optimized for Japanese text processing.

Large Language Model Japanese

Faster Distil Whisper Large V3.5

Distil-Whisper is a distilled version of the Whisper model, optimized for Automatic Speech Recognition (ASR) tasks, offering faster inference speeds.

Speech Recognition English

Huihui Ai.deepseek V3 0324 Pruned Coder 411B GGUF

DeepSeek-V3-0324-Pruned-Coder-411B is a pruned and optimized code generation model based on the DeepSeek-V3 architecture, focusing on code generation tasks.

Large Language Model

Text To Cypher Gemma 3 4B Instruct 2025.04.0

Gemma 3.4B IT is a large language model based on text-to-text generation, specifically designed for converting natural language into Cypher query language.

Knowledge Graph

Mizan Rerank V1

A revolutionary open-source model capable of reordering long Arabic texts with exceptional efficiency and accuracy.

Text Embedding Supports Multiple Languages

DASS Small AudioSet 47.2

The first state space model to surpass Transformer-based audio classifiers, achieving state-of-the-art performance on AudioSet audio classification tasks while significantly reducing model size.

Audio Classification

Learn Hf Food Not Food Text Classifier Distilbert Base Uncased

A DistilBERT-based text classification model for distinguishing between food and non-food texts

Text Classification

HimanshuGoyal2004

Allura Org Gemma 3 Glitter 4B GGUF

GGUF format model file converted from allura-org/Gemma-3-Glitter-4B, optimized with imatrix quantization

Large Language Model English

Codesearch ModernBERT Snake

A sentence transformer model specifically designed for code search, based on the ModernBERT architecture, supporting 8192 token long sequence processing

Text Embedding English

Snac 24khz ONNX

SNAC 24kHz is a model for feature extraction, suitable for audio signal processing tasks.

Audio Classification

Tinyllava Video Qwen2.5 3B Group 16 512

TinyLLaVA-Video is a video understanding model based on Qwen2.5-3B and siglip-so400m-patch14-384, utilizing a grouped resampler for video frame processing

Whisper Custom Small

A small speech recognition model based on the OpenAI Whisper architecture, focused on English speech-to-text tasks.

Speech Recognition English

Distil Large V3.5 Ct2

Distil-Whisper is a distilled version of the Whisper model, achieving efficient speech recognition through large-scale pseudo-labeling technology

Speech Recognition English

Lightblue Reranker 0.5 Bincont Filt Gguf

This is a text ranking model used for sorting text by relevance.

Lightblue Reranker 0.5 Cont Gguf

This is a text ranking model used for reordering and scoring texts.

Lightblue Reranker 0.5 Cont Filt Gguf

A text ranking model fine-tuned based on Qwen2.5-0.5B-Instruct, suitable for information retrieval and relevance ranking tasks

Large Language Model

Jbaron34 Qwen2.5 0.5b Bebop Reranker Newer Small Gguf

A 50-million-parameter text reranking model based on the Qwen2.5 architecture, suitable for information retrieval and document ranking tasks

Large Language Model

Jbaron34 Qwen2.5 0.5b Bebop Reranker New Small Gguf

A text reranking model based on the Qwen2.5 architecture with 0.5B parameters, suitable for reranking tasks.

Large Language Model

Huihui Ai.granite Vision 3.2 2b Abliterated GGUF

Granite Vision 3.2 2B Abliterated is a vision-language model focused on image-to-text conversion tasks.

Distill Any Depth Small Hf

Distill-Any-Depth is a SOTA monocular depth estimation model trained based on knowledge distillation algorithms, capable of efficient and accurate depth estimation.

Qwq Math IO 500M GGUF

QwQ-Math-IO-500M is a 500M-parameter language model focused on mathematical reasoning and input-output processing, offering quantized versions in GGUF format.

Large Language Model English

LTX-Video is a model based on text-to-video generation technology, capable of generating corresponding video content based on input text descriptions.

Text-to-Video English

SoT_DistilBERT is a classification model fine-tuned based on DistilBERT, designed to select the optimal reasoning paradigm for a given query according to the Sketch-of-Thought (SoT) framework.

Text Classification

Transformers English

Gemmax2 28 2B 4bit

The GemmaX2-28-2B GGUF quantized model is a collection of quantized versions of the GemmaX2-28-2B-v0.1 translation large language model developed by Xiaomi, supporting machine translation tasks in 28 languages.

Machine Translation

Transformers Supports Multiple Languages

Vulnerability Severity Classification Distilbert Base Uncased

A DistilBERT-based vulnerability severity classification model for automatically determining severity levels based on vulnerability descriptions

Text Classification

HealthGPT is a model specifically developed for unified multimodal healthcare tasks, supporting both English and Chinese.

Large Language Model Supports Multiple Languages

Inf Retriever V1 1.5b

INF-Retriever-v1-1.5B is a dense retrieval model based on large language models developed by INF TECH, optimized and fine-tuned for Chinese-English data retrieval tasks.

Transformers Supports Multiple Languages

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase