# Lightweight Model
Final Complete Malicious Url Model GGUF
Apache-2.0
A quantized BERT-based model for detecting malicious URLs and phishing attacks.
Text Classification
Transformers English

mradermacher
175
1
Deepseek R1 0528 GGUF
MIT
DeepSeek-R1 is a large language model focused on mathematical and general reasoning capabilities.
Large Language Model
Transformers English

unsloth
143
79
Ultravox V0 5 Llama 3 2 1b GGUF
MIT
Ultravox v0.5 is an audio-to-text model built on the Llama-3.2-1B architecture, focused on efficient speech transcription.
Speech Recognition
ggml-org
421
1
Japanese Reranker Tiny V2
MIT
This is a very compact and fast Japanese reranking model, suitable for improving the accuracy of RAG systems and can run efficiently on CPUs or edge devices.
Text Embedding Japanese
hotchpotch
339
3
Japanese Reranker Xsmall V2
MIT
This is a very compact and fast Japanese reranking model, suitable for improving the accuracy of RAG systems.
Text Embedding Japanese
hotchpotch
260
1
Qwen3 0.6B TLDR Lora
Apache-2.0
Qwen3-0.6B is an open-source Transformer-based language model with 600 million parameters, suitable for natural language processing tasks such as text summarization.
Text Generation
phh
56
0
Mlabonne Qwen3 8B Abliterated GGUF
This is the quantized version of the Qwen3-8B-abliterated model, quantized using llama.cpp, suitable for text generation tasks.
Large Language Model
bartowski
6,892
5
Qwen3 1.7B ONNX
Qwen3-1.7B is a 1.7B-parameter open-source large language model released by Alibaba Cloud, based on the Transformer architecture, supporting various natural language processing tasks.
Large Language Model
Transformers

onnx-community
189
1
Deepthink 1.5B Open PRM Q8 0 GGUF
Apache-2.0
Deepthink-1.5B-Open-PRM is a 1.5B parameter open-source language model, converted to GGUF format for use with llama.cpp.
Large Language Model English
prithivMLmods
46
2
Qwen2.5 1.5B Sign
MIT
A text-to-Chinese Sign Language model based on the Qwen2.5 architecture.
Text Generation Chinese
thundax
28
2
Llama OuteTTS 1.0 1B 3bit
This is a 3-bit quantized text-to-speech model in MLX format, supporting multiple languages.
Speech Synthesis Supports Multiple Languages
mlx-community
16
0
Ai Cop
DeBERTa-v3-small is a lightweight variant of the DeBERTa model released by Microsoft, suitable for text classification tasks.
Text Classification English
dejanseo
53
1
T5 Small Title Ft
Apache-2.0
T5 Small is the compact version of Google's T5 (Text-to-Text Transfer Transformer) model, suitable for various natural language processing tasks.
Text Generation
Transformers English

swarup3204
25
0
Slim Orpheus 3b JAPANESE Ft Q8 0 GGUF
Apache-2.0
This is a GGUF format model converted from the slim-orpheus-3b-JAPANESE-ft model, specifically optimized for Japanese text processing.
Large Language Model Japanese
Gapeleon
26
0
Faster Distil Whisper Large V3.5
MIT
Distil-Whisper is a distilled version of the Whisper model, optimized for Automatic Speech Recognition (ASR) tasks, offering faster inference speeds.
Speech Recognition English
Purfview
565
2
Huihui Ai.deepseek V3 0324 Pruned Coder 411B GGUF
DeepSeek-V3-0324-Pruned-Coder-411B is a pruned and optimized code generation model based on the DeepSeek-V3 architecture, focusing on code generation tasks.
Large Language Model
DevQuasar
2,706
2
Text To Cypher Gemma 3 4B Instruct 2025.04.0
Gemma 3 4B IT is a text-to-text large language model, here specifically designed for converting natural language into the Cypher query language.
Knowledge Graph
neo4j
596
2
Mizan Rerank V1
Apache-2.0
An open-source model for reranking long Arabic texts efficiently and accurately.
Text Embedding Supports Multiple Languages
ALJIACHI
167
1
DASS Small AudioSet 47.2
BSD-3-Clause
The first state space model to surpass Transformer-based audio classifiers, achieving state-of-the-art performance on AudioSet audio classification tasks while significantly reducing model size.
Audio Classification
Transformers

saurabhati
47
1
Learn Hf Food Not Food Text Classifier Distilbert Base Uncased
Apache-2.0
A DistilBERT-based text classification model for distinguishing between food and non-food texts
Text Classification
Transformers

HimanshuGoyal2004
70
1
Allura Org Gemma 3 Glitter 4B GGUF
GGUF format model file converted from allura-org/Gemma-3-Glitter-4B, optimized with imatrix quantization
Large Language Model English
ArtusDev
69
1
Codesearch ModernBERT Snake
Apache-2.0
A sentence transformer model specifically designed for code search, based on the ModernBERT architecture, supporting 8192 token long sequence processing
Text Embedding English
Shuu12121
36
2
Snac 24khz ONNX
MIT
SNAC 24kHz is a model for feature extraction, suitable for audio signal processing tasks.
Audio Classification
onnx-community
46
1
Tinyllava Video Qwen2.5 3B Group 16 512
Apache-2.0
TinyLLaVA-Video is a video understanding model based on Qwen2.5-3B and siglip-so400m-patch14-384, utilizing a grouped resampler for video frame processing
Video-to-Text
Zhang199
76
0
Whisper Custom Small
Apache-2.0
A small speech recognition model based on the OpenAI Whisper architecture, focused on English speech-to-text tasks.
Speech Recognition English
gyrroa
15
1
Distil Large V3.5 Ct2
MIT
Distil-Whisper is a distilled version of the Whisper model, achieving efficient speech recognition through large-scale pseudo-labeling technology
Speech Recognition English
distil-whisper
264
3
Lightblue Reranker 0.5 Bincont Filt Gguf
This is a text ranking model used for sorting text by relevance.
Text Embedding
RichardErkhov
2,054
0
Lightblue Reranker 0.5 Cont Gguf
This is a text ranking model used for reordering and scoring texts.
Text Embedding
RichardErkhov
1,986
0
Lightblue Reranker 0.5 Cont Filt Gguf
A text ranking model fine-tuned from Qwen2.5-0.5B-Instruct, suitable for information retrieval and relevance ranking tasks.
Large Language Model
RichardErkhov
2,130
0
Jbaron34 Qwen2.5 0.5b Bebop Reranker Newer Small Gguf
A 0.5B-parameter text reranking model based on the Qwen2.5 architecture, suitable for information retrieval and document ranking tasks.
Large Language Model
RichardErkhov
2,117
0
Jbaron34 Qwen2.5 0.5b Bebop Reranker New Small Gguf
A text reranking model based on the Qwen2.5 architecture with 0.5B parameters, suitable for reranking tasks.
Large Language Model
RichardErkhov
2,454
0
Huihui Ai.granite Vision 3.2 2b Abliterated GGUF
Granite Vision 3.2 2B Abliterated is a vision-language model focused on image-to-text conversion tasks.
Image-to-Text
DevQuasar
724
1
Distill Any Depth Small Hf
MIT
Distill-Any-Depth is a state-of-the-art monocular depth estimation model trained with knowledge distillation, capable of efficient and accurate depth estimation.
3D Vision
Transformers

xingyang1
1,214
3
Qwq Math IO 500M GGUF
Apache-2.0
QwQ-Math-IO-500M is a 500M-parameter language model focused on mathematical reasoning and input-output processing, offering quantized versions in GGUF format.
Large Language Model English
tensorblock
56
1
Ltxv0.9.5 Gguf
Other
LTX-Video is a text-to-video model that generates video content from input text descriptions.
Text-to-Video English
calcuis
337
5
Sot DistilBERT
MIT
SoT_DistilBERT is a classification model fine-tuned from DistilBERT that selects the optimal reasoning paradigm for a given query under the Sketch-of-Thought (SoT) framework.
Text Classification
Transformers English

saytes
20.95k
5
Gemmax2 28 2B 4bit
Apache-2.0
The GemmaX2-28-2B GGUF quantized model is a collection of quantized versions of the GemmaX2-28-2B-v0.1 translation large language model developed by Xiaomi, supporting machine translation tasks in 28 languages.
Machine Translation
Transformers Supports Multiple Languages

Tonic
19
1
Vulnerability Severity Classification Distilbert Base Uncased
Apache-2.0
A DistilBERT-based vulnerability severity classification model for automatically determining severity levels based on vulnerability descriptions
Text Classification
Transformers

CIRCL
199
1
Healthgpt M3
MIT
HealthGPT is a model specifically developed for unified multimodal healthcare tasks, supporting both English and Chinese.
Large Language Model Supports Multiple Languages
lintw
79
8
Inf Retriever V1 1.5b
Apache-2.0
INF-Retriever-v1-1.5B is a dense retrieval model based on large language models developed by INF TECH, optimized and fine-tuned for Chinese-English data retrieval tasks.
Text Embedding
Transformers Supports Multiple Languages

infly
19.59k
25