# GGUF efficient inference
Gemma 2 9b It Russian Function Calling GGUF
Apache-2.0
This model is a fine-tuned version of google/gemma-2-9b-it for function calling, trained on the Russian version of the DiTy/function-calling dataset, which is entirely manually annotated.
Large Language Model Other
DiTy
509
23
3b De Ft Research Release Q4 K M GGUF
Apache-2.0
This is a GGUF format model converted from the canopylabs/3b-de-ft-research_release model, specifically optimized for German text processing.
Large Language Model German
TheVisitorX
16
0
Slim Orpheus 3b JAPANESE Ft Q4 K M GGUF
Apache-2.0
This is a GGUF-format model converted from the slim-orpheus-3b-JAPANESE-ft model, specifically optimized for Japanese text processing.
Large Language Model Japanese
Gapeleon
40
0
Llama 3 3 Nemotron Super 49B V1 Q6 K GGUF
Other
This model is a GGUF format version converted from NVIDIA's Llama-3_3-Nemotron-Super-49B-v1, suitable for text generation tasks.
Large Language Model English
openfree
2,495
5
Qwen2.5 VL 32B Instruct GGUF
Apache-2.0
Qwen2.5-VL-32B-Instruct is a 32B-parameter multimodal vision-language model that supports joint understanding and generation tasks for images and text.
Image-to-Text English
Mungert
9,766
6
T5 Small Q4 K M GGUF
Apache-2.0
This model is a quantized version converted from google-t5/t5-small to GGUF format using llama.cpp via ggml.ai's GGUF-my-repo space.
Machine Translation Supports Multiple Languages
egrhfnfdg
25
0
Mental Health FineTuned Mistral 7B Instruct V0.2 I1 GGUF
Apache-2.0
This is a mental health counseling dialogue model fine-tuned from Mistral-7B-Instruct-v0.2 and offered in multiple quantized versions to suit different needs.
Large Language Model English
mradermacher
501
3
T5 Small Q8 0 GGUF
Apache-2.0
This model is a quantized version converted from google-t5/t5-small to GGUF format using llama.cpp via ggml.ai's GGUF-my-repo space.
Machine Translation Supports Multiple Languages
agkavin
27
1
Summllama3 8B Q3 K M GGUF
This model is a GGUF format conversion of DISLab/SummLlama3-8B, suitable for text summarization tasks.
Text Generation
dil99x
32
0
C4ai Command R 08 2024
This is a GGUF-format text generation model converted from the CohereForAI/c4ai-command-r-08-2024 model, supporting multiple languages.
Large Language Model Supports Multiple Languages
KimChen
22
2
Meta Llama 3 8B Instruct GGUF
Other
This is a GGUF quantized version of Meta-Llama-3-8B-Instruct, suitable for local deployment and inference.
Large Language Model English
LiteLLMs
76
2
Tinyllama V0 GGUF
MIT
TinyLLama-v0 is a lightweight language model provided in GGUF format, suitable for text generation tasks.
Large Language Model English
aladar
72
2
Pygmalion 2 13B SuperCOT Weighed GGUF
This is an experimental weighted merge of Pygmalion-2-13b and SuperCOT that supports instruction-based interaction and is suitable for text generation tasks.
Large Language Model English
TheBloke
1,468
9