Model Selection

GGUF efficient inference

# GGUF efficient inference

Gemma 2 9b It Russian Function Calling GGUF

This model is a fine-tuned version of google/gemma-2-9b-it for function calling tasks, with training data entirely manually annotated using the Russian version of the DiTy/function-calling dataset.

Large Language Model Other

3b De Ft Research Release Q4 K M GGUF

This is a GGUF format model converted from the canopylabs/3b-de-ft-research_release model, specifically optimized for German text processing.

Large Language Model German

Slim Orpheus 3b JAPANESE Ft Q4 K M GGUF

This is a GGUF-format model converted from the slim-orpheus-3b-JAPANESE-ft model, specifically optimized for Japanese text processing.

Large Language Model Japanese

Llama 3 3 Nemotron Super 49B V1 Q6 K GGUF

This model is a GGUF format version converted from NVIDIA's Llama-3_3-Nemotron-Super-49B-v1, suitable for text generation tasks.

Large Language Model English

Qwen2.5 VL 32B Instruct GGUF

Qwen2.5-VL-32B-Instruct is a 32B-parameter multimodal vision-language model that supports joint understanding and generation tasks for images and text.

Image-to-Text English

T5 Small Q4 K M GGUF

This model is a quantized version converted from google-t5/t5-small to GGUF format using llama.cpp via ggml.ai's GGUF-my-repo space.

Machine Translation Supports Multiple Languages

Mental Health FineTuned Mistral 7B Instruct V0.2 I1 GGUF

This is a mental health counseling dialogue model fine-tuned based on the Mistral-7B-Instruct-v0.2 model, offering multiple quantized versions to suit different needs.

Large Language Model English

T5 Small Q8 0 GGUF

This model is a quantized version converted from google-t5/t5-small to GGUF format using llama.cpp via ggml.ai's GGUF-my-repo space.

Machine Translation Supports Multiple Languages

Summllama3 8B Q3 K M GGUF

This model is a GGUF format conversion of DISLab/SummLlama3-8B, suitable for text summarization tasks.

Text Generation

C4ai Command R 08 2024

This is a GGUF format text generation model converted from the CoForAI/c4ai-command-r-08-2024 model, supporting multiple languages.

Large Language Model Supports Multiple Languages

Meta Llama 3 8B Instruct GGUF

GGUF quantized version of Meta-Llama-3-8B-Instruct, suitable for local deployment and inference

Large Language Model English

Tinyllama V0 GGUF

TinyLLama-v0 is a lightweight language model provided in GGUF format, suitable for text generation tasks.

Large Language Model English

Pygmalion 2 13B SuperCOT Weighed GGUF

This is an experimental weighted fusion model of Pygmalion-2-13b and SuperCOT, supporting instruction-based interaction and suitable for text generation tasks.

Large Language Model English

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase