# Inference Optimization
**Nvidia AceReason Nemotron 7B GGUF**
License: Other | Type: Large Language Model | Author: bartowski | Downloads: 209 | Likes: 2
AceReason-Nemotron-7B is a large language model based on the Nemotron architecture with 7B parameters, offering multiple quantized versions to accommodate different hardware requirements.
**Nvidia AceReason Nemotron 14B GGUF**
License: Other | Type: Large Language Model | Author: bartowski | Downloads: 1,772 | Likes: 6
AceReason-Nemotron-14B is a large language model with 14B parameters, offering multiple quantized versions to accommodate different hardware requirements.
**Llama 3.1 Nemotron Nano 4B V1.1**
License: Other | Type: Large Language Model | Tags: Transformers, English | Author: unsloth | Downloads: 219 | Likes: 4
Llama-3.1-Nemotron-Nano-4B-v1.1 is a large language model derived from Llama 3.1 8B through compression, optimized for inference efficiency and task execution, and suitable for local deployment on a single RTX GPU.
**Mimo 7B RL**
License: MIT | Type: Large Language Model | Tags: Transformers | Author: XiaomiMiMo | Downloads: 11.79k | Likes: 252
MiMo-7B-RL is a reinforcement learning model trained from the MiMo-7B-SFT model, delivering outstanding performance on mathematical and code reasoning tasks, comparable to OpenAI o1-mini.
**Mimo 7B Base**
License: MIT | Type: Large Language Model | Tags: Transformers | Author: XiaomiMiMo | Downloads: 12.75k | Likes: 101
A 7B-parameter specialized reasoning language model series launched by Xiaomi, significantly enhancing mathematical and code reasoning capabilities through optimized pre-training and post-training strategies.
**Cognitivecomputations Dolphin3.0 R1 Mistral 24B GGUF**
Type: Large Language Model | Tags: English | Author: bartowski | Downloads: 10.24k | Likes: 72
Dolphin3.0-R1-Mistral-24B is a 24B-parameter large language model based on the Mistral architecture, trained by Eric Hartford, focusing on reasoning and first-principles analysis.
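Several of the GGUF entries above note that multiple quantized versions exist "to accommodate different hardware requirements." The tradeoff behind that is simple arithmetic: weight memory scales with parameters times bits per weight. Below is a minimal sketch of that estimate; the bits-per-weight figures for the `Q8_0` and `Q4_K_M` levels are approximations of llama.cpp's formats, and the formula ignores KV cache and runtime overhead.

```python
def model_size_gib(params_billion: float, bits_per_weight: float) -> float:
    """Rough weight-memory estimate: parameters * bits / 8 bytes, in GiB.

    Ignores KV cache, activations, and runtime overhead, so real usage
    is higher; this only bounds the weights themselves.
    """
    bytes_total = params_billion * 1e9 * bits_per_weight / 8
    return bytes_total / (1024 ** 3)

# Approximate bits-per-weight for common formats (assumed, not exact):
# FP16 = 16, Q8_0 ~ 8.5, Q4_K_M ~ 4.85
for name, bits in [("FP16", 16.0), ("Q8_0", 8.5), ("Q4_K_M", 4.85)]:
    print(f"7B @ {name}: ~{model_size_gib(7, bits):.1f} GiB")
```

At these rates, a 7B model drops from roughly 13 GiB at FP16 to around 4 GiB at a 4-bit quantization, which is what makes single consumer-GPU deployment of the models listed here feasible.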