N

Nvidia OpenReasoning Nemotron 32B GGUF

Developed by bartowski
A quantized version of NVIDIA OpenReasoning - Nemotron - 32B, quantized through llama.cpp to reduce model storage and computational resource requirements for easy deployment.
Downloads 2,382
Release Time : 7/18/2025

Model Overview

This is a large language model with 32B parameters, focusing on reasoning tasks and offering multiple quantized versions to meet different hardware requirements.

Model Features

Multiple quantization options
Offer multiple quantized versions from Q8_0 to IQ2_XS to meet different hardware and performance requirements.
Efficient reasoning
Reduce the model size through quantization technology while maintaining high reasoning performance.
Extensive deployment support
Support running on multiple platforms such as LM Studio and llama.cpp.
Optimized prompt format
Adopt a structured prompt format to facilitate the distinction between system instructions and user input.

Model Capabilities

Text generation
Logical reasoning
Multi-round dialogue

Use Cases

Intelligent assistant
Dialogue system
Build an intelligent dialogue system capable of understanding complex instructions.
Education
Teaching assistance
Used to answer students' questions and provide learning guidance.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase