Nemotron H 8B Base 8K

Developed by NVIDIA
NVIDIA Nemotron-H-8B-Base-8K is a large language model (LLM) developed by NVIDIA that generates a completion for a given piece of text. It uses a hybrid architecture made up primarily of Mamba-2 and MLP layers, with only four attention layers. It supports a context length of 8K and covers multiple languages, including English, German, Spanish, French, Italian, Korean, Portuguese, Russian, Japanese, and Chinese.
Downloads 5,437
Release Date: 3/19/2025

Model Overview

This is a base (foundation) language model intended primarily for text generation, with support for multiple languages. For best results on a specific task, fine-tuning the model with the customization tools in the NeMo Framework is recommended.
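Because this is a base model, it is prompted with the start of a text rather than with a chat-style instruction. The following is a minimal sketch of text completion through the Hugging Face transformers library. It assumes the checkpoint is published under the repository id `nvidia/Nemotron-H-8B-Base-8K` and that it loads with `trust_remote_code=True` on a CUDA GPU; neither detail is stated on this page, and the checkpoint may need extra dependencies listed on the upstream model card. The function name `complete` is hypothetical.

```python
# Assumption: Hugging Face repository id for this checkpoint (not stated on this page).
MODEL_ID = "nvidia/Nemotron-H-8B-Base-8K"


def complete(prompt: str, max_new_tokens: int = 32) -> str:
    """Return the prompt followed by the model's continuation."""
    # Imported lazily so the heavy dependencies are only needed when generating.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # assumption: bfloat16 weights fit the target GPU
        trust_remote_code=True,
    ).to("cuda")

    # Tokenize the text fragment and let the model continue it.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `complete("The capital of France is")` would download the weights on first use and return the prompt with its continuation appended.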

Model Features

Hybrid Architecture
Combines Mamba-2 and MLP layers with only four attention layers for efficient performance.
Multilingual Support
Supports multiple languages including English, German, Spanish, French, Italian, Korean, Portuguese, Russian, Japanese, and Chinese.
Long-Context Support
Supports an 8K context length, making it suitable for long-text tasks.
Efficient Inference
Optimized for NVIDIA GPU-accelerated systems, enabling faster training and inference speeds.
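The 8K context length listed above bounds the prompt and the generated tokens together, so a long prompt has to leave room for the completion. A minimal sketch of that budgeting follows. It assumes "8K" means 8,192 tokens and that dropping the oldest tokens is acceptable for the task; the helper name `fit_to_context` is hypothetical.

```python
CONTEXT_LENGTH = 8 * 1024  # assumption: "8K" means 8,192 tokens


def fit_to_context(token_ids, max_new_tokens, context_length=CONTEXT_LENGTH):
    """Keep the most recent prompt tokens so prompt + completion fit the window."""
    if not 0 < max_new_tokens < context_length:
        raise ValueError("max_new_tokens must be between 1 and context_length - 1")
    # Tokens available to the prompt once the completion is reserved.
    budget = context_length - max_new_tokens
    # Truncate from the left: the end of the prompt is what the model continues.
    return list(token_ids[-budget:])
```

For example, a 10,000-token prompt with 256 tokens reserved for the completion is cut down to its last 7,936 tokens, while a prompt that already fits is returned unchanged.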

Model Capabilities

Text Generation
Multilingual Text Completion
Code Generation
Mathematical Problem Solving
Common-Sense Reasoning

Use Cases

Research & Development
Language Model Research
Used for developing and testing new methods and techniques for large language models.
Multilingual Application Development
Developing multilingual text generation and completion applications.
Education
Mathematical Problem Solving
Used to solve elementary to advanced mathematical problems, aiding learning.
Achieved an accuracy of 87.11% on the GSM8K benchmark.
Programming Assistance
Code Generation
Generates solutions for Python programming tasks.
Achieved an accuracy of 65.37% on the MBPP benchmark.