I

Instella 3B

Developed by amd
AMD's fully open 3-billion-parameter language model family trained on Instinct MI300X GPUs, outperforming open models of similar scale
Downloads 3,048
Release Time : 3/5/2025

Model Overview

Instella is a fully open-source language model series developed by AMD, including pretrained, supervised fine-tuned, and DPO-aligned versions, supporting 4096 tokens context length

Model Features

Fully Open Model
Completely open model weights, training configurations, and datasets to foster community collaboration
High Performance
Outperforms fully open models of similar scale, approaching the performance of open-weight models
AMD Hardware Optimization
Specifically optimized for Instinct MI300X GPUs and ROCm software stack
Four-Stage Training
Complete training pipeline including pretraining, enhanced training, supervised fine-tuning, and DPO alignment

Model Capabilities

Text Generation
Instruction Following
Question Answering
Conversational Interaction
Knowledge Reasoning

Use Cases

Intelligent Assistant
Dialogue Systems
Build conversational AI capable of understanding complex instructions
Excellent performance in Alpaca evaluation
Education & Research
AI Teaching Assistant
Answers subject questions with step-by-step explanations
Achieved 57.81 score on MMLU comprehensive evaluation
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase