Instella 3B Stage1

Developed by AMD
Instella is a series of 3-billion-parameter open-source language models developed by AMD, trained on AMD Instinct™ MI300X GPUs, outperforming other fully open-source models of the same scale.
Downloads 397
Release Time: 3/5/2025

Model Overview

The Instella series consists of fully open-source 3-billion-parameter language models. They outperform existing fully open-source models of the same scale and are comparable to top open-weight models.

Model Features

High Performance
Outperforms existing fully open-source models at the 3-billion-parameter scale and is comparable to top open-weight models.
Fully Open-Source
Completely open-sourced model weights, training configurations, datasets, and code.
Efficient Training
Employs efficient training techniques such as FlashAttention-2, torch.compile, and Fully Sharded Data Parallel (FSDP) with hybrid sharding.
Multi-Stage Training
Includes multiple training stages: pre-training, supervised fine-tuning, and alignment with Direct Preference Optimization (DPO).
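
To make the DPO alignment stage more concrete, the following is a minimal sketch of the standard DPO objective for a single preference pair. It is not taken from the Instella training code; the function names, the argument names, and the default beta of 0.1 are assumptions chosen for illustration. The inputs are summed log-probabilities of a preferred ("chosen") and a dispreferred ("rejected") response under the model being trained and under a frozen reference model.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def dpo_loss(
    policy_chosen_logp: float,
    policy_rejected_logp: float,
    ref_chosen_logp: float,
    ref_rejected_logp: float,
    beta: float = 0.1,  # assumed value for illustration, not Instella's setting
) -> float:
    """Standard DPO loss for one (chosen, rejected) response pair."""
    # How much more likely each response is under the policy than under the reference.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    # The loss shrinks as the policy favors the chosen response more than the reference does.
    return -log_sigmoid(beta * (chosen_margin - rejected_margin))
```

When the policy and the reference agree exactly, the loss equals log(2); it falls below that value once the policy raises the chosen response relative to the rejected one.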

Model Capabilities

Text generation
Instruction following
Question answering
Conversational interaction

Use Cases

Natural Language Processing
Intelligent Q&A
Answers various user questions
Reports strong results on benchmarks such as OLMES and FastChat MT-Bench.
Text Generation
Generates coherent text content based on prompts
Supports a context length of 4096 tokens.
Education
Learning Assistance
Helps students understand complex concepts
Achieves 96.6% accuracy on the SciQ task.
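
The 4096-token context length noted above bounds the prompt and the generated text together. The following is a minimal sketch of how a caller might budget for that limit, assuming the prompt is already tokenized into a list of integer IDs. The helper name fit_prompt and the choice to keep the most recent tokens are illustrative assumptions, not part of any Instella API.

```python
CONTEXT_LENGTH = 4096  # context length stated on this model card


def fit_prompt(
    token_ids: list[int],
    max_new_tokens: int,
    context_length: int = CONTEXT_LENGTH,
) -> list[int]:
    """Trim a tokenized prompt so prompt + generated tokens fit in the context window.

    Hypothetical helper: keeps the most recent tokens and drops the oldest ones.
    """
    if not 0 < max_new_tokens < context_length:
        raise ValueError("max_new_tokens must be between 1 and context_length - 1")
    # Tokens left for the prompt after reserving room for generation.
    budget = context_length - max_new_tokens
    return token_ids[-budget:]
```

For example, reserving 256 tokens for generation leaves 3840 tokens for the prompt. Dropping the oldest tokens suits chat-style input where recent turns matter most; a document summarization workflow might prefer to trim from the end instead.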