Qwen3 4B Base
Qwen3-4B-Base is a 4-billion-parameter base model from Qwen3, the latest generation of the Qwen series. It supports a 32k-token context length and multilingual processing.
Downloads 15.15k
Release Date: 4/28/2025
Model Overview
A large language model pre-trained with a three-stage paradigm, targeting general language modeling, stronger STEM, programming, and logical reasoning capabilities, and long-text comprehension.
Model Features
Multilingual Coverage
Pre-training data covers 36 trillion tokens across 119 languages, with language coverage three times that of the previous generation.
Three-Stage Pre-Training
Phased enhancement of general language capabilities, STEM/programming/logical reasoning abilities, and long-text comprehension.
Long Context Support
Supports a context length of up to 32,768 tokens.
Training Technology Innovation
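The Qwen3 series uses a load-balancing loss for its MoE models and QK layer normalization for all models to improve training stability. Qwen3-4B-Base is a dense model, so only the QK layer normalization applies to it.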
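As a rough illustration of why normalizing queries and keys can help training stability, the sketch below compares one attention logit with and without normalization. It assumes an RMS-style normalization with no learned gain, and the vectors and the function names rms_normalize and attention_logit are made up for this example. It is not Qwen3's actual implementation.

```python
import math

def rms_normalize(vec, eps=1e-6):
    """Rescale vec so its root-mean-square is about 1 (no learned gain, an assumption)."""
    mean_sq = sum(x * x for x in vec) / len(vec)
    scale = 1.0 / math.sqrt(mean_sq + eps)
    return [x * scale for x in vec]

def attention_logit(q, k, use_qk_norm=True):
    """Scaled dot-product logit for one query/key pair within a single head."""
    if use_qk_norm:
        q, k = rms_normalize(q), rms_normalize(k)
    dot = sum(a * b for a, b in zip(q, k))
    return dot / math.sqrt(len(q))

# Hypothetical activations: the "large" pair is the "small" pair scaled by 100,
# mimicking activations that have grown during training.
q_small, k_small = [0.5, -1.0, 0.25, 2.0], [1.0, 0.5, -0.5, 1.5]
q_large = [100 * x for x in q_small]
k_large = [100 * x for x in k_small]

raw_small = attention_logit(q_small, k_small, use_qk_norm=False)
raw_large = attention_logit(q_large, k_large, use_qk_norm=False)
norm_small = attention_logit(q_small, k_small)
norm_large = attention_logit(q_large, k_large)

print("without QK norm:", raw_small, raw_large)
print("with QK norm:   ", norm_small, norm_large)
```

Without normalization the logit grows with the square of the activation scale, which can saturate the softmax. With normalization the logit depends only on the direction of the two vectors and stays bounded by the square root of the head dimension.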
Model Capabilities
Multilingual text generation
Long-text comprehension
Programming code generation
Logical reasoning
STEM problem-solving
Use Cases
Intelligent Assistant
Multilingual Customer Service Bot
Build a multilingual intelligent customer service system
Can draw on pre-training data spanning 119 languages to handle user queries
EdTech
Programming Learning Assistant
Assists programming learners in understanding code and solving problems
Enhanced programming capabilities provide more accurate code explanations