Granite 3B Code Base 2K

Developed by ibm-granite
Granite-3B-Code-Base-2K is a 3B-parameter, decoder-only model developed by IBM Research for code generation tasks. It supports 116 programming languages.
Downloads 711
Release Time: 4/23/2024

Model Overview

The model adopts a two-phase training strategy. The first phase trains on 4 trillion code tokens; the second phase fine-tunes on 500 billion tokens of high-quality code and natural language data. The model targets tasks such as code generation, explanation, and repair.

Model Features

Two-Phase Training Strategy
The first phase involves pre-training on extensive programming language data, while the second phase fine-tunes on carefully selected high-quality data to enhance reasoning and instruction-following capabilities.
Aggressive Deduplication Strategy
Employs exact and fuzzy deduplication to remove duplicate and near-duplicate code from the training data, improving data quality.
Comprehensive Security Filtering
Applies HAP (hate, abuse, and profanity) content filtering, PII (personally identifiable information) removal, and malware scanning to reduce the risk of the model generating harmful content.
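
To make the deduplication feature above more concrete, here is a minimal sketch of exact plus fuzzy deduplication. It is not the pipeline used to build the training data: the function names, the 3-token shingles, and the 0.7 similarity threshold are illustrative assumptions, and the pairwise Jaccard comparison stands in for the MinHash/LSH approximations that large-scale pipelines typically use.

```python
import hashlib
import re


def exact_key(source: str) -> str:
    """Hash of the whitespace-normalized text; exact duplicates share a key."""
    normalized = " ".join(source.split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def shingles(source: str, n: int = 3) -> set:
    """Set of n-token shingles (n=3 is an assumed, illustrative choice)."""
    tokens = re.findall(r"\w+", source)
    if len(tokens) < n:
        return {tuple(tokens)}
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity: size of intersection over size of union."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def deduplicate(docs, threshold=0.7):
    """Keep the first copy of each document; drop exact and near duplicates.

    The threshold is a hypothetical value chosen for this sketch.
    """
    seen_keys = set()
    kept, kept_shingles = [], []
    for doc in docs:
        key = exact_key(doc)
        if key in seen_keys:  # exact duplicate
            continue
        seen_keys.add(key)
        sh = shingles(doc)
        # Fuzzy duplicate: too similar to a document we already kept.
        if any(jaccard(sh, other) >= threshold for other in kept_shingles):
            continue
        kept.append(doc)
        kept_shingles.append(sh)
    return kept


original = "def add(a, b):\n    return a + b\n"
reindented = "def add(a, b):\n        return a + b"       # exact after normalization
commented = "def add(a, b):\n    return a + b  # sum\n"   # near duplicate
different = "def mul(x, y):\n    return x * y\n"

unique_docs = deduplicate([original, reindented, commented, different])
```

In this sketch the pairwise comparison is quadratic in the number of kept documents, so it only suits small collections; it is meant to show the two filtering stages, not to scale.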

Model Capabilities

Code Generation
Code Explanation
Code Repair
Unit Test Generation
Documentation Generation
Technical Debt Resolution
Vulnerability Detection
Code Translation

Use Cases

Software Development
Python Function Generation
Automatically generates Python function code based on natural language descriptions
Achieves a pass@1 rate of 36% on the MBPP dataset
Code Repair
Automatically fixes erroneous code snippets
Achieves a Python repair pass rate of 18.3% on the HumanEval repair task
Education
Code Explanation
Generates natural language explanations for complex code segments
Achieves a Python explanation pass rate of 25% on the HumanEval explanation task
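
The scores above are pass rates: the share of benchmark problems for which generated code passes the problem's unit tests. As a hedged illustration, the sketch below implements the commonly used unbiased pass@k estimator. How the figures on this page were actually computed is not stated here, and the function names and sample data are assumptions made for the example.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k for one problem.

    n: samples generated, c: samples that passed all tests, k: budget.
    Returns the probability that at least one of k samples drawn
    without replacement from the n is correct.
    """
    if n - c < k:
        # Fewer than k failing samples: any draw of k contains a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results, k: int) -> float:
    """Average pass@k over (n, c) pairs, one pair per problem."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)


# Hypothetical results: one sample per problem, two of four problems solved.
example_results = [(1, 1), (1, 0), (1, 0), (1, 1)]
example_score = benchmark_pass_at_k(example_results, k=1)
```

With one sample per problem, pass@1 reduces to the plain fraction of problems solved, which is how a figure such as 36% on MBPP can be read.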