# Granite-3B-Code-Instruct-128K
Granite-3B-Code-Instruct-128K is a 3B-parameter long-context instruct model. It is fine-tuned from Granite-3B-Code-Base-128K on a combination of permissively licensed data used in training the original Granite code instruct models, along with synthetically generated code instruction datasets tailored for long-context problem solving. The goal is to enhance long-context capability without sacrificing code generation performance on short inputs.

## Features
- Long-Context Capability: Handles coding instructions with long-context input up to 128K tokens.
- Code Generation: Suitable for building coding assistants.
## Installation
Install the required Python libraries, `torch` and `transformers`, using `pip`:

```bash
pip install torch transformers
```
## Usage Examples
### Basic Usage
This is a simple example of how to use the Granite-3B-Code-Instruct-128K model:
```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

device = "cuda"
model_path = "ibm-granite/granite-3b-code-instruct-128k"
tokenizer = AutoTokenizer.from_pretrained(model_path)
# Load the model onto the selected device
model = AutoModelForCausalLM.from_pretrained(model_path, device_map=device)
model.eval()

# Build a chat conversation and apply the model's chat template
chat = [
    {"role": "user", "content": "Write a code to find the maximum value in a list of numbers."},
]
chat = tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)

# Tokenize the prompt and move the tensors to the device
input_tokens = tokenizer(chat, return_tensors="pt")
for i in input_tokens:
    input_tokens[i] = input_tokens[i].to(device)

# Generate and decode the output
output = model.generate(**input_tokens, max_new_tokens=100)
output = tokenizer.batch_decode(output)
for i in output:
    print(i)
```
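Because the model accepts inputs up to 128K tokens, entire source files can be placed directly in the prompt. The snippet below is a minimal sketch of that pattern, reusing the `tokenizer`, `model`, and `device` objects from the example above; the file path and the question are hypothetical placeholders.

```python
# Hypothetical long-context example: ask a question about a large source file.
# "my_project/train.py" is a placeholder path; substitute your own file.
with open("my_project/train.py", "r", encoding="utf-8") as f:
    source_code = f.read()

chat = [
    {
        "role": "user",
        "content": "Here is a Python source file:\n\n" + source_code
                   + "\n\nExplain what the main training loop does.",
    },
]
prompt = tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)
input_tokens = tokenizer(prompt, return_tensors="pt").to(device)
output = model.generate(**input_tokens, max_new_tokens=300)

# Decode only the newly generated tokens, skipping the prompt
new_tokens = output[0][input_tokens["input_ids"].shape[1]:]
print(tokenizer.decode(new_tokens, skip_special_tokens=True))
```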
## Documentation
### Intended Use
The model is designed to respond to coding-related instructions over long-context input up to 128K tokens and can be used to build coding assistants.
### Model Information
#### Training Data
The Granite Code Instruct models are trained on a mix of short- and long-context data:
- Short-Context Instruction Data: CommitPackFT, BigCode-SC2-Instruct, [MathInstruct](https://huggingface.co/datasets/TIGER-Lab/MathInstruct), [MetaMathQA](https://huggingface.co/datasets/meta-math/MetaMathQA), [Glaive-Code-Assistant-v3](https://huggingface.co/datasets/glaiveai/glaive-code-assistant-v3), [Glaive-Function-Calling-v2](https://huggingface.co/datasets/glaiveai/glaive-function-calling-v2), [NL2SQL11](https://huggingface.co/datasets/bugdaryan/sql-create-context-instruction), HelpSteer, [OpenPlatypus](https://huggingface.co/datasets/garage-bAInd/Open-Platypus). It also includes a synthetically generated dataset for API calling and multi-turn code interactions with execution feedback, and a collection of hardcoded prompts.
- Long-Context Instruction Data: A synthetically generated dataset produced by bootstrapping repository-level, file-packed documents through Granite-8b-Code-Instruct to improve the long-context capability of the model.
#### Infrastructure
The Granite Code models are trained on two of IBM's supercomputing clusters, Vela and Blue Vela, which are equipped with NVIDIA A100 and H100 GPUs respectively. These clusters provide a scalable and efficient infrastructure for training the models over thousands of GPUs.
### Model Performance
The model has the following performance metrics:
| Task | Dataset | Metric | Value |
|------|---------|--------|-------|
| Text Generation | bigcode/humanevalpack (HumanEvalSynthesis (Python)) | pass@1 | 53.7 |
| Text Generation | bigcode/humanevalpack (HumanEvalSynthesis (Average)) | pass@1 | 41.4 |
| Text Generation | bigcode/humanevalpack (HumanEvalExplain (Average)) | pass@1 | 25.1 |
| Text Generation | bigcode/humanevalpack (HumanEvalFix (Average)) | pass@1 | 26.2 |
| Text Generation | repoqa (RepoQA (Python@16K)) | pass@1 (thresh=0.5) | 48.0 |
| Text Generation | repoqa (RepoQA (C++@16K)) | pass@1 (thresh=0.5) | 36.0 |
| Text Generation | repoqa (RepoQA (Java@16K)) | pass@1 (thresh=0.5) | 38.0 |
| Text Generation | repoqa (RepoQA (TypeScript@16K)) | pass@1 (thresh=0.5) | 39.0 |
| Text Generation | repoqa (RepoQA (Rust@16K)) | pass@1 (thresh=0.5) | 29.0 |
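For reference, pass@1 reports the fraction of problems for which a sampled completion passes the benchmark's unit tests. The snippet below is a minimal sketch of the standard unbiased pass@k estimator popularized with HumanEval-style evaluations; it is illustrative only and is not the exact harness used to produce the numbers above.

```python
import numpy as np

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for a single problem.

    n: total completions sampled, c: completions that pass the tests,
    k: attempt budget. Returns 1 - C(n-c, k) / C(n, k).
    """
    if n - c < k:
        return 1.0
    return 1.0 - np.prod(1.0 - k / np.arange(n - c + 1, n + 1))

# Example: 2 of 20 samples pass -> pass@1 of 0.10 for this problem;
# the benchmark score averages this quantity over all problems.
print(round(pass_at_k(n=20, c=2, k=1), 2))
```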
## Technical Details
The model is fine-tuned from Granite-3B-Code-Base-128K. By exposing it to both short- and long-context data, the developers aim to enhance its long-context capability without sacrificing code generation performance on short inputs.
## License
The model is released under the Apache 2.0 license.
## Important Note
- The model is primarily fine-tuned on instruction-response pairs across a specific set of programming languages. Its performance may be limited with out-of-domain programming languages; in such cases, providing few-shot examples can help steer the model's output (see the sketch after this list).
- Developers should perform safety testing and target-specific tuning before deploying these models in critical applications. The model also inherits ethical considerations and limitations from its base model. For more information, refer to the [Granite-3B-Code-Base-128K](https://huggingface.co/ibm-granite/granite-3b-code-base-128k) model card.
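The following is a minimal sketch of how few-shot steering might look with the chat template. The example task (generating commented Fortran) and the demonstration turns are hypothetical, and the snippet reuses the `tokenizer`, `model`, and `device` objects from the usage example above.

```python
# Hypothetical few-shot prompt: earlier user/assistant turns demonstrate the
# desired target language and style before the actual request is made.
chat = [
    {"role": "user", "content": "Write a Fortran subroutine that swaps two integers."},
    {"role": "assistant", "content": (
        "subroutine swap(a, b)\n"
        "  integer, intent(inout) :: a, b\n"
        "  integer :: tmp\n"
        "  tmp = a\n"
        "  a = b\n"
        "  b = tmp\n"
        "end subroutine swap"
    )},
    {"role": "user", "content": "Now write a Fortran function that returns the sum of an integer array."},
]
prompt = tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)
input_tokens = tokenizer(prompt, return_tensors="pt").to(device)
output = model.generate(**input_tokens, max_new_tokens=200)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```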