CodeT5+ 16B

Developed by Salesforce
CodeT5+ 16B is the 16B-parameter model in CodeT5+, an open-source family of large language models for code. Its encoder-decoder architecture supports multiple modes, making it suitable for a wide range of code understanding and generation tasks.
Downloads 292
Release Time: 5/17/2023

Model Overview

CodeT5+ is an open-source family of large language models for code. Its encoder-decoder architecture can operate in several modes (encoder-only, decoder-only, or encoder-decoder), which lets one model family cover a wide range of code understanding and generation tasks.
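As a rough illustration of the encoder-decoder mode, the sketch below loads the model through the Hugging Face `transformers` library and completes a code prompt. It assumes the checkpoint is published as `Salesforce/codet5p-16b`, that it needs `trust_remote_code=True`, and that the decoder is seeded with the prompt tokens. None of these details are stated on this page, so check them against the official model card. The helper name `complete_code` is hypothetical.

```python
CHECKPOINT = "Salesforce/codet5p-16b"  # assumed Hub identifier, not stated on this page


def complete_code(prompt: str, max_length: int = 64, device: str = "cuda") -> str:
    """Generate a continuation for `prompt` (hypothetical helper, minimal sketch)."""
    # Heavy dependencies are imported lazily so this module can be imported
    # on machines that do not have torch or transformers installed.
    import torch
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(CHECKPOINT)
    model = AutoModelForSeq2SeqLM.from_pretrained(
        CHECKPOINT,
        torch_dtype=torch.float16,   # half precision to reduce memory use
        low_cpu_mem_usage=True,
        trust_remote_code=True,      # assumption: the checkpoint ships custom model code
    ).to(device)

    encoding = tokenizer(prompt, return_tensors="pt").to(device)
    # Assumption: the decoder starts from the same prompt tokens as the encoder.
    encoding["decoder_input_ids"] = encoding["input_ids"].clone()
    outputs = model.generate(**encoding, max_length=max_length)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)
```

Calling `complete_code("def print_hello_world():")` would download the full 16B checkpoint, so it is only practical on a machine with a large GPU.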

Model Features

Diverse Pretraining Tasks
Pretrained on a mixture of tasks, including span denoising, causal language modeling, contrastive learning, and text-code matching, to learn rich representations from both unimodal code data and bimodal code-text data.
Efficient Pretraining Methods
Initializes model components from off-the-shelf frozen large language models (e.g., CodeGen) so the family scales efficiently to 2B, 6B, and 16B parameters, and adopts a "shallow encoder, deep decoder" architecture.
Instruction Fine-tuning
Instruction-tuned in the style of Code Alpaca to align the model with natural language instructions (see the InstructCodeT5+ 16B version).
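To make the span denoising objective listed above more concrete, here is a minimal sketch of how a source/target pair can be built from a token sequence. It assumes T5-style sentinel tokens named `<extra_id_N>` and takes the spans as explicit input instead of sampling them. The function name `span_corrupt` is hypothetical, and the actual CodeT5+ preprocessing may differ (for example, T5 also appends a closing sentinel to the target).

```python
def span_corrupt(tokens, spans):
    """Replace each (start, length) span with a sentinel token.

    Returns (source, target): the source keeps the unmasked tokens with one
    sentinel per masked span; the target lists each sentinel followed by the
    tokens it replaced.
    """
    source, target = [], []
    cursor = 0
    for i, (start, length) in enumerate(sorted(spans)):
        if start < cursor:
            raise ValueError("spans must not overlap")
        sentinel = f"<extra_id_{i}>"  # assumed T5-style sentinel naming
        source.extend(tokens[cursor:start])
        source.append(sentinel)
        target.append(sentinel)
        target.extend(tokens[start:start + length])
        cursor = start + length
    source.extend(tokens[cursor:])
    return source, target


tokens = "def add ( a , b ) : return a + b".split()
source, target = span_corrupt(tokens, [(1, 1), (9, 3)])
print(" ".join(source))  # def <extra_id_0> ( a , b ) : return <extra_id_1>
print(" ".join(target))  # <extra_id_0> add <extra_id_1> a + b
```

During pretraining the encoder would read the corrupted source and the decoder would be trained to emit the target.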

Model Capabilities

Code Understanding
Code Generation
Text-Code Retrieval
Line-Level Code Completion
Retrieval-Augmented Code Generation

Use Cases

Code Generation
Function Generation
Generate code functions based on natural language descriptions.
In zero-shot text-to-code generation on the HumanEval benchmark, InstructCodeT5+ 16B achieved 35.0% pass@1 and 54.5% pass@10, a new state of the art among open-source code models at the time of release.
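The pass@k metric is commonly computed with the unbiased estimator introduced alongside HumanEval: generate n samples per problem, count the c samples that pass the unit tests, and estimate the probability that at least one of k randomly drawn samples is correct. The sketch below assumes that estimator; this page does not state which evaluation script produced the reported numbers.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem.

    n: number of generated samples, c: number that pass the tests,
    k: sample budget being evaluated (1 <= k <= n).
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    if n - c < k:
        # Fewer than k failing samples: every draw of k includes a pass.
        return 1.0
    # 1 - P(all k drawn samples are failures)
    return 1.0 - comb(n - c, k) / comb(n, k)


# Example: 3 of 10 samples pass, so a single draw succeeds 30% of the time.
print(pass_at_k(n=10, c=3, k=1))
```

The benchmark score is the mean of this per-problem estimate over all problems.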
Code Understanding
Code Retrieval
Retrieve relevant code snippets based on natural language queries.
Improved average MRR by 3.2 across eight text-to-code retrieval tasks.
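For reference, mean reciprocal rank (MRR) averages 1/rank of the first correct result over all queries. The sketch below assumes one gold snippet per query. Retrieval papers often report MRR multiplied by 100, in which case a gain of 3.2 would correspond to 0.032 on the 0 to 1 scale used here, but this page does not state the scale. The function name is hypothetical.

```python
def mean_reciprocal_rank(rankings, gold):
    """Compute MRR given ranked candidate lists and one gold item per query."""
    if not rankings or len(rankings) != len(gold):
        raise ValueError("need one gold item per non-empty list of rankings")
    total = 0.0
    for ranking, target in zip(rankings, gold):
        for rank, item in enumerate(ranking, start=1):
            if item == target:
                total += 1.0 / rank
                break  # only the first hit counts; a miss contributes 0
    return total / len(rankings)


rankings = [["a", "b", "c"], ["b", "a", "c"], ["c", "b", "a"]]
gold = ["a", "a", "x"]  # hit at rank 1, hit at rank 2, no hit
print(mean_reciprocal_rank(rankings, gold))  # 0.5
```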
Mathematical Programming
Mathematical Problem Solving
Solve mathematical programming problems such as MathQA-Python and GSM8K-Python.
CodeT5+ models with fewer than one billion parameters significantly outperformed several much larger models, including ones with up to 137B parameters.