Codet5p 770m

Developed by Salesforce
CodeT5+ is an open-source family of large language models for code, featuring an encoder-decoder architecture that supports multiple modes, suitable for a wide range of code understanding and generation tasks.
Downloads 4,801
Release Time: 5/13/2023

Model Overview

CodeT5+ is a novel open-source family of large language models for code, featuring an encoder-decoder architecture that flexibly supports multiple modes (including encoder-only, decoder-only, and encoder-decoder), suitable for a wide range of code understanding and generation tasks.
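As a rough illustration of the encoder-decoder mode, the sketch below loads the checkpoint through the Hugging Face `transformers` library and generates a completion. It assumes the Hub ID `Salesforce/codet5p-770m`, that `transformers` and PyTorch are installed, and that the weights can be downloaded; the helper name `complete_code` and its defaults are illustrative and not part of any official API.

```python
def complete_code(prompt, checkpoint="Salesforce/codet5p-770m",
                  max_length=32, device="cpu"):
    """Generate a completion for `prompt` with a CodeT5+ seq2seq checkpoint.

    Assumption: the checkpoint loads as a T5-style encoder-decoder model.
    """
    # Imported lazily so that defining this helper does not require the library.
    from transformers import AutoTokenizer, T5ForConditionalGeneration

    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    model = T5ForConditionalGeneration.from_pretrained(checkpoint).to(device)

    # Tokenize the prompt and move the tensors to the target device.
    inputs = tokenizer(prompt, return_tensors="pt").to(device)

    # Decode with the default generation settings, capped at max_length tokens.
    output_ids = model.generate(**inputs, max_length=max_length)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

A call such as `complete_code("def print_hello_world():<extra_id_0>")` asks the model to fill in the span marked by the sentinel token. The first call downloads the weights, so it needs network access and some disk space.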

Model Features

Diverse Pretraining Tasks
Learns rich representations from unimodal code data and bimodal code-text data through various pretraining tasks such as span denoising, causal language modeling, contrastive learning, and text-code matching.
Computationally Efficient Pretraining
Uses a compute-efficient pretraining strategy: components are initialized from existing large language models and kept frozen, so model size can be scaled up without training every parameter from scratch.
Flexible Support for Multiple Modes
Supports multiple modes including encoder-only, decoder-only, and encoder-decoder, suitable for a wide range of code understanding and generation tasks.
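The span denoising objective listed under Diverse Pretraining Tasks can be pictured with a small sketch. It follows the general T5-style recipe, in which each masked span is replaced by a sentinel token in the input and the target lists each sentinel followed by the hidden tokens. The function name, the whitespace tokenization, and the hand-picked spans are assumptions for illustration; actual pretraining samples spans randomly over subword tokens, and this is not the authors' implementation.

```python
def span_denoise(tokens, spans):
    """Build a T5-style (source, target) pair by masking the given spans.

    tokens: list of string tokens.
    spans:  sorted, non-overlapping (start, end) index pairs, end exclusive.
    """
    source, target = [], []
    cursor = 0
    for sentinel_id, (start, end) in enumerate(spans):
        sentinel = f"<extra_id_{sentinel_id}>"
        source.extend(tokens[cursor:start])  # keep the unmasked tokens before the span
        source.append(sentinel)              # one sentinel stands in for the whole span
        target.append(sentinel)              # the target repeats the sentinel ...
        target.extend(tokens[start:end])     # ... followed by the tokens it hides
        cursor = end
    source.extend(tokens[cursor:])           # keep whatever follows the last span
    return source, target


tokens = "def add ( a , b ) : return a + b".split()
source, target = span_denoise(tokens, [(1, 2), (8, 12)])
print(" ".join(source))  # def <extra_id_0> ( a , b ) : <extra_id_1>
print(" ".join(target))  # <extra_id_0> add <extra_id_1> return a + b
```

The model is trained to produce the target sequence from the corrupted source, which forces it to reconstruct code from the surrounding context.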

Model Capabilities

Code Understanding
Code Generation
Text-Code Retrieval
Line-level Code Completion
Retrieval-Augmented Code Generation

Use Cases

Code Generation
Function Completion
Automatically completes the function body based on the function signature
In the zero-shot text-to-code generation task on the HumanEval benchmark, InstructCodeT5+ 16B set a new record for open-source models with 35.0% pass@1 and 54.5% pass@10.
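For readers unfamiliar with pass@k: it estimates the probability that at least one of k sampled solutions passes a problem's unit tests. The sketch below uses the unbiased estimator commonly paired with HumanEval, 1 - C(n-c, k) / C(n, k) for n samples of which c pass. The page does not say how the reported scores were computed, so treat this as the usual convention rather than the authors' exact procedure; the sample counts are made up.

```python
from math import comb


def pass_at_k(n, c, k):
    """Unbiased pass@k estimate for one problem.

    n: number of samples generated, c: number that pass the tests, k: budget.
    """
    if k > n:
        raise ValueError("k must not exceed the number of samples n")
    if n - c < k:
        # Fewer than k failing samples exist, so any k samples include a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


# Hypothetical problem: 20 samples generated, 7 of them pass.
print(pass_at_k(20, 7, 1))
print(pass_at_k(20, 7, 10))
```

A benchmark-level score is the mean of this per-problem estimate over all problems.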
Code Understanding
Code Retrieval
Retrieves relevant code snippets based on natural language descriptions
Achieved an average MRR improvement of 3.2 points across 8 text-to-code retrieval tasks.
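MRR (mean reciprocal rank) scores a retrieval system by the position of the correct item in each ranked result list: 1 for rank 1, 1/2 for rank 2, and so on, averaged over all queries. The sketch below is a minimal version assuming one correct snippet per query; the function name and the toy data are illustrative.

```python
def mean_reciprocal_rank(ranked_results, relevant):
    """Average of 1/rank of the correct item over all queries.

    ranked_results: one ranked candidate list per query.
    relevant:       the single correct candidate for each query.
    """
    total = 0.0
    for candidates, gold in zip(ranked_results, relevant):
        for rank, candidate in enumerate(candidates, start=1):
            if candidate == gold:
                total += 1.0 / rank
                break  # only the first (and only) correct hit counts
        # A query whose correct item is never retrieved contributes 0.
    return total / len(ranked_results)


# Toy example: correct snippet at rank 1, at rank 2, and missing.
ranked = [["s1", "s2"], ["s3", "s4"], ["s5", "s6"]]
gold = ["s1", "s4", "s9"]
print(mean_reciprocal_rank(ranked, gold))  # 0.5
```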