
CodeModernBERT-Owl-3.0

Developed by Shuu12121
CodeModernBERT-Owl-3.0 is the final pre-trained version of the multilingual, long-context encoder model in the CodeModernBERT series. It is optimized for downstream code-related tasks such as code search, code summarization, error repair, and representation learning.
Downloads 119
Release Time: 6/20/2025

Model Overview

This model starts from the pre-trained checkpoint CodeModernBERT-Owl-3.0-Pre and is further pre-trained to better capture the structural patterns and semantics of source code across multiple programming languages.

Model Features

Long context window
Supports a context window of 2048 tokens, suitable for understanding long code.
Multilingual support
Trained on 11.2 million functions in 8 programming languages, supporting multilingual code understanding.
Downstream task optimization
Fine-tuned for downstream tasks such as code search, semantic embedding, summarization, and cloze-style error repair.
High performance
Achieved the highest MRR in all languages of the CodeSearchNet test set, demonstrating excellent cross-language consistency.
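
Inputs longer than the 2048-token window have to be truncated or split before encoding. The following is a minimal sketch of one way to split a tokenized input into overlapping windows. It works on plain lists of token IDs and does not call the model or its tokenizer. The function name split_into_windows and the overlap of 128 tokens are illustrative assumptions, and the sketch does not account for any special tokens a real tokenizer may add to each window.

```python
from typing import List

MAX_TOKENS = 2048  # context window stated in the model description


def split_into_windows(token_ids: List[int], max_len: int = MAX_TOKENS,
                       overlap: int = 128) -> List[List[int]]:
    """Split token_ids into windows of at most max_len tokens.

    Consecutive windows share `overlap` tokens so that code cut at a
    window boundary still appears with some surrounding context.
    The overlap value is an assumption, not a documented setting.
    """
    if max_len <= 0:
        raise ValueError("max_len must be positive")
    if not 0 <= overlap < max_len:
        raise ValueError("overlap must satisfy 0 <= overlap < max_len")
    if not token_ids:
        return []

    step = max_len - overlap
    windows: List[List[int]] = []
    start = 0
    while True:
        windows.append(token_ids[start:start + max_len])
        # Stop once the current window reaches the end of the input.
        if start + max_len >= len(token_ids):
            break
        start += step
    return windows
```

Each window can then be encoded separately. How the per-window results are combined (for example by averaging embeddings) is a design choice that the model description does not specify.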

Model Capabilities

Code search
Code summarization
Error repair
Representation learning
Multilingual code understanding

Use Cases

Code search
Cross-language code search
Use model embeddings for cross-language code search tasks.
On the CodeSearchNet test set, the model reaches an MRR of 0.8814 for Python.
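
To make the metric concrete, here is a minimal sketch of embedding-based ranking and the mean reciprocal rank (MRR) calculation. It assumes queries and code snippets have already been turned into vectors. The three-dimensional vectors and snippet names below are made-up placeholders rather than model outputs, and the helper names are illustrative.

```python
import math
from typing import Dict, List, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either has zero norm)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rank_candidates(query_vec: Sequence[float],
                    code_vecs: Dict[str, Sequence[float]]) -> List[str]:
    """Return snippet names ordered from most to least similar to the query."""
    return sorted(code_vecs,
                  key=lambda name: cosine(query_vec, code_vecs[name]),
                  reverse=True)


def mean_reciprocal_rank(rankings: List[List[str]], gold: List[str]) -> float:
    """Average of 1/rank of the correct snippet over all queries.

    A query whose correct snippet is missing from its ranking contributes 0.
    """
    if not gold:
        return 0.0
    total = 0.0
    for ranked, target in zip(rankings, gold):
        if target in ranked:
            total += 1.0 / (ranked.index(target) + 1)
    return total / len(gold)


# Placeholder embeddings (hypothetical values, for illustration only).
code_vecs = {
    "read_file": [0.9, 0.1, 0.0],
    "sort_list": [0.1, 0.9, 0.1],
    "http_get": [0.0, 0.2, 0.9],
}
queries = [[1.0, 0.0, 0.0], [0.3, 0.5, 0.6]]
gold = ["read_file", "sort_list"]

rankings = [rank_candidates(q, code_vecs) for q in queries]
mrr = mean_reciprocal_rank(rankings, gold)
```

In this toy setup the first query ranks its correct snippet first and the second ranks it second, so the MRR is the average of 1 and 1/2. An MRR close to 1 therefore means the correct snippet is usually at or near the top of the ranking.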
Code summarization
Automatically generate code summaries
Use the model to generate natural language summaries of code snippets.
Error repair
Cloze-style error repair
Use the model's fill-mask capability to repair code errors.
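
The cloze-style flow can be sketched as follows: replace the suspect token with a mask token, ask a fill-mask scorer for candidates, and substitute the highest-scoring one. This sketch does not load the model. The "[MASK]" string, the function names, and the toy_scores stand-in (with made-up scores) are all assumptions for illustration, and the real tokenizer defines its own mask token.

```python
from typing import Callable, Dict, List

MASK_TOKEN = "[MASK]"  # assumption: the real tokenizer defines its own mask token


def mask_token_at(tokens: List[str], index: int,
                  mask_token: str = MASK_TOKEN) -> List[str]:
    """Return a copy of tokens with the token at `index` replaced by the mask."""
    if not 0 <= index < len(tokens):
        raise IndexError("index out of range")
    masked = list(tokens)
    masked[index] = mask_token
    return masked


def repair(tokens: List[str], index: int,
           score_fn: Callable[[List[str]], Dict[str, float]],
           mask_token: str = MASK_TOKEN) -> List[str]:
    """Mask the suspect token and substitute the best-scoring candidate.

    score_fn stands in for a fill-mask model: it receives the masked
    token list and returns candidate tokens with scores.
    """
    masked = mask_token_at(tokens, index, mask_token)
    scores = score_fn(masked)
    if not scores:
        return list(tokens)  # no candidates: leave the input unchanged
    best = max(scores, key=scores.get)
    repaired = list(masked)
    repaired[index] = best
    return repaired


def toy_scores(masked_tokens: List[str]) -> Dict[str, float]:
    """Hypothetical stand-in for a fill-mask model; the scores are made up."""
    return {"<": 0.7, "<=": 0.2, "=": 0.1}


buggy = ["while", "i", "=", "n", ":"]
fixed = repair(buggy, 2, toy_scores)
```

In practice, score_fn would wrap a call to the model's fill-mask head, and the position to mask would come from a linter, a failing test, or a separate error-localization step.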
© 2025 AIbase