L

Llm Jp Modernbert Base

Developed by llm-jp
A Japanese large language model based on the modernBERT-base architecture, supporting a maximum sequence length of 8192, trained on 3.4TB of Japanese corpus
Downloads 1,398
Release Time : 4/25/2025

Model Overview

This model is a BERT variant optimized for Japanese, adopting the modernBERT architecture and llm-jp-tokenizer, suitable for Japanese text understanding and generation tasks

Model Features

Long Context Support
Supports a maximum sequence length of 8192, suitable for processing long texts
Large-scale Training Data
Trained using the Japanese subset (3.4TB) of llm-jp-corpus v4
Optimized Tokenizer
Uses the llm-jp-tokenizer, specifically optimized for Japanese text

Model Capabilities

Japanese Text Understanding
Masked Language Prediction
Long Text Processing

Use Cases

Natural Language Processing
Japanese Text Completion
Predicts masked parts in the text
Example correctly predicts 'Tokyo' in '日本の首都は東京です'
Japanese Text Classification
Can be used for tasks such as sentiment analysis and topic classification
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase