I

Indobert Large P1

Developed by indobenchmark
IndoBERT is an advanced Indonesian language model based on the BERT model, trained with masked language modeling and next - sentence prediction objectives.
Downloads 1,686
Release Time : 3/2/2022

Model Overview

IndoBERT is a pre - trained language model optimized for the Indonesian language, suitable for various natural language processing tasks.

Model Features

Large - scale pre - training
Pre - trained using the Indo4B dataset (23.43GB of text)
Case - insensitive
The model does not distinguish between uppercase and lowercase when processing text
Two - phase training
The model undergoes a two - phase training process (P1 and P2)

Model Capabilities

Text representation learning
Language understanding
Text classification
Question - answering system
Named entity recognition

Use Cases

Natural language processing
Text classification
Classify Indonesian texts
Question - answering system
Build an Indonesian question - answering system
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase