B

Bangla Bert Base

Developed by sagorsarker
Bangla BERT Base is a pre-trained Bengali language model based on the BERT architecture, supporting various downstream NLP tasks.
Downloads 7,282
Release Time : 3/2/2022

Model Overview

This is a BERT model specifically optimized for Bengali, pre-trained using masked language modeling, suitable for natural language processing tasks such as text classification and named entity recognition.

Model Features

Bengali-Specific Pre-training
Pre-trained specifically for Bengali, outperforming multilingual models on Bengali language tasks.
Optimized Vocabulary
Uses the BNLP toolkit to train a Bengali sentence-piece model containing 102,025 vocabulary items.
Comprehensive Evaluation
Achieves state-of-the-art results on multiple Bengali benchmark tests.

Model Capabilities

Text Classification
Named Entity Recognition
Masked Language Prediction
Sentence Tokenization

Use Cases

Sentiment Analysis
Bengali Sentiment Classification
Analyze the sentiment tendency of Bengali text
Achieved 70.37% accuracy in benchmark tests
Content Moderation
Hate Speech Detection
Identify hate speech in Bengali
Achieved 71.83% accuracy in benchmark tests
News Classification
News Topic Classification
Classify Bengali news by topic
Achieved 89.19% accuracy in benchmark tests
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase