Bert Large Arabic

Developed by asafaya
Pretrained BERT large language model for Arabic, supporting Modern Standard Arabic and some dialects
Downloads 278
Release Time: 3/2/2022

Model Overview

This is a large language model based on the BERT architecture and pretrained on Arabic text. It targets Arabic text processing and is suitable for a range of natural language processing applications.

Model Features

Large-scale Pretraining: Pretrained on an Arabic corpus of 8.2 billion words, including OSCAR and Wikipedia data.
Dialect Support: Covers Modern Standard Arabic as well as some Arabic dialect content.
Optimized Training: Adjusts the original BERT training parameters, increasing the number of training steps to 3 million.

Model Capabilities

Text representation learning
Masked language modeling
Arabic text understanding
Named entity recognition
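
The masked language modeling capability listed above can be tried with a short sketch like the one below. It assumes the model is published on the Hugging Face Hub as asafaya/bert-large-arabic and that the transformers library is installed; neither is stated on this page. The helper names mask_word and top_predictions are hypothetical and exist only for illustration.

```python
# Assumption: the Hub identifier below is inferred from the developer and
# model names on this page; adjust it if the model lives elsewhere.
MODEL_ID = "asafaya/bert-large-arabic"


def mask_word(sentence: str, target: str, mask_token: str = "[MASK]") -> str:
    """Replace the first occurrence of `target` with the BERT mask token."""
    if target not in sentence:
        raise ValueError(f"{target!r} does not occur in the sentence")
    return sentence.replace(target, mask_token, 1)


def top_predictions(masked_sentence: str, k: int = 5):
    """Return (token, score) pairs for a sentence containing one mask token.

    transformers is imported lazily so that mask_word can be used
    without the library or a model download.
    """
    from transformers import pipeline

    fill = pipeline("fill-mask", model=MODEL_ID)
    return [(r["token_str"], r["score"]) for r in fill(masked_sentence, top_k=k)]
```

Calling top_predictions(mask_word("عاصمة فرنسا هي باريس", "باريس")) would download the model on first use and return its highest-scoring candidates for the masked position. The actual predictions depend on the checkpoint and are not shown here.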

Use Cases

Social Media Analysis
Offensive Speech Detection: Identifies offensive content in Arabic social media. The model achieved good performance in SemEval-2020 Task 12.
Text Classification
Arabic Text Classification: Can be used for news classification, sentiment analysis, and other tasks.
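
For the classification use cases above, one plausible starting point is to attach a classification head to the pretrained encoder and fine-tune it on labeled data. The sketch below assumes the Hugging Face Hub identifier asafaya/bert-large-arabic and the transformers library. The label names and the helpers build_label_maps and load_classifier are hypothetical, and this is not presented as the procedure used for the SemEval-2020 results.

```python
# Assumption: Hub identifier inferred from the names on this page.
MODEL_ID = "asafaya/bert-large-arabic"


def build_label_maps(labels):
    """Build the id-to-label and label-to-id dictionaries a classifier config expects."""
    id2label = {i: name for i, name in enumerate(labels)}
    label2id = {name: i for i, name in id2label.items()}
    return id2label, label2id


def load_classifier(labels):
    """Load the tokenizer and the encoder with a fresh classification head.

    The head is randomly initialized, so the returned model must be
    fine-tuned on labeled Arabic text before its outputs are meaningful.
    """
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    id2label, label2id = build_label_maps(labels)
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForSequenceClassification.from_pretrained(
        MODEL_ID,
        num_labels=len(labels),
        id2label=id2label,
        label2id=label2id,
    )
    return tokenizer, model
```

For an offensive speech detector, load_classifier(["not_offensive", "offensive"]) would give a two-class model ready for fine-tuning. A sentiment or news classifier would only need a different label list.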