BERT Base QARiB60 1790k

Developed by ahmedabdelali
QARiB is an Arabic and dialect BERT model trained on approximately 420 million tweets and 180 million text sentences, supporting various downstream NLP tasks.
Downloads 16
Release Time: 3/2/2022

Model Overview

This model is optimized for Arabic and its dialects. It is pretrained for masked language modeling and can be fine-tuned for a range of downstream Arabic NLP tasks.
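As a minimal sketch of the masked language modeling use mentioned above, the snippet below loads the model through the Hugging Face `transformers` fill-mask pipeline. The Hub ID `ahmedabdelali/bert-base-qarib60_1790k` is an assumption inferred from the developer and model name on this page, and `build_fill_mask`, `summarize`, and `demo` are hypothetical helper names introduced only for illustration.

```python
from typing import Dict, List

# Assumption: Hub ID inferred from the developer and model name on this page.
MODEL_ID = "ahmedabdelali/bert-base-qarib60_1790k"


def build_fill_mask(model_id: str = MODEL_ID):
    """Create a fill-mask pipeline (downloads the weights on first use)."""
    # Imported lazily so the formatting helper below works without transformers.
    from transformers import pipeline
    return pipeline("fill-mask", model=model_id)


def summarize(predictions: List[Dict], top_k: int = 3) -> List[str]:
    """Format the highest-scoring candidates as 'token (score)' strings."""
    ranked = sorted(predictions, key=lambda p: p["score"], reverse=True)
    return [f'{p["token_str"]} ({p["score"]:.2f})' for p in ranked[:top_k]]


def demo() -> None:
    """Predict the masked word in an illustrative dialectal sentence."""
    fill_mask = build_fill_mask()
    # Use the tokenizer's own mask token rather than hard-coding "[MASK]".
    text = f"شو عندكم يا {fill_mask.tokenizer.mask_token}"
    print(summarize(fill_mask(text)))
```

Calling `demo()` requires `transformers` and a backend such as PyTorch to be installed, plus network access to fetch the weights. For a single mask, the pipeline returns a list of dictionaries with `score` and `token_str` keys, which is what `summarize` relies on.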

Model Features

Large-scale Arabic Training Data
Trained on approximately 420 million tweets and 180 million text sentences, covering Modern Standard Arabic and dialects.
Multi-domain Data Integration
Integrates Twitter data, Arabic Billion Word Corpus, Abulkhair Corpus, and OPUS multilingual corpus.
Dialect Support
Specially optimized for processing Arabic dialects.
High Performance
Outperforms multilingual BERT/AraBERT/ArabicBERT in five downstream NLP tasks.

Model Capabilities

Arabic Text Understanding
Dialect Identification
Sentiment Analysis
Named Entity Recognition
Offensive Language Detection

Use Cases

Social Media Analysis
Arabic Tweet Sentiment Analysis
Analyze sentiment tendencies in Arabic tweets.
Reported to outperform other Arabic BERT models on this task.
Dialect Identification
Identify Arabic dialects in text.
Reported to achieve high accuracy.
Text Processing
Named Entity Recognition
Identify entities such as person names and locations in Arabic text.
Offensive Language Detection
Detect offensive content in Arabic text.
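For the named entity recognition use case above, a fine-tuned token classifier typically emits one tag per token, and those tags still need to be merged into entities. The sketch below assumes a BIO tagging scheme with hypothetical `PER` and `LOC` labels; the page does not specify the label set, and `bio_to_spans` is an illustrative helper, not part of the model.

```python
from typing import List, Optional, Tuple


def bio_to_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Merge BIO-tagged tokens into (entity_text, entity_type) pairs."""
    spans: List[Tuple[str, str]] = []
    current: List[str] = []
    current_type: Optional[str] = None

    def flush() -> None:
        # Close the entity being built, if any.
        nonlocal current, current_type
        if current and current_type is not None:
            spans.append((" ".join(current), current_type))
        current, current_type = [], None

    for token, tag in zip(tokens, tags):
        if tag.startswith("I-") and tag[2:] == current_type:
            # Continuation of the entity that is currently open.
            current.append(token)
        elif tag.startswith(("B-", "I-")):
            # A new entity starts; a stray I- tag is treated like B-.
            flush()
            current, current_type = [token], tag[2:]
        else:
            # "O" (outside) ends any open entity.
            flush()
    flush()
    return spans


# Illustrative sentence: "Ahmed Ali lives in Cairo".
tokens = ["أحمد", "علي", "يسكن", "في", "القاهرة"]
tags = ["B-PER", "I-PER", "O", "O", "B-LOC"]
entities = bio_to_spans(tokens, tags)
```

Here `entities` holds one person span built from the first two tokens and one location span for the last token. Treating a stray `I-` tag as the start of a new entity is a design choice that keeps malformed predictions rather than dropping them.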