
Bert Large Arabertv02 Twitter

Developed by aubmindlab
AraBERTv0.2-Twitter is a pre-trained language model for Arabic dialects and tweets. It is based on the BERT architecture, and its vocabulary has been extended with emojis and common social media words.
Downloads 312
Release Time: 3/2/2022

Model Overview

This model was obtained by continued pre-training on approximately 60 million Arabic tweets, which adapts it to Arabic dialects and social media text.

Model Features

Dialect Optimization
Specially optimized for Arabic dialects and tweet content.
Emoji Support
Added emojis and common social media vocabulary to the lexicon.
Short Text Optimization
Trained with a sequence length of 64 tokens, which suits short social media texts; longer inputs should be truncated to that length.
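
Because the model was trained on sequences of 64 tokens, inputs are usually truncated to that length before encoding. The following is a minimal sketch of that step, not an official recipe. It assumes the Hugging Face transformers library is installed and that the model is published on the Hub as aubmindlab/bert-large-arabertv02-twitter (an identifier inferred from the model name and developer, so verify it before use). The helper names load_tokenizer and encode_tweets are hypothetical.

```python
# Assumed Hub identifier, inferred from the model name and developer; verify before use.
MODEL_ID = "aubmindlab/bert-large-arabertv02-twitter"
# Sequence length stated in the model description.
MAX_LEN = 64


def load_tokenizer(model_id=MODEL_ID):
    """Load the tokenizer from the Hub (imported lazily; requires network access)."""
    from transformers import AutoTokenizer
    return AutoTokenizer.from_pretrained(model_id)


def encode_tweets(texts, tokenizer, max_len=MAX_LEN):
    """Encode a batch of tweets, truncating each one to max_len tokens."""
    return tokenizer(
        texts,
        truncation=True,     # cut inputs longer than max_len
        max_length=max_len,
        padding=True,        # pad shorter inputs to the longest in the batch
    )


# Example usage (downloads the tokenizer on first call):
# batch = encode_tweets(["tweet one", "tweet two"], load_tokenizer())
```

Passing the tokenizer in as an argument keeps the encoding step separate from the download, so the same function works with a locally cached tokenizer.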

Model Capabilities

Arabic Text Understanding
Social Media Text Processing
Masked Language Prediction

Use Cases

Social Media Analysis
Arabic Tweet Sentiment Analysis
Analyze sentiment tendencies in Arabic tweets.
Dialect Text Understanding
Process dialect texts from different Arabic regions.
Language Model Applications
Text Completion
Predict masked words or phrases.
Example: given the input 'The capital of Lebanon is [MASK]', the model can predict 'Beirut' for the masked position.
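
The masked-word example can be tried with a fill-mask pipeline. This is a sketch under stated assumptions rather than the developers' reference usage: it assumes the Hugging Face transformers library, the Hub identifier aubmindlab/bert-large-arabertv02-twitter (inferred from the model name, so verify it before use), and an Arabic rendering of the example sentence. The helpers top_predictions and predict_mask are hypothetical names.

```python
# Assumed Hub identifier, inferred from the model name and developer; verify before use.
MODEL_ID = "aubmindlab/bert-large-arabertv02-twitter"


def top_predictions(results, k=3):
    """Return the k highest-scoring (token, score) pairs from fill-mask output.

    `results` is a list of dicts with "token_str" and "score" keys, which is
    the shape the fill-mask pipeline returns for a single [MASK] position.
    """
    ranked = sorted(results, key=lambda r: r["score"], reverse=True)
    return [(r["token_str"], round(r["score"], 4)) for r in ranked[:k]]


def predict_mask(text, k=3, model_id=MODEL_ID):
    """Run the fill-mask pipeline on `text` (imported lazily; downloads the model)."""
    from transformers import pipeline
    fill = pipeline("fill-mask", model=model_id)
    return top_predictions(fill(text, top_k=k), k)


# Example usage, with an Arabic rendering of "The capital of Lebanon is [MASK]":
# print(predict_mask("عاصمة لبنان هي [MASK] ."))
```

The actual predictions and scores depend on the downloaded weights, so no expected output is shown here.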