DziriBERT
The first Transformer-based language model pre-trained specifically for Algerian dialect, supporting mixed Arabic and Latin script input.
Downloads 7,759
Release Time: 3/2/2022
Model Overview
DziriBERT is a BERT model pre-trained specifically for Algerian dialect. It handles text written in Arabic script, Latin script, or a mix of both, and achieves strong classification performance despite being pre-trained on a limited amount of data.
Model Features
Dialect Support
The first pre-trained Transformer language model dedicated to Algerian dialect
Efficient with Small Data
Pre-trained on only about 1 million tweets, yet achieves state-of-the-art performance on Algerian text classification tasks
Mixed Script Processing
Capable of processing Algerian dialect content written in both Arabic and Latin scripts
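To make "mixed script" concrete, the following minimal sketch labels a piece of text as Arabic-script, Latin-script, or mixed before it is passed to a model. It is not part of DziriBERT; the function names script_of, script_profile, and classify_script are hypothetical helpers, and the sample phrases are illustrative only. It uses only the Python standard library.

```python
import unicodedata


def script_of(ch: str) -> str:
    """Return 'arabic', 'latin', or 'other' for a single character."""
    if not ch.isalpha():
        # Digits, spaces, punctuation, and diacritics are not counted as letters.
        return "other"
    name = unicodedata.name(ch, "")
    if name.startswith("ARABIC"):
        return "arabic"
    if name.startswith("LATIN"):
        return "latin"
    return "other"


def script_profile(text: str) -> dict:
    """Count how many characters of each script appear in the text."""
    counts = {"arabic": 0, "latin": 0, "other": 0}
    for ch in text:
        counts[script_of(ch)] += 1
    return counts


def classify_script(text: str) -> str:
    """Label text as 'arabic', 'latin', 'mixed', or 'none' (no letters)."""
    counts = script_profile(text)
    has_arabic = counts["arabic"] > 0
    has_latin = counts["latin"] > 0
    if has_arabic and has_latin:
        return "mixed"
    if has_arabic:
        return "arabic"
    if has_latin:
        return "latin"
    return "none"
```

A profile like this can help when auditing a dataset, for example to check how much of a tweet collection is written in each script. Digits used as letters in Latin-script dialect writing are counted as "other" here, which is a simplification.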
Model Capabilities
Masked language modeling
Dialect text understanding
Social media text processing
Text classification
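The masked language modeling capability can be sketched with the Hugging Face transformers fill-mask pipeline. This is a minimal sketch under assumptions not stated on this page: the Hub identifier "alger-ia/dziribert" should be verified before use, the transformers library and a backend such as PyTorch must be installed, and load_fill_mask and top_tokens are hypothetical helper names.

```python
from typing import Callable, List

# Assumed Hugging Face Hub identifier; verify it before relying on it.
MODEL_ID = "alger-ia/dziribert"


def load_fill_mask():
    """Build a fill-mask pipeline (downloads the model on first use)."""
    # Imported lazily so the helpers below can be used without transformers.
    from transformers import pipeline

    return pipeline("fill-mask", model=MODEL_ID)


def top_tokens(fill_mask: Callable, text: str, k: int = 3) -> List[str]:
    """Return the k highest-scoring fill-ins for text containing ONE mask token.

    With a single mask, the pipeline returns a list of dicts that each carry
    a 'token_str' and a 'score'.
    """
    predictions = fill_mask(text, top_k=k)
    return [p["token_str"] for p in predictions]
```

A typical call would first run fill_mask = load_fill_mask(), then build the input with the tokenizer's own mask token (fill_mask.tokenizer.mask_token) rather than hard-coding it, and finally pass the masked sentence to top_tokens.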
Use Cases
Social Media Analysis
Dialect Tweet Classification
Classification and analysis of social media content in Algerian dialect
Achieves state-of-the-art results on Algerian text classification datasets
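For tweet classification, one possible starting point is to load the pre-trained encoder with a sequence classification head and fine-tune it on labeled data. The sketch below is an assumption-laden outline, not the authors' training setup: the Hub identifier "alger-ia/dziribert" should be verified, the label set is hypothetical, and the classification head is randomly initialized until it has been fine-tuned, so predictions are meaningless before training.

```python
from typing import List, Sequence

# Assumed Hugging Face Hub identifier; verify it before relying on it.
MODEL_ID = "alger-ia/dziribert"


def load_classifier(num_labels: int):
    """Load the tokenizer and the encoder with a new, untrained classification head."""
    # Imported lazily so pick_label can be used without transformers installed.
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForSequenceClassification.from_pretrained(
        MODEL_ID, num_labels=num_labels
    )
    return tokenizer, model


def pick_label(logits: Sequence[float], labels: List[str]) -> str:
    """Map one row of logits to the label with the highest score."""
    best = max(range(len(logits)), key=lambda i: logits[i])
    return labels[best]


def classify(text: str, tokenizer, model, labels: List[str]) -> str:
    """Classify one text; only meaningful after the model has been fine-tuned."""
    import torch

    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()
    return pick_label(logits, labels)
```

Fine-tuning itself (data loading, optimizer, epochs) is left out here because this page gives no details about it.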
Linguistic Research
Dialect Variation Study
Supports linguistic research on Algerian dialect variations