Darijabert
D
Darijabert
Developed by SI2M-Lab
The first BERT model specifically designed for the Moroccan Arabic dialect 'Darija', based on the BERT-base architecture, trained on approximately 3 million Darija dialect text sequences.
Downloads 554
Release Time : 3/2/2022
Model Overview
DarijaBERT is a natural language processing model tailored for the Moroccan dialect, capable of understanding and processing Darija dialect text, suitable for tasks such as text classification and sentiment analysis.
Model Features
First Darija dialect model
The first BERT model specifically designed for the Moroccan Arabic dialect 'Darija', filling the gap in NLP for this dialect.
Diverse training data
Training data comes from Darija dialect stories, YouTube comments, and tweets, covering various text types and sources.
Open-source availability
The model is open-sourced via the Huggingface library, making it easy for researchers and developers to use.
Model Capabilities
Text understanding
Text classification
Sentiment analysis
Dialect processing
Use Cases
Social media analysis
Darija dialect comment analysis
Analyze Darija dialect comments on Moroccan social media for sentiment analysis or topic classification.
Cultural studies
Darija dialect text research
Used to study the grammar, vocabulary, and cultural characteristics of the Moroccan dialect.
Featured Recommended AI Models