D

Darijabert

Developed by SI2M-Lab
The first BERT model specifically designed for the Moroccan Arabic dialect 'Darija', based on the BERT-base architecture, trained on approximately 3 million Darija dialect text sequences.
Downloads 554
Release Time : 3/2/2022

Model Overview

DarijaBERT is a natural language processing model tailored for the Moroccan dialect, capable of understanding and processing Darija dialect text, suitable for tasks such as text classification and sentiment analysis.

Model Features

First Darija dialect model
The first BERT model specifically designed for the Moroccan Arabic dialect 'Darija', filling the gap in NLP for this dialect.
Diverse training data
Training data comes from Darija dialect stories, YouTube comments, and tweets, covering various text types and sources.
Open-source availability
The model is open-sourced via the Huggingface library, making it easy for researchers and developers to use.

Model Capabilities

Text understanding
Text classification
Sentiment analysis
Dialect processing

Use Cases

Social media analysis
Darija dialect comment analysis
Analyze Darija dialect comments on Moroccan social media for sentiment analysis or topic classification.
Cultural studies
Darija dialect text research
Used to study the grammar, vocabulary, and cultural characteristics of the Moroccan dialect.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase