KuBERT Central Kurdish BERT Model

Developed by asosoft
KuBERT is a Central Kurdish language model based on the BERT framework, designed to address the scarcity of Kurdish language resources and enhance computational linguistics capabilities.
Downloads 128.71k
Release Time: 2/15/2024

Model Overview

The model applies the BERT architecture to Central Kurdish, filling a gap in Kurdish NLP tooling and supporting downstream language tasks such as text classification and sentiment analysis.

Model Features

Large-scale Kurdish training
Trained on three corpora totaling 296.5 million tokens to ensure robust language understanding.
Dedicated tokenizer
Equipped with a tokenizer specifically optimized for Kurdish language processing.
Multi-source data integration
Combines Kurdish text from multiple sources, including AsoSoft and OSCAR 2019, to cover diverse language scenarios.
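
As a rough illustration of how the dedicated tokenizer and the pretrained weights might be loaded, here is a minimal sketch using the Hugging Face `transformers` library. The Hub identifier in `MODEL_ID` is an assumption (it is not stated on this page) and should be checked against the publisher's repository; `load_kubert` and `tokenize` are hypothetical helper names introduced only for this example.

```python
# Assumed Hugging Face Hub id; not given in the text above, verify before use.
MODEL_ID = "asosoft/KuBERT-Central-Kurdish-BERT-Model"


def load_kubert(model_id=MODEL_ID):
    """Return (tokenizer, model) for masked-language modelling.

    Hypothetical helper: downloads the files from the Hub on first call.
    """
    # Imported lazily so the module can be imported without transformers installed.
    from transformers import AutoModelForMaskedLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForMaskedLM.from_pretrained(model_id)
    return tokenizer, model


def tokenize(text, tokenizer):
    """Split text into the subword tokens the model would see."""
    return tokenizer.tokenize(text)
```

Calling `tokenizer, model = load_kubert()` and then `tokenize(some_kurdish_text, tokenizer)` would show how the Kurdish-specific vocabulary segments a sentence, assuming the identifier above resolves.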

Model Capabilities

Text classification
Language understanding
Sentiment analysis

Use Cases

Natural Language Processing
Kurdish sentiment analysis
Analyzes the sentiment of Kurdish texts
Reported to perform well in low-resource settings
Text classification
Classifies Kurdish texts
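
The two use cases above could be approached by attaching a classification head to the pretrained encoder. The sketch below assumes the Hugging Face `transformers` library, an assumed Hub identifier, and a hypothetical two-class label set; none of these come from this page. A head created this way starts from random weights, so it would need fine-tuning on labelled Kurdish data before its predictions mean anything.

```python
import math

# Assumed Hugging Face Hub id; not given in the text above, verify before use.
MODEL_ID = "asosoft/KuBERT-Central-Kurdish-BERT-Model"
# Hypothetical label set for illustration only.
LABELS = ["negative", "positive"]


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def pick_label(logits, labels=LABELS):
    """Return the most probable label and its probability."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best], probs[best]


def build_classifier(model_id=MODEL_ID, labels=LABELS):
    """Load the encoder with a fresh, untrained classification head."""
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(
        model_id,
        num_labels=len(labels),
        id2label=dict(enumerate(labels)),
        label2id={name: i for i, name in enumerate(labels)},
    )
    return tokenizer, model
```

After fine-tuning, the logits for one input can be passed to `pick_label` as a plain list, for example `pick_label(outputs.logits[0].tolist())`.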