X

Xlm Roberta Large Ner Kazakh

Developed by yeshpanovrustem
A Kazakh named entity recognition model based on XLM-RoBERTa-large architecture, trained on the KazNERD dataset, supporting multiple entity type recognition
Downloads 99
Release Time : 5/19/2023

Model Overview

This model is specifically designed for named entity recognition tasks in Kazakh texts, capable of identifying various entity types such as geographical locations, personal names, and organization names

Model Features

High-precision Kazakh NER
Achieves F1 scores exceeding 96% on both validation and test sets, demonstrating excellent performance
Multi-category entity recognition
Supports recognition of various entity types including proverbs, artworks, cardinal numbers, contact information, dates, and more
Based on KazNERD dataset
Trained using a cleaned Kazakh named entity recognition dataset with high data quality

Model Capabilities

Kazakh text processing
Named entity recognition
Sequence labeling

Use Cases

Text analysis
News text analysis
Extracting key entity information from Kazakh news articles
Can accurately identify key information such as geographical locations and personal names
Business document processing
Analyzing key data in trade documents between Kazakhstan and the EU
Capable of recognizing key business entities such as amounts and dates
Academic research
Kazakh linguistic research
Supporting studies on Kazakh language structure and entity distribution
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase