X

Xlm Roberta Base Multilingual Text Genre Classifier

Developed by classla
The X-GENRE Classifier is a multilingual text genre classification model based on xlm-roberta-base, supporting automatic genre recognition for multiple languages.
Downloads 774
Release Time : 11/11/2022

Model Overview

This model has been fine-tuned on the manually annotated multilingual X-GENRE genre dataset and can be used to automatically identify text genres, suitable for any language text supported by xlm-roberta-base.

Model Features

Multilingual support
Supports text genre classification in multiple languages, suitable for any language text supported by xlm-roberta-base.
High performance
Performs better than other technologies, including GPT models, in the AGILE benchmark test.
Extensive genre coverage
Supports classification of 9 different text genres, including news, law, promotion, etc.

Model Capabilities

Multilingual text genre classification
Automatic genre recognition
Text classification

Use Cases

Text analysis
Genre annotation of large text collections
Automatically add genre information to large text collections for subsequent analysis and processing.
After post-processing, the performance reaches a macro F1 and micro F1 value of 0.92.
Multilingual text genre recognition
Recognize the genres of texts in multiple languages, supporting multiple languages such as Albanian, Catalan, Croatian, etc.
On the multilingual test dataset (X-GINCO), the macro F1 value is 0.847, and the micro F1 value is 0.845.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase