XLM-RoBERTa Base Language Detection

Developed by papluca
Multilingual detection model based on XLM-RoBERTa, supporting text classification in 20 languages
Downloads 2.7M
Release Time: 3/2/2022

Model Overview

This model is a version of XLM-RoBERTa fine-tuned on language identification datasets. Given a piece of text, it predicts which language the text is written in.

Model Features

High accuracy
Achieves an average accuracy of 99.6% on the test set
Multilingual support
Supports detection of 20 common languages
Based on XLM-RoBERTa
Uses the cross-lingual pre-trained XLM-RoBERTa model as its foundation

Model Capabilities

Text language identification
Multilingual text classification
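
As a rough illustration of text language identification, the sketch below wraps the model in a Hugging Face `transformers` text-classification pipeline. The Hub id `papluca/xlm-roberta-base-language-detection` is an assumption pieced together from the developer and model name above, and the helper names `load_classifier` and `detect_language` are hypothetical. Treat it as a minimal sketch rather than the developer's reference usage.

```python
from typing import Any, Callable, Dict, List, Tuple

# Assumed Hugging Face Hub id, inferred from the developer and model name.
MODEL_ID = "papluca/xlm-roberta-base-language-detection"

Classifier = Callable[..., List[Dict[str, Any]]]


def load_classifier() -> Classifier:
    """Build a text-classification pipeline (downloads weights on first call)."""
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    return pipeline("text-classification", model=MODEL_ID)


def detect_language(classifier: Classifier, text: str) -> Tuple[str, float]:
    """Return (language label, score) for the top prediction on one text."""
    # A one-element list in gives one {"label": ..., "score": ...} dict back.
    # truncation=True keeps long inputs within the model's maximum length.
    top = classifier([text], truncation=True)[0]
    return top["label"], float(top["score"])
```

With `transformers` and a backend such as PyTorch installed, `detect_language(load_classifier(), "Bonjour, comment allez-vous ?")` would return a label and score pair. The exact label strings depend on the model's configuration.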

Use Cases

Content classification
Multilingual website content classification
Automatically identifies the language category of user-submitted content
Average accuracy of 99.6% on the test set
Data preprocessing
Multilingual dataset preprocessing
Automatically identifies text language before NLP tasks
Makes subsequent processing more efficient, since each text can be sent to the matching language-specific step
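
The preprocessing use case can be sketched as a small bucketing step that runs before any language-specific NLP task. The function name `group_by_language`, the `min_score` threshold of 0.5, and the `"unknown"` bucket are all illustrative assumptions, not part of the model. `detect` stands for any callable that returns a `(label, score)` pair for a text, such as a thin wrapper around this model.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Tuple

Detector = Callable[[str], Tuple[str, float]]


def group_by_language(
    texts: Iterable[str],
    detect: Detector,
    min_score: float = 0.5,  # assumed cutoff; tune for your data
) -> Dict[str, List[str]]:
    """Bucket texts by detected language, preserving input order per bucket."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for text in texts:
        label, score = detect(text)
        # Low-confidence predictions go to a separate bucket for manual review.
        key = label if score >= min_score else "unknown"
        groups[key].append(text)
    return dict(groups)
```

Each bucket can then be handed to a tokenizer or model suited to that language. Keeping low-confidence items apart avoids silently misrouting short or mixed-language inputs.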