Langdetect

Developed by ERCDiDip
A language detection model fine-tuned from XLM-RoBERTa-base that classifies text into 41 modern and medieval languages
Downloads 6,687
Release Time: 11/25/2022

Model Overview

This model is designed for language detection and can identify 41 languages, both modern and medieval. It is suited to scenarios that require multilingual text classification.
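
The overview above can be tried with a minimal sketch along the following lines. It assumes the model is published on the Hugging Face Hub under the id `ERCDiDip/langdetect` and loads through the `transformers` text-classification pipeline; that id and the helper names `load_classifier` and `detect_language` are assumptions for illustration and are not stated on this page.

```python
from typing import Callable, Dict, List

# Assumed Hub id for this model; verify it before relying on it.
MODEL_ID = "ERCDiDip/langdetect"


def load_classifier(model_id: str = MODEL_ID):
    """Build a text-classification pipeline (downloads weights on first call)."""
    # Imported lazily so the helpers below work without `transformers` installed.
    from transformers import pipeline

    return pipeline("text-classification", model=model_id)


def detect_language(
    text: str, classifier: Callable[[str], List[Dict[str, object]]]
) -> Dict[str, object]:
    """Return the highest-scoring prediction as {"label": ..., "score": ...}."""
    predictions = classifier(text)
    # A pipeline returns a list of {"label", "score"} dicts for one input string.
    return max(predictions, key=lambda p: p["score"])


# Example use (requires network access and the `transformers` package):
#   clf = load_classifier()
#   print(detect_language("In nomine sancte et individue trinitatis", clf))
```

Passing the classifier in as an argument keeps the loading step separate, so the model is downloaded once and reused across calls.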

Model Features

Multilingual support
Supports detection of 41 modern and medieval languages, including some rare ancient languages
High accuracy
Achieves an average accuracy of 99.59% on test datasets
Based on XLM-RoBERTa
Fine-tuned from XLM-RoBERTa-base, a multilingual pretrained model with strong cross-lingual representation capabilities
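
The page does not say how the average accuracy figure was computed. One plausible reading is an unweighted mean of per-language accuracies; the sketch below shows that calculation so the figure can be checked against your own labelled texts. The function names are hypothetical, and the language codes in the usage comment are placeholders rather than the model's actual label set.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def per_language_accuracy(pairs: Iterable[Tuple[str, str]]) -> Dict[str, float]:
    """Map each gold language to the fraction of its samples predicted correctly."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for gold, predicted in pairs:
        total[gold] += 1
        if gold == predicted:
            correct[gold] += 1
    return {lang: correct[lang] / total[lang] for lang in total}


def average_accuracy(pairs: Iterable[Tuple[str, str]]) -> float:
    """Unweighted mean of the per-language accuracies."""
    scores = per_language_accuracy(pairs)
    if not scores:
        raise ValueError("no (gold, predicted) pairs given")
    return sum(scores.values()) / len(scores)


# Example use with placeholder codes:
#   average_accuracy([("la", "la"), ("la", "fr"), ("fr", "fr")])
```

An unweighted mean treats every language equally, so a rare medieval language with few test samples counts as much as a well-represented modern one.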

Model Capabilities

Text classification
Language detection
Multilingual processing

Use Cases

Document processing
Historical document language identification
Identifying the language of medieval documents
Accurately identifies medieval languages such as Old French and Latin
Multilingual content classification
Classifying texts containing multiple languages
Accurately distinguishes between all 41 supported languages
Academic research
Linguistic analysis
Helping linguistics researchers analyze the language features of texts
Provides high-precision language identification results
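
For texts containing multiple languages, one possible approach is to classify the text line by line, assuming the model assigns a single label to each input. The sketch below is illustrative only: `tag_segments`, the `min_score` threshold, and the `"unknown"` fallback tag are hypothetical choices, and `classifier` is any callable that returns a list of `{"label", "score"}` dicts, as a `transformers` text-classification pipeline does.

```python
from typing import Callable, Dict, List, Tuple

Prediction = Dict[str, object]


def tag_segments(
    document: str,
    classifier: Callable[[str], List[Prediction]],
    min_score: float = 0.5,
) -> List[Tuple[str, str]]:
    """Label each non-empty line; low-confidence lines are tagged "unknown"."""
    tagged = []
    for line in document.splitlines():
        segment = line.strip()
        if not segment:
            continue  # skip blank lines
        best = max(classifier(segment), key=lambda p: p["score"])
        label = best["label"] if best["score"] >= min_score else "unknown"
        tagged.append((segment, label))
    return tagged
```

Very short lines carry little signal, so in practice it may help to merge adjacent lines into longer segments before classifying them.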