D

Deoffxlmr Mono Tamil

Developed by Hate-speech-CNERG
This model is used to detect offensive content in Tamil code-mixed text, trained based on the XLM-Roberta-Base model, and performed excellently in the EACL 2021 Dravidian Language Offensive Language Identification Shared Task.
Downloads 100
Release Time : 3/2/2022

Model Overview

A monolingual model specifically designed to identify offensive content in Tamil (including pure text and code-mixed forms), using Transformer architecture, achieving high detection accuracy on specific datasets.

Model Features

Monolingual Focus Optimization
Specifically optimized for Tamil (including code-mixed forms), outperforming multilingual models in specific language tasks.
Integration Strategy Advantage
Utilized genetic algorithm integration techniques, achieving first place in the Tamil sub-task of the shared task.
Low-Resource Language Solution
Provides an effective solution for offensive content detection in low-resource languages such as Tamil.

Model Capabilities

Tamil Text Classification
Code-Mixed Text Processing
Offensive Content Recognition

Use Cases

Content Moderation
Social Media Content Filtering
Automatically detects offensive speech in Tamil social media
Achieved a weighted F1 score of 0.76 on the test set
Language Research
Dravidian Language Family Analysis
Used to study offensive language features in low-resource languages such as Tamil
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase