S

Sage M2m100 1.2B

Developed by ai-forever
A Russian spell checker trained on the M2M100-1.2B model for correcting spelling and typing errors
Downloads 184
Release Time : 3/11/2024

Model Overview

This model corrects spelling and typing errors by standardizing all words in the text to Russian norms. The training corpus uses a broad dataset containing 'artificial' errors, built from Russian Wikipedia and Russian video transcriptions.

Model Features

Multi-domain applicability
Performs well on various Russian datasets across different domains, including social media, medical, and technical texts
High-precision correction
Achieves 88.8% precision and 71.5% recall on the RUSpellRU dataset
Large model-based
Fine-tuned on the 1.2B-parameter M2M100 model, with strong language understanding capabilities

Model Capabilities

Russian spell checking
Typo correction
Text normalization

Use Cases

Text processing
Social media text correction
Corrects non-standard spellings and typos in social media content
Achieves an F1 score of 79.2 on the RUSpellRU dataset
Medical text standardization
Corrects spelling errors in professional medical terminology
Achieves an F1 score of 74.9 on the MedSpellchecker dataset
Technical document processing
Code comment correction
Corrects spelling errors in GitHub commit messages
Achieves an F1 score of 44.9 on the GitHubTypoCorpusRu dataset
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase