M

Macbert4csc Base Chinese

Developed by shibing624
MacBERT-based Chinese spelling correction model, achieving state-of-the-art performance on the SIGHAN2015 test set
Downloads 9,623
Release Time : 3/2/2022

Model Overview

This model focuses on detecting and correcting spelling errors in Chinese text, using an improved MacBERT architecture, suitable for various Chinese text proofreading scenarios

Model Features

Best Performance
Achieves character-level F1 score of 89.91 and sentence-level F1 score of 77.89 on the SIGHAN2015 test set, reaching current state-of-the-art performance
Improved Architecture
Improved MacBERT architecture based on softmaskedbert, optimizing model performance through MLM correction pre-training tasks
Comprehensive Training Data
Trained using SIGHAN+Wang271K Chinese correction dataset, containing 270,000 high-quality correction samples

Model Capabilities

Chinese Spelling Error Detection
Chinese Text Auto-correction
Typo Recognition and Correction

Use Cases

Text Proofreading
Daily Text Correction
Automatically corrects spelling errors in daily texts such as chats and emails
Example: 'Today my mood is good' → 'Today my mood is good'
Formal Document Proofreading
Assists in checking the accuracy of text in formal documents such as reports and papers
Educational Assistance
Chinese Learning Assistance
Helps Chinese learners identify and correct errors in their writing
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase