Macbert4csc Scalarmix Base Chinese
A masked language model fine-tuned from MacBERT for Chinese typo correction
Downloads 15
Release Time: 4/14/2022
Model Overview
This model is a masked language model fine-tuned from MacBERT to detect and correct typos in Chinese text. Its error detection capability has been enhanced by adjusting the loss weights and introducing ScalarMix layers.
Model Features
Improved Loss Weight Allocation
The loss weights of the MLM task and the binary error-detection classification task are set to 0.9:0.1, which is intended to improve how effectively the model learns
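The 0.9:0.1 weighting above can be illustrated with a minimal sketch. It assumes the two losses are combined as a simple weighted sum, which the model card does not state explicitly; the function name `combined_loss` and its parameters are hypothetical and chosen only for illustration.

```python
def combined_loss(
    mlm_loss: float,
    detection_loss: float,
    mlm_weight: float = 0.9,
    detection_weight: float = 0.1,
) -> float:
    """Return a weighted sum of the two training objectives.

    Assumption: the objectives are combined linearly. The 0.9 and 0.1
    defaults are the ratio quoted in the model card; everything else
    here is illustrative.
    """
    return mlm_weight * mlm_loss + detection_weight * detection_loss


# Example: an MLM (correction) loss of 2.0 and a detection loss of 1.0.
total = combined_loss(2.0, 1.0)
```

With this ratio the correction (MLM) objective dominates the gradient, while the detection objective acts as a smaller auxiliary signal.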
ScalarMix Layer Fusion
ScalarMix layers in the error detection task fuse the hidden-layer representations, so that detection does not rely solely on the deepest representations, which could otherwise hinder learning
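The idea behind ScalarMix can be sketched without any deep learning framework. The sketch below assumes the commonly used formulation, in which one learnable scalar per layer is passed through a softmax and the layer outputs are averaged with those weights, then scaled by a factor `gamma`. The model card does not say which variant this model uses, and the names `softmax` and `scalar_mix` are hypothetical helpers, not part of the model's code.

```python
import math
from typing import List, Sequence


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scalars."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scalar_mix(
    layers: Sequence[Sequence[float]],
    scalars: Sequence[float],
    gamma: float = 1.0,
) -> List[float]:
    """Fuse per-layer vectors for one token into a single vector.

    layers:  one hidden vector per encoder layer (all the same length).
    scalars: one mixing parameter per layer (learned during training
             in a real model; plain floats here).
    gamma:   global scale applied to the mixed vector.
    """
    if len(layers) != len(scalars):
        raise ValueError("need exactly one scalar per layer")
    weights = softmax(scalars)
    mixed = [0.0] * len(layers[0])
    for weight, layer in zip(weights, layers):
        for i, value in enumerate(layer):
            mixed[i] += weight * value
    return [gamma * value for value in mixed]


# Two toy "layers" with equal mixing scalars are simply averaged.
fused = scalar_mix([[1.0, 2.0], [3.0, 4.0]], scalars=[0.0, 0.0])
```

Because the weights are learned, the detection head can lean on shallower layers when they carry more useful signal than the final layer.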
Model Capabilities
Chinese Text Error Correction
Typo Detection
Automatic Text Correction
Use Cases
Text Processing
Chinese Document Proofreading
Automatically detect and correct typos in Chinese documents
Achieved 72% accuracy on the general corpus test set
Input Method Error Correction
Correct spelling errors in user input
Achieved 79.73% accuracy on the SIGHAN2015 test set