Macbert4csc Scalarmix Base Chinese

Developed by x180
A masked language model fine-tuned from MacBERT for Chinese typo correction
Downloads 15
Release Time: 4/14/2022

Model Overview

This model is a masked language model fine-tuned from MacBERT, designed to detect and correct typos in Chinese text. Its error detection is improved by two changes: reweighting the training losses and adding a ScalarMix layer.

Model Features

Improved Loss Weight Allocation
Sets the loss weights of the MLM task and the binary error-detection classification task to 0.9:0.1, so that training is driven mainly by the correction (MLM) objective
ScalarMix Layer Fusion
Adds a ScalarMix layer to the error detection task to fuse the hidden-layer representations, so that detection does not depend solely on the deepest layer's representation
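
The two techniques above can be illustrated with a minimal, framework-free sketch. It is not the author's training code. It assumes the common ScalarMix formulation (a softmax over learned per-layer scalars, a weighted sum of the layer outputs, and a global scale gamma) and a total loss that is a plain weighted sum of the two task losses. The function names, the toy vectors, and the choice of which layers are mixed are all hypothetical; the source does not specify them.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def scalar_mix(layer_vectors, scalar_params, gamma=1.0):
    """Fuse one token's hidden vectors from several layers into one vector.

    layer_vectors: one vector (list of floats) per encoder layer.
    scalar_params: one learned scalar per layer (assumed, normalized by softmax).
    gamma: learned global scale (assumed).
    """
    if len(layer_vectors) != len(scalar_params):
        raise ValueError("need exactly one scalar per layer")
    weights = softmax(scalar_params)
    mixed = [0.0] * len(layer_vectors[0])
    for w, vec in zip(weights, layer_vectors):
        for i, v in enumerate(vec):
            mixed[i] += w * v
    return [gamma * x for x in mixed]


def combined_loss(mlm_loss, detection_loss, mlm_weight=0.9, detection_weight=0.1):
    """Weighted sum of the two task losses, using the 0.9:0.1 ratio from the text."""
    return mlm_weight * mlm_loss + detection_weight * detection_loss


# Toy example: three "layers", two-dimensional hidden vectors.
layers = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
mixed = scalar_mix(layers, scalar_params=[0.0, 0.0, 0.0])  # equal weights
total = combined_loss(mlm_loss=2.0, detection_loss=1.0)
```

With equal scalars, every layer contributes one third of the fused vector; during training the scalars would be learned, letting the detection head draw more on shallower or deeper layers as needed.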

Model Capabilities

Chinese Text Error Correction
Typo Detection
Automatic Text Correction

Use Cases

Text Processing
Chinese Document Proofreading
Automatically detect and correct typos in Chinese documents
Achieved 72% accuracy on the general corpus test set
Input Method Error Correction
Correct spelling errors in user input
Achieved 79.73% accuracy on the SIGHAN2015 test set