Chinese Bigbird Wwm Base 4096
Chinese pre-trained model based on the BigBird architecture, employing Whole Word Masking (WWM) strategy, supporting a 4096-length context window
Downloads 13
Release Time : 3/2/2022
Model Overview
This model is a Chinese pre-trained language model based on Google's BigBird architecture, trained using the Whole Word Masking (WWM) strategy, particularly excelling in long-text sequence tasks.
Model Features
Long Text Processing Capability
Supports a 4096-token context window, making it particularly suitable for long-document tasks
Whole Word Masking Pre-training
Employs Whole Word Masking (WWM) strategy, better suited for Chinese language characteristics
Sparse Attention Mechanism
Based on BigBird's sparse attention mechanism, maintaining efficiency in long-sequence tasks
Model Capabilities
Text Classification
Named Entity Recognition
Question Answering Systems
Text Summarization
Long Document Understanding
Use Cases
Legal Document Processing
Contract Analysis
Processing and analyzing lengthy legal contract texts
Accurately identifies key contract clauses and entities
Medical Text Processing
Medical Record Analysis
Processing lengthy electronic medical record texts
Extracts key medical entities and diagnostic information
Featured Recommended AI Models