Chinese ELECTRA Large Generator Open-Source Model - A Powerful Chinese Pre-trained Tool with Fewer Parameters but Superior Performance

Chinese Electra Large Generator

Developed by hfl

Chinese ELECTRA is a pre-trained model developed by the HIT-iFLYTEK Joint Lab based on Google's ELECTRA model, featuring a small parameter size but superior performance.

Large Language Model

Transformers

ChineseOpen Source License:Apache-2.0 #Efficient Pre-training #Lightweight Model #Chinese NLP

Downloads 14

Release Time : 3/2/2022

Model Overview

This model adopts the ELECTRA architecture and is pre-trained using a generator-discriminator mechanism, making it suitable for various Chinese natural language processing tasks.

Model Features

Efficient Pre-training

Adopts the ELECTRA architecture, offering higher training efficiency compared to BERT.

Parameter Efficiency

ELECTRA-small achieves comparable performance with only 1/10 the parameters of BERT and its variants.

Chinese Optimization

Specifically optimized for Chinese natural language processing tasks.

Model Capabilities

Text Understanding

Text Generation

Semantic Analysis

Use Cases

Natural Language Processing

Text Classification

Can be used for tasks such as sentiment analysis and news classification.

Question Answering System

Can be used to build Chinese question answering systems.

🚀 Chinese ELECTRA

Google and Stanford University introduced ELECTRA, a pre - trained model. It has a more compact size and competitive performance compared to BERT and its variants. The HIT and iFLYTEK Research (HFL) Joint Laboratory released Chinese ELECTRA models to speed up Chinese pre - trained model research.

🚀 Quick Start

Please use ElectraForPreTraining for discriminator and ElectraForMaskedLM for generator if you are re - training these models.

Google and Stanford University released a new pre - trained model called ELECTRA, which has a much compact model size and relatively competitive performance compared to BERT and its variants. For further accelerating the research of the Chinese pre - trained model, the Joint Laboratory of HIT and iFLYTEK Research (HFL) has released the Chinese ELECTRA models based on the official code of ELECTRA. ELECTRA - small could reach similar or even higher scores on several NLP tasks with only 1/10 parameters compared to BERT and its variants.

This project is based on the official code of ELECTRA: [https://github.com/google - research/electra](https://github.com/google - research/electra)

You may also interested in:

Chinese BERT series: https://github.com/ymcui/Chinese - BERT - wwm
Chinese ELECTRA: https://github.com/ymcui/Chinese - ELECTRA
Chinese XLNet: https://github.com/ymcui/Chinese - XLNet
Knowledge Distillation Toolkit - TextBrewer: https://github.com/airaria/TextBrewer

More resources by HFL: https://github.com/ymcui/HFL - Anthology

📚 Documentation

Citation

If you find our resource or paper is useful, please consider including the following citation in your paper.

https://arxiv.org/abs/2004.13922

@inproceedings{cui-etal-2020-revisiting,
    title = "Revisiting Pre-Trained Models for {C}hinese Natural Language Processing",
    author = "Cui, Yiming  and
      Che, Wanxiang  and
      Liu, Ting  and
      Qin, Bing  and
      Wang, Shijin  and
      Hu, Guoping",
    booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings",
    month = nov,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.findings-emnlp.58",
    pages = "657--668",
}

📄 License

This project is licensed under the Apache 2.0 license.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご