Randeng-T5-77M
A Chinese version of mT5-small, designed for Natural Language Transformation (NLT) tasks.
🚀 Quick Start
Prerequisites
Make sure you have installed the transformers and torch libraries.
Installation
pip install transformers torch
Usage Example
from transformers import T5ForConditionalGeneration, AutoTokenizer
import torch
tokenizer = AutoTokenizer.from_pretrained('IDEA-CCNL/Randeng-T5-77M', use_fast=False)
model = T5ForConditionalGeneration.from_pretrained('IDEA-CCNL/Randeng-T5-77M')
⨠Features
- Chinese Adaptation: Specifically tailored for Chinese language processing, making it highly effective in Chinese NLT tasks.
- Efficient Training: Utilizes Corpus-Adaptive Pre-Training (CAPT) on the WuDao Corpora (180 GB version) to accelerate the training process.
📦 Installation
pip install transformers torch
💻 Usage Examples
Basic Usage
from transformers import T5ForConditionalGeneration, AutoTokenizer
import torch
tokenizer = AutoTokenizer.from_pretrained('IDEA-CCNL/Randeng-T5-77M', use_fast=False)
model = T5ForConditionalGeneration.from_pretrained('IDEA-CCNL/Randeng-T5-77M')
# Span-corruption style input: the model fills in the <extra_id_N> sentinel spans
input_text = "北京有悠久的<extra_id_0>和<extra_id_1>。"  # "Beijing has a long <extra_id_0> and <extra_id_1>."
input_ids = tokenizer(input_text, return_tensors='pt').input_ids
outputs = model.generate(input_ids)
output_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(output_text)
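The call above uses the default greedy decoding. If you want more control over the filled spans, you can pass standard generation arguments to model.generate; the snippet below is a minimal sketch on top of the example above, and the beam-search settings (num_beams, max_new_tokens) are illustrative values rather than recommendations from this card.
# Sketch: beam search with an explicit cap on output length (illustrative settings)
outputs = model.generate(
    input_ids,
    num_beams=4,          # keep 4 candidate sequences during the search
    max_new_tokens=32,    # limit the number of generated tokens
    early_stopping=True,  # stop once enough complete candidates are found
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))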
📚 Documentation
Model Taxonomy
| Property | Details |
| --- | --- |
| Demand | General |
| Task | Natural Language Transformation (NLT) |
| Series | Randeng |
| Model | mT5 |
| Parameter | 77M |
| Extra | Chinese |
Model Information
Based on mT5-small, we implement its Chinese version. To accelerate training, we retrain only the vocabulary and embeddings corresponding to Chinese and English in the T5 tokenizer (SentencePiece), and apply Corpus-Adaptive Pre-Training (CAPT) on the WuDao Corpora (180 GB version). The pretraining objective is span corruption. We use the fengshen framework for the pre-training phase, which took about 24 hours on 8 A100 GPUs.
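As an illustration of the span-corruption objective, the model sees an input in which contiguous spans have been replaced by sentinel tokens (<extra_id_0>, <extra_id_1>, ...) and is trained to emit the removed spans after the matching sentinels. The pair below is a minimal sketch with an invented sentence; it is not taken from the WuDao Corpora.
# Span corruption, illustrated with a made-up sentence (not real training data)
corrupted_input = "北京有悠久的<extra_id_0>和<extra_id_1>。"   # "Beijing has a long <...> and <...>."
target = "<extra_id_0>历史<extra_id_1>文化<extra_id_2>"         # removed spans ("history", "culture"), closed by a final sentinel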
📄 License
This project is licensed under the Apache-2.0 license.
📖 Citation
If you are using this resource for your work, please cite our paper:
@article{fengshenbang,
author = {Jiaxing Zhang and Ruyi Gan and Junjie Wang and Yuxiang Zhang and Lin Zhang and Ping Yang and Xinyu Gao and Ziwei Wu and Xiaoqun Dong and Junqing He and Jianheng Zhuo and Qi Yang and Yongfeng Huang and Xiayu Li and Yanghan Wu and Junyu Lu and Xinyu Zhu and Weifeng Chen and Ting Han and Kunhao Pan and Rui Wang and Hao Wang and Xiaojun Wu and Zhongshen Zeng and Chongpei Chen},
title = {Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence},
journal = {CoRR},
volume = {abs/2209.02970},
year = {2022}
}
You can also cite our website:
@misc{Fengshenbang-LM,
title={Fengshenbang-LM},
author={IDEA-CCNL},
year={2021},
howpublished={\url{https://github.com/IDEA-CCNL/Fengshenbang-LM}},
}