🚀 RoFormer Project
RoFormer is an enhanced Transformer that replaces absolute position embeddings with Rotary Position Embedding (RoPE), and it is available in both TensorFlow and PyTorch implementations for natural language processing tasks.
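The rotation is applied to the query and key vectors inside self-attention: each consecutive pair of dimensions is rotated by an angle proportional to the token's position, so attention scores end up depending only on relative positions. A minimal sketch of the idea in PyTorch (illustrative only, not the transformers library's internal implementation; the base of 10000 follows the paper):

import torch

def rotary_position_embedding(x, base=10000.0):
    """Apply RoPE to a tensor of shape (seq_len, dim): rotate each
    consecutive (even, odd) dimension pair by a position-dependent angle."""
    seq_len, dim = x.shape
    # One frequency per dimension pair, as in the RoFormer paper.
    inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2).float() / dim))
    angles = torch.outer(torch.arange(seq_len).float(), inv_freq)  # (seq_len, dim/2)
    cos, sin = angles.cos(), angles.sin()
    x_even, x_odd = x[:, 0::2], x[:, 1::2]
    rotated = torch.empty_like(x)
    rotated[:, 0::2] = x_even * cos - x_odd * sin  # standard 2-D rotation per pair
    rotated[:, 1::2] = x_even * sin + x_odd * cos
    return rotated

# Attention scores between rotated queries and keys depend only on position offsets.
q, k = torch.randn(8, 64), torch.randn(8, 64)
scores = rotary_position_embedding(q) @ rotary_position_embedding(k).T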
🚀 Quick Start
Two implementations of RoFormer are available: the original TensorFlow version, and a version that supports both PyTorch and TensorFlow 2.0.
TensorFlow Version
You can access the TensorFlow version of RoFormer at https://github.com/ZhuiyiTechnology/roformer.
PyTorch and TensorFlow 2.0 Version
The PyTorch and TensorFlow 2.0 implementation is available at https://github.com/JunnYu/RoFormer_pytorch.
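The usage examples below go through the Hugging Face transformers library with the junnyu/roformer_chinese_char_small checkpoint. As a quick smoke test, here is a minimal sketch (not part of the original repositories) that just encodes a sentence with that checkpoint; RoFormerModel returns hidden states rather than masked-LM logits:

import torch
from transformers import RoFormerModel, RoFormerTokenizer

# Same checkpoint as the masked-LM examples below; any RoFormer checkpoint
# on the Hub should load the same way.
tokenizer = RoFormerTokenizer.from_pretrained("junnyu/roformer_chinese_char_small")
model = RoFormerModel.from_pretrained("junnyu/roformer_chinese_char_small")

# "The weather is nice today."
inputs = tokenizer("今天天气很好。", return_tensors="pt")
with torch.no_grad():
    hidden_states = model(**inputs).last_hidden_state  # (batch, seq_len, hidden_size)
print(hidden_states.shape)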
💻 Usage Examples
Basic Usage in PyTorch
import torch
from transformers import RoFormerForMaskedLM, RoFormerTokenizer

# Chinese input with two [MASK] tokens:
# "The [MASK] is nice today, I [MASK] go to the park to play."
text = "今天[MASK]很好,我[MASK]去公园玩。"
tokenizer = RoFormerTokenizer.from_pretrained("junnyu/roformer_chinese_char_small")
pt_model = RoFormerForMaskedLM.from_pretrained("junnyu/roformer_chinese_char_small")
pt_inputs = tokenizer(text, return_tensors="pt")
with torch.no_grad():
    pt_outputs = pt_model(**pt_inputs).logits[0]

# Rebuild the sentence, replacing each [MASK] with its top-5 predicted tokens.
pt_outputs_sentence = "pytorch: "
for i, id in enumerate(tokenizer.encode(text)):
    if id == tokenizer.mask_token_id:
        tokens = tokenizer.convert_ids_to_tokens(pt_outputs[i].topk(k=5)[1])
        pt_outputs_sentence += "[" + "||".join(tokens) + "]"
    else:
        pt_outputs_sentence += "".join(
            tokenizer.convert_ids_to_tokens([id], skip_special_tokens=True))
print(pt_outputs_sentence)
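For quick experiments, the same checkpoint can also be driven through transformers' fill-mask pipeline. A sketch (not from the original project; a single [MASK] is used so the result is a flat list of candidates, since multi-mask handling differs across transformers versions):

from transformers import pipeline

fill_mask = pipeline(
    "fill-mask",
    model="junnyu/roformer_chinese_char_small",
    tokenizer="junnyu/roformer_chinese_char_small",
)
# "The weather today is very [MASK]." -- print the top-5 candidate characters and scores.
for candidate in fill_mask("今天天气很[MASK]。", top_k=5):
    print(candidate["token_str"], candidate["score"])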
Basic Usage in TensorFlow 2.0
import tensorflow as tf
from transformers import RoFormerTokenizer, TFRoFormerForMaskedLM

# Same two-[MASK] Chinese input as in the PyTorch example above.
text = "今天[MASK]很好,我[MASK]去公园玩。"
tokenizer = RoFormerTokenizer.from_pretrained("junnyu/roformer_chinese_char_small")
tf_model = TFRoFormerForMaskedLM.from_pretrained("junnyu/roformer_chinese_char_small")
tf_inputs = tokenizer(text, return_tensors="tf")
tf_outputs = tf_model(**tf_inputs, training=False).logits[0]

# Rebuild the sentence, replacing each [MASK] with its top-5 predicted tokens.
tf_outputs_sentence = "tf2.0: "
for i, id in enumerate(tokenizer.encode(text)):
    if id == tokenizer.mask_token_id:
        tokens = tokenizer.convert_ids_to_tokens(
            tf.math.top_k(tf_outputs[i], k=5)[1])
        tf_outputs_sentence += "[" + "||".join(tokens) + "]"
    else:
        tf_outputs_sentence += "".join(
            tokenizer.convert_ids_to_tokens([id], skip_special_tokens=True))
print(tf_outputs_sentence)
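Because both examples load the same checkpoint, their logits should agree up to small numerical differences between frameworks. A quick consistency check (a sketch that assumes pt_outputs and tf_outputs from the two examples above are still in scope):

import numpy as np

# Maximum absolute difference between the PyTorch and TF 2.0 logits;
# small framework-level numerical drift is expected.
print(np.abs(pt_outputs.numpy() - tf_outputs.numpy()).max())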
📄 Citation
If you use this project in your research, please cite it using the following BibTeX entry:
@misc{su2021roformer,
title={RoFormer: Enhanced Transformer with Rotary Position Embedding},
author={Jianlin Su and Yu Lu and Shengfeng Pan and Bo Wen and Yunfeng Liu},
year={2021},
eprint={2104.09864},
archivePrefix={arXiv},
primaryClass={cs.CL}
}