klue-bert-base-aihub-mrc开源模型 - 免费实现韩语机器阅读理解应用

首页

Klue Bert Base Aihub Mrc

由 bespin-global 开发

基于KLUE BERT-base微调的韩语机器阅读理解模型，使用AIHub数据集训练

问答系统

Transformers

韩语#韩语阅读理解 #高精度答案抽取 #AIHub微调

下载量 29

发布时间 : 3/2/2022

模型简介

该模型是针对韩语机器阅读理解任务优化的BERT模型，能够从给定文本中提取问题答案。

模型特点

韩语优化

基于KLUE BERT-base模型，专门针对韩语特性优化

大规模训练数据

使用AIHub 35m条韩语阅读理解数据训练

精确答案定位

可返回答案文本及其在原文中的精确位置偏移量

模型能力

韩语文本理解

问题回答

答案位置定位

使用案例

智能客服

产品信息查询

从产品文档中自动回答用户问题

准确提取相关产品规格信息

教育

阅读理解辅助

帮助学生理解课文并回答问题

提供准确答案及原文位置参考

🚀 基于BERT的机器阅读理解模型

本项目是一个基于BERT的机器阅读理解（MRC）模型，使用AIHub数据集进行微调，可用于回答问题。通过该模型，用户能够在给定的上下文中准确找到问题的答案，具有良好的性能和应用价值。

🚀 快速开始

演示地址

https://huggingface.co/spaces/bespin-global/Bespin-QuestionAnswering

✨ 主要特性

使用预训练的klue/bert-base模型进行微调。
采用AIHub的机器阅读理解数据集进行训练，包含标准数据集和可解释数据集。
提供了详细的训练参数和使用示例。

📦 安装指南

文档未提供安装步骤，故跳过该章节。

💻 使用示例

基础用法

## Load Transformers library
import torch
from transformers import AutoModelForQuestionAnswering, AutoTokenizer

device = torch.device('cuda') if torch.cuda.is_available() else torch.device('cpu')

def predict_answer(qa_text_pair):
    # Encoding
    encodings = tokenizer(context, question, 
                      max_length=512, 
                      truncation=True,
                      padding="max_length", 
                      return_token_type_ids=False,
                      return_offsets_mapping=True
                      )
    encodings = {key: torch.tensor([val]).to(device) for key, val in encodings.items()}             

    # Predict
    pred = model(encodings["input_ids"], attention_mask=encodings["attention_mask"])
    start_logits, end_logits = pred.start_logits, pred.end_logits
    token_start_index, token_end_index = start_logits.argmax(dim=-1), end_logits.argmax(dim=-1)
    pred_ids = encodings["input_ids"][0][token_start_index: token_end_index + 1]
    answer_text = tokenizer.decode(pred_ids)

    # Offset
    answer_start_offset = int(encodings['offset_mapping'][0][token_start_index][0][0])
    answer_end_offset = int(encodings['offset_mapping'][0][token_end_index][0][1])
    answer_offset = (answer_start_offset, answer_end_offset)
 
    return {'answer_text':answer_text, 'answer_offset':answer_offset}


## Load fine-tuned MRC model by HuggingFace Model Hub ##
HUGGINGFACE_MODEL_PATH = "bespin-global/klue-bert-base-aihub-mrc"
tokenizer = AutoTokenizer.from_pretrained(HUGGINGFACE_MODEL_PATH)
model = AutoModelForQuestionAnswering.from_pretrained(HUGGINGFACE_MODEL_PATH).to(device)


## Predict ## 
context = '''애플 M2(Apple M2)는 애플이 설계한 중앙 처리 장치(CPU)와 그래픽 처리 장치(GPU)의 ARM 기반 시스템이다. 
인텔 코어(Intel Core)에서 맥킨토시 컴퓨터용으로 설계된 2세대 ARM 아키텍처이다. 애플은 2022년 6월 6일 WWDC에서 맥북 에어, 13인치 맥북 프로와 함께 M2를 발표했다. 
애플 M1의 후속작이다. M2는 TSMC의 '향상된 5나노미터 기술' N5P 공정으로 만들어졌으며, 이전 세대 M1보다 25% 증가한 200억개의 트랜지스터를 포함하고 있으며, 최대 24기가바이트의 RAM과 2테라바이트의 저장공간으로 구성할 수 있다. 
8개의 CPU 코어(성능 4개, 효율성 4개)와 최대 10개의 GPU 코어를 가지고 있다. M2는 또한 메모리 대역폭을 100 GB/s로 증가시킨다. 
애플은 기존 M1 대비 CPU가 최대 18%, GPU가 최대 35% 향상됐다고 주장하고 있으며,[1] 블룸버그통신은 M2맥스에 CPU 코어 12개와 GPU 코어 38개가 포함될 것이라고 보도했다.'''
question = "m2가 m1에 비해 얼마나 좋아졌어?"

qa_text_pair = {'context':context, 'question':question}
result = predict_answer(qa_text_pair)
print('Answer Text: ', result['answer_text'])  # 기존 M1 대비 CPU가 최대 18 %, GPU가 최대 35 % 향상
print('Answer Offset: ', result['answer_offset'])  # (410, 446)

📚 详细文档

微调信息

预训练模型：klue/bert-base
微调数据集：AIHub机器阅读理解数据集
- 标准数据集(25m) + 可解释数据集(10m)
- 随机采样 (随机种子: 1234)
  - 训练集：30m
  - 测试集：5m
训练参数

{
    "epochs": 4,
    "batch_size":8,
    "optimizer_class": "<class 'transformers.optimization.AdamW'>",
    "optimizer_params": {
        "lr": 3e-05
    },
    "weight_decay": 0.01
}