Bespin Question Answering Model
This project is a fine-tuned question-answering model based on BERT, aiming to provide high-quality question-answering services using the AIHub dataset.
Quick Start
You can try out the model through the following demo link:
Features
- Model Type: Fine-tuned BERT-based question-answering model.
- Training Data: AIHub machine reading comprehension dataset.
Installation
The usage example below requires Python with PyTorch and the Hugging Face Transformers library.
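Assuming a standard pip-based environment (exact versions are not specified in the original card), a minimal setup is:

pip install torch transformers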
Usage Examples
Basic Usage
import torch
from transformers import AutoModelForQuestionAnswering, AutoTokenizer

device = torch.device('cuda') if torch.cuda.is_available() else torch.device('cpu')

def predict_answer(qa_text_pair):
    context, question = qa_text_pair['context'], qa_text_pair['question']

    # Encode the (context, question) pair, keeping character offsets for each token.
    encodings = tokenizer(context, question,
                          max_length=512,
                          truncation=True,
                          padding="max_length",
                          return_token_type_ids=False,
                          return_offsets_mapping=True)
    encodings = {key: torch.tensor([val]).to(device) for key, val in encodings.items()}

    # Predict the answer span from the start/end logits.
    pred = model(encodings["input_ids"], attention_mask=encodings["attention_mask"])
    start_logits, end_logits = pred.start_logits, pred.end_logits
    token_start_index, token_end_index = start_logits.argmax(dim=-1), end_logits.argmax(dim=-1)
    pred_ids = encodings["input_ids"][0][token_start_index: token_end_index + 1]
    answer_text = tokenizer.decode(pred_ids)

    # Map the predicted token indices back to character offsets in the context.
    answer_start_offset = int(encodings['offset_mapping'][0][token_start_index][0][0])
    answer_end_offset = int(encodings['offset_mapping'][0][token_end_index][0][1])
    answer_offset = (answer_start_offset, answer_end_offset)

    return {'answer_text': answer_text, 'answer_offset': answer_offset}

# Load the fine-tuned model and tokenizer from the Hugging Face Hub.
HUGGINGFACE_MODEL_PATH = "bespin-global/klue-bert-base-aihub-mrc"
tokenizer = AutoTokenizer.from_pretrained(HUGGINGFACE_MODEL_PATH)
model = AutoModelForQuestionAnswering.from_pretrained(HUGGINGFACE_MODEL_PATH).to(device)
# Example context (Korean): an encyclopedia-style passage about the Apple M2 chip.
context = '''애플 M2(Apple M2)는 애플이 설계한 중앙 처리 장치(CPU)와 그래픽 처리 장치(GPU)의 ARM 기반 시스템이다.
인텔 코어(Intel Core)에서 맥킨토시 컴퓨터용으로 설계된 2세대 ARM 아키텍처이다. 애플은 2022년 6월 6일 WWDC에서 맥북 에어, 13인치 맥북 프로와 함께 M2를 발표했다.
애플 M1의 후속작이다. M2는 TSMC의 '향상된 5나노미터 기술' N5P 공정으로 만들어졌으며, 이전 세대 M1보다 25% 증가한 200억개의 트랜지스터를 포함하고 있으며, 최대 24기가바이트의 RAM과 2테라바이트의 저장공간으로 구성할 수 있다.
8개의 CPU 코어(성능 4개, 효율성 4개)와 최대 10개의 GPU 코어를 가지고 있다. M2는 또한 메모리 대역폭을 100 GB/s로 증가시킨다.
애플은 기존 M1 대비 CPU가 최대 18%, GPU가 최대 35% 향상된다고 주장하고 있으며,[1] 블룸버그통신은 M2맥스에 CPU 코어 12개와 GPU 코어 38개가 포함될 것이라고 보도했다.'''

# Example question (Korean): "How much better is the M2 compared to the M1?"
question = "m2가 m1에 비해 얼마나 좋아졌어?"
qa_text_pair = {'context':context, 'question':question}
result = predict_answer(qa_text_pair)
print('Answer Text: ', result['answer_text'])
print('Answer Offset: ', result['answer_offset'])
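Alternatively, the same checkpoint can be used through the generic question-answering pipeline in Transformers. This is a minimal sketch, not part of the original card; the pipeline's built-in post-processing may yield slightly different spans than the manual decoding above, and the variable names are only illustrative.

from transformers import pipeline

# Load the checkpoint into the standard extractive QA pipeline.
qa_pipeline = pipeline(
    "question-answering",
    model="bespin-global/klue-bert-base-aihub-mrc",
    tokenizer="bespin-global/klue-bert-base-aihub-mrc",
)

# Reuse the `context` and `question` defined above.
pipeline_result = qa_pipeline(question=question, context=context)
print(pipeline_result)  # dict with 'answer', 'score', 'start', 'end'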
Documentation
Fine-tuning
The model was fine-tuned with the following hyperparameters:
{
    "epochs": 4,
    "batch_size": 8,
    "optimizer_class": "<class 'transformers.optimization.AdamW'>",
    "optimizer_params": {
        "lr": 3e-05
    },
    "weight_decay": 0.01
}
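The card reports only these hyperparameters, not the training script itself. As a rough, hypothetical sketch of how they could map onto a standard Transformers setup (output_dir is an assumption; the Trainer API uses AdamW by default, matching the optimizer_class above):

from transformers import TrainingArguments

# Hypothetical mapping of the reported hyperparameters; not the original training code.
training_args = TrainingArguments(
    output_dir="klue-bert-base-aihub-mrc",  # assumed output path
    num_train_epochs=4,                     # "epochs": 4
    per_device_train_batch_size=8,          # "batch_size": 8
    learning_rate=3e-5,                     # "lr": 3e-05 (AdamW)
    weight_decay=0.01,                      # "weight_decay": 0.01
)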
Technical Details
The model performs extractive question answering: given a context passage and a question, it predicts the start and end positions of the answer span and returns both the decoded answer text and its character offsets within the context (see the usage example above). As the model name suggests, it is based on the KLUE BERT-base checkpoint and fine-tuned on the AIHub machine reading comprehension dataset, with inputs truncated to 512 tokens and the hyperparameters listed under Fine-tuning.
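As a quick illustration of what the returned offsets mean, slicing the original context with them recovers the answer span. This reuses the `context` and `result` variables from the usage example above; the decoded text may differ slightly in whitespace due to tokenizer normalization.

# Illustrative check, assuming `context` and `result` from the usage example above.
start, end = result['answer_offset']
print(context[start:end])  # should closely match result['answer_text']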
License
This project is licensed under the CC BY-NC 4.0 license.
Citing & Authors
You can find more information about the authors here:
Jaehyeong at Bespin Global