Kobart-news: An Open-source Korean News Abstract Generation Model - Efficiently Extract Key Information from Long

Kobart News

Developed by ainize

A Korean news summarization model fine-tuned based on the KoBART framework, suitable for extracting key information from long news articles to generate concise summaries.

Text Generation

Transformers

Korean

Open Source License: MIT

#Korean News Summarization #BART Fine-tuning #Business Data Analysis

Downloads 1,241

Release Time: 3/2/2022

Model Overview

This model is specifically optimized for Korean news articles and can automatically generate summaries that retain core information. It is a sequence-to-sequence model based on the BART architecture, fine-tuned using the Korean AI Hub's news article summarization dataset.

Model Features

Korean Language Optimization

Specially optimized for Korean grammar and news writing style.

Domain Adaptation

Fine-tuned using professional news datasets, with good adaptability to various news genres.

Multi-Length Control

Supports adjusting the length range of generated summaries through parameters.
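As a minimal sketch of what this length control could look like, the helper below collects the length-related arguments that the usage example in this card passes to `model.generate` (`min_length`, `max_length`, `length_penalty`, `num_beams`). The function name `summary_length_kwargs` and its validation rules are assumptions made for illustration and are not part of the model's API; the defaults mirror the values from the usage example.

```python
def summary_length_kwargs(min_length=56, max_length=142, length_penalty=2.0, num_beams=4):
    """Build keyword arguments for `model.generate` that bound summary length.

    Hypothetical helper for illustration. Lengths are counted in tokens,
    not characters. With beam search, a positive length_penalty favors
    longer sequences.
    """
    if min_length < 0:
        raise ValueError("min_length must be non-negative")
    if max_length <= min_length:
        raise ValueError("max_length must be greater than min_length")
    return {
        "min_length": min_length,
        "max_length": max_length,
        "length_penalty": length_penalty,
        "num_beams": num_beams,
    }


# Shorter summaries: tighten both bounds (values chosen for illustration only).
short_kwargs = summary_length_kwargs(min_length=20, max_length=64)
print(short_kwargs)
```

The resulting dictionary can be unpacked into the generation call, for example `model.generate(input_ids=input_ids, **short_kwargs)`.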

Model Capabilities

Korean Text Understanding

Key Information Extraction

Coherent Summary Generation

Multi-Paragraph Text Processing

Use Cases

Media Industry

News Brief Generation

Automatically generates concise key-point summaries for long news reports.

Saves editorial time and improves content distribution efficiency.

Business Intelligence

Business Report Summarization

Extracts key market trend information from large volumes of industry reports.

Helps decision-makers quickly grasp core business intelligence.

🚀 kobart-news

This model is a fine-tuned version of KoBART, trained on the 문서요약 텍스트/신문기사 (Document Summarization Text / Newspaper Articles) dataset using [Ainize Teachable-NLP](https://ainize.ai/teachable-nlp). It provides summarization for Korean news articles.

🚀 Quick Start

✨ Features

The model is a fine-tuned KoBART model, trained on the 문서요약 텍스트/신문기사 dataset with [Ainize Teachable-NLP](https://ainize.ai/teachable-nlp), and can be used for text summarization tasks.

📦 Installation

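The original model card does not list installation steps. The usage example below imports from the `transformers` library and requests PyTorch tensors (`return_tensors="pt"`), so both packages need to be available in your environment.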

💻 Usage Examples

Basic Usage

from transformers import PreTrainedTokenizerFast, BartForConditionalGeneration
# Load the tokenizer and the fine-tuned model
tokenizer = PreTrainedTokenizerFast.from_pretrained("ainize/kobart-news")
model = BartForConditionalGeneration.from_pretrained("ainize/kobart-news")
# Encode Input Text
input_text = '국내 전반적인 경기침체로 상가 건물주의 수익도 전국적인 감소세를 보이고 있는 것으로 나타났다. 수익형 부동산 연구개발기업 상가정보연구소는 한국감정원 통계를 분석한 결과 전국 중대형 상가 순영업소득(부동산에서 발생하는 임대수입, 기타수입에서 제반 경비를 공제한 순소득)이 1분기 ㎡당 3만4200원에서 3분기 2만5800원으로 감소했다고 17일 밝혔다. 수도권, 세종시, 지방광역시에서 순영업소득이 가장 많이 감소한 지역은 3분기 1만3100원을 기록한 울산으로, 1분기 1만9100원 대비 31.4% 감소했다. 이어 대구(-27.7%), 서울(-26.9%), 광주(-24.9%), 부산(-23.5%), 세종(-23.4%), 대전(-21%), 경기(-19.2%), 인천(-18.5%) 순으로 감소했다. 지방 도시의 경우도 비슷했다. 경남의 3분기 순영업소득은 1만2800원으로 1분기 1만7400원 대비 26.4% 감소했으며 제주(-25.1%), 경북(-24.1%), 충남(-20.9%), 강원(-20.9%), 전남(-20.1%), 전북(-17%), 충북(-15.3%) 등도 감소세를 보였다. 조현택 상가정보연구소 연구원은 "올해 내수 경기의 침체된 분위기가 유지되며 상가, 오피스 등을 비롯한 수익형 부동산 시장의 분위기도 경직된 모습을 보였고 오피스텔, 지식산업센터 등의 수익형 부동산 공급도 증가해 공실의 위험도 늘었다"며 "실제 올 3분기 전국 중대형 상가 공실률은 11.5%를 기록하며 1분기 11.3% 대비 0.2% 포인트 증가했다"고 말했다. 그는 "최근 소셜커머스(SNS를 통한 전자상거래), 음식 배달 중개 애플리케이션, 중고 물품 거래 애플리케이션 등의 사용 증가로 오프라인 매장에 영향을 미쳤다"며 "향후 지역, 콘텐츠에 따른 상권 양극화 현상은 심화될 것으로 보인다"고 덧붙였다.'
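input_ids = tokenizer.encode(input_text, return_tensors="pt")
# Generate summary token ids with beam search
summary_text_ids = model.generate(
    input_ids=input_ids,
    bos_token_id=model.config.bos_token_id,
    eos_token_id=model.config.eos_token_id,
    length_penalty=2.0,  # with beam search, values > 0 favor longer summaries
    max_length=142,      # upper bound on summary length, in tokens
    min_length=56,       # lower bound on summary length, in tokens
    num_beams=4,
)
# Decode the generated ids back into text, dropping special tokens
print(tokenizer.decode(summary_text_ids[0], skip_special_tokens=True))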
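
The steps above can be wrapped in one reusable function. The following is a minimal sketch: the function name `summarize` and its parameters are hypothetical, and it assumes `tokenizer` and `model` are the objects loaded in the basic example. It uses only the calls already shown there (`tokenizer.encode`, `model.generate`, `tokenizer.decode`).

```python
def summarize(text, tokenizer, model, max_length=142, min_length=56,
              num_beams=4, length_penalty=2.0):
    """Return a summary string for one article (hypothetical convenience wrapper)."""
    # Encode the article into a batch holding one sequence of token ids.
    input_ids = tokenizer.encode(text, return_tensors="pt")
    # Beam-search generation; defaults match the basic example.
    summary_ids = model.generate(
        input_ids=input_ids,
        bos_token_id=model.config.bos_token_id,
        eos_token_id=model.config.eos_token_id,
        length_penalty=length_penalty,
        max_length=max_length,
        min_length=min_length,
        num_beams=num_beams,
    )
    # Decode the first (and only) generated sequence, dropping special tokens.
    return tokenizer.decode(summary_ids[0], skip_special_tokens=True)
```

With the objects from the basic example in scope, a call would look like `print(summarize(input_text, tokenizer, model))`.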

Advanced Usage

You can try this model through [ainize-api](https://ainize.ai/gkswjdzz/summarize-torchserve?branch=main) and [ainize-demo](https://main-summarize-torchserve-gkswjdzz.endpoint.ainize.ai/).

📄 License

This project is licensed under the MIT license.
