K

Kobart Base V2

Developed by gogamza
KoBART is a Korean encoder-decoder language model based on the BART architecture, trained with text infilling noise functions, supporting Korean text feature extraction and generation tasks.
Downloads 5,937
Release Time : 3/2/2022

Model Overview

Korean BART model, trained in an autoencoder format, suitable for Korean text feature extraction and generation tasks.

Model Features

Korean Optimization
Specially trained for Korean, including Korean Wikipedia and various other Korean corpora
Emoji Support
High-frequency emojis are specially added to the vocabulary to enhance emoji recognition capabilities
Efficient Tokenization
Trained with a character-level BPE tokenizer for high tokenization efficiency

Model Capabilities

Korean text feature extraction
Korean text generation
Text infilling
Text summarization

Use Cases

Text Processing
Sentiment Analysis
Used for sentiment classification of Korean text
Achieved 90.24% accuracy on the NSMC dataset
Text Similarity Calculation
Calculates semantic similarity between Korean sentences
Spearman coefficient of 81.66 on the KorSTS dataset
Question Pairing
Determines whether two Korean questions are semantically identical
Accuracy reached 94.34%
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase