P

Pko T5 Base

Developed by paust
pko-t5 is a T5 model specifically optimized for Korean, trained exclusively on Korean data using BBPE tokenization to address Korean segmentation issues.
Downloads 874
Release Time : 5/16/2022

Model Overview

pko-t5 is a Korean-optimized model based on the T5 v1.1 architecture, trained on Korean corpora through unsupervised learning, suitable for various Korean NLP tasks.

Model Features

Korean optimization
Specially designed and optimized for Korean, trained exclusively on Korean data.
BBPE tokenization
Uses BBPE (no OOV issues) instead of sentencepiece for Korean tokenization, improving segmentation performance.
Multi-task support
Supports various NLP tasks, including text generation, classification, question answering, and more.

Model Capabilities

Text generation
Text classification
Question answering system
Named entity recognition
Semantic similarity calculation

Use Cases

Natural language processing
Korean question answering system
Build a Korean question answering system to respond to user queries.
Performs well on the KLUE benchmark
Text classification
Classify Korean texts, such as news categorization or sentiment analysis.
Achieves 87.29 macro-F1 on the YNAT task
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase