Pko T5 Base
pko-t5 is a T5 model trained exclusively on Korean data. It uses byte-level BPE (BBPE) tokenization to avoid out-of-vocabulary problems when segmenting Korean text.
Downloads 874
Release Time: 5/16/2022
Model Overview
pko-t5 is a Korean-optimized model based on the T5 v1.1 architecture, trained on Korean corpora through unsupervised learning, suitable for various Korean NLP tasks.
Model Features
Korean optimization
Specially designed and optimized for Korean, trained exclusively on Korean data.
BBPE tokenization
Uses byte-level BPE (BBPE) instead of SentencePiece to tokenize Korean. Because every string can be represented as bytes, there are no out-of-vocabulary (OOV) tokens, which improves segmentation.
Multi-task support
Because pretraining is unsupervised only, the model is meant to be fine-tuned for a target task. It can then be used for text generation, classification, question answering, and more.
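
The "no OOV" property of BBPE noted above follows from its base alphabet: byte-level BPE starts from the 256 possible byte values, so any Korean string can be encoded before a single merge is learned. The sketch below illustrates only that base step. It is not the pko-t5 tokenizer, the helper names are hypothetical, and the learned merges that a real BBPE vocabulary adds on top are omitted.

```python
from typing import List


def to_byte_symbols(text: str) -> List[int]:
    """Return the UTF-8 byte values of `text` (the base alphabet of byte-level BPE)."""
    return list(text.encode("utf-8"))


def from_byte_symbols(symbols: List[int]) -> str:
    """Invert to_byte_symbols: rebuild the original string from its byte values."""
    return bytes(symbols).decode("utf-8")


text = "한국어 T5"
symbols = to_byte_symbols(text)

# Every symbol lies in 0..255, so a 256-entry base vocabulary covers any input.
in_base_vocab = all(0 <= s < 256 for s in symbols)

# The mapping is lossless: decoding the bytes recovers the exact input.
round_trip = from_byte_symbols(symbols)

print(in_base_vocab, round_trip == text, len(symbols))
```

Each Hangul syllable occupies three bytes in UTF-8, which is why a real BBPE tokenizer learns merges that join frequent byte sequences back into larger units.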
Model Capabilities
Text generation
Text classification
Question answering
Named entity recognition
Semantic similarity calculation
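
T5-style models handle all of the capabilities listed above in a text-to-text form: the input is a string and the expected output is also a string, so classification labels and answers are generated as text. The following is a minimal sketch of how fine-tuning pairs might be prepared. The task prefixes ("classify: ", "question: ", "context: ") and the helper names are assumptions for illustration, not prefixes defined by pko-t5.

```python
from dataclasses import dataclass


@dataclass
class Seq2SeqExample:
    source: str  # text fed to the encoder
    target: str  # text the decoder is trained to produce


def make_classification_example(text: str, label: str,
                                prefix: str = "classify: ") -> Seq2SeqExample:
    """Cast a classification sample as text-to-text: the label itself is the target string."""
    return Seq2SeqExample(source=prefix + text, target=label)


def make_qa_example(question: str, context: str, answer: str) -> Seq2SeqExample:
    """Cast an extractive QA sample as text-to-text (hypothetical prefix layout)."""
    source = "question: " + question + " context: " + context
    return Seq2SeqExample(source=source, target=answer)


cls_example = make_classification_example("새 스마트폰 출시", "IT")
qa_example = make_qa_example("수도는 어디인가?", "대한민국의 수도는 서울이다.", "서울")
print(cls_example)
print(qa_example)
```

Whatever prefix scheme is chosen, it has to be used consistently during fine-tuning and inference, since the model only learns the convention from the training pairs.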
Use Cases
Natural language processing
Korean question answering system
Build a Korean question answering system to respond to user queries.
Performs well on the KLUE benchmark
Text classification
Classify Korean texts, such as news categorization or sentiment analysis.
Achieves 87.29 macro-F1 on YNAT, the KLUE topic classification task
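
For readers unfamiliar with the metric, macro-F1 is the unweighted mean of the per-class F1 scores, so rare classes count as much as frequent ones. Below is a small self-contained sketch of the computation. The label names and predictions are toy data invented for illustration and have nothing to do with the reported score, which is on a 0 to 100 scale while this function returns a value between 0 and 1.

```python
from typing import List, Sequence


def macro_f1(y_true: Sequence[str], y_pred: Sequence[str]) -> float:
    """Unweighted mean of per-class F1 over all labels seen in y_true or y_pred."""
    labels = sorted(set(y_true) | set(y_pred))
    f1_scores: List[float] = []
    for label in labels:
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != label and p == label)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never appears.
        f1_scores.append(2 * tp / denom if denom else 0.0)
    return sum(f1_scores) / len(f1_scores)


# Toy topic labels (hypothetical, not the YNAT label set).
y_true = ["IT", "IT", "sports", "politics"]
y_pred = ["IT", "sports", "sports", "politics"]
score = macro_f1(y_true, y_pred)
print(round(score, 4))
```

Here the per-class F1 values are 2/3 for "IT", 2/3 for "sports", and 1 for "politics", giving a macro-F1 of 7/9.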
© 2025 AIbase