
OPEN SOLAR KO 10.7B

Developed by beomi
Korean-enhanced version of SOLAR-10.7B-v1.0, continually pre-trained on a Korean corpus with an expanded vocabulary
Downloads 1,151
Release Date: 1/2/2024

Model Overview

Open-Solar-Ko is a 10.7B-parameter large language model focused on Korean; vocabulary expansion and continued pre-training on Korean corpora improve its Korean text generation

Model Features

Korean-optimized vocabulary
Expands the original vocabulary to 46,592 tokens, significantly improving Korean tokenization efficiency (the example text drops from 26 tokens to 8)
Curated public corpus
Trained exclusively on publicly available Korean corpora such as AI Hub, Modu Corpus, and Korean Wikipedia, in compliance with open-source licenses
Efficient architecture
Uses an optimized architecture with a 4k context length and Grouped Query Attention (GQA)
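
The tokenization figures quoted for the expanded vocabulary (26 tokens down to 8) can be turned into a relative saving, and the same comparison can be repeated on other text. The following is a minimal sketch, not the author's measurement script. The helper names are hypothetical, and the Hugging Face repository ids in the comments (`upstage/SOLAR-10.7B-v1.0`, `beomi/OPEN-SOLAR-KO-10.7B`) are assumptions that should be checked before use.

```python
from typing import Callable, List


def token_reduction(before: int, after: int) -> float:
    """Fraction of tokens saved when a text shrinks from `before` to `after` tokens."""
    if before <= 0:
        raise ValueError("before must be a positive token count")
    return (before - after) / before


def count_tokens(tokenize: Callable[[str], List[str]], text: str) -> int:
    """Count the tokens produced by any callable that maps text to a token list."""
    return len(tokenize(text))


def load_tokenize(model_id: str) -> Callable[[str], List[str]]:
    """Return the `tokenize` method of a Hugging Face tokenizer.

    Imported lazily so the arithmetic above runs without `transformers`.
    Assumed repo ids: "upstage/SOLAR-10.7B-v1.0" (base model) and
    "beomi/OPEN-SOLAR-KO-10.7B" (this model).
    """
    from transformers import AutoTokenizer

    return AutoTokenizer.from_pretrained(model_id).tokenize


# Figures from the feature description: 26 tokens before, 8 after.
saving = token_reduction(26, 8)
print(f"{saving:.1%} fewer tokens")  # prints "69.2% fewer tokens"
```

To repeat the comparison on your own text, pass `load_tokenize(...)` for each model to `count_tokens` and feed the two counts to `token_reduction`.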

Model Capabilities

Korean text generation
English text generation
Korean understanding tasks
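
For readers who want to try these capabilities, here is a hedged sketch of loading the model with the Hugging Face `transformers` library and generating text within the 4k context window. The repository id, the reading of "4k" as 4096 tokens, and the helper names `fits_context` and `generate` are assumptions for illustration. `device_map="auto"` additionally requires the `accelerate` package.

```python
MODEL_ID = "beomi/OPEN-SOLAR-KO-10.7B"  # assumed Hub repo id, verify before use
MAX_CONTEXT_TOKENS = 4096  # assumption: "4k context length" means 4096 tokens


def fits_context(prompt_tokens: int, max_new_tokens: int,
                 limit: int = MAX_CONTEXT_TOKENS) -> bool:
    """True if the prompt plus the requested continuation fits in the window."""
    return prompt_tokens + max_new_tokens <= limit


def generate(prompt: str, max_new_tokens: int = 64,
             model_id: str = MODEL_ID) -> str:
    """Greedy-decode a continuation of `prompt` (downloads the model on first use)."""
    # Heavy imports are kept local so the helper above works without them.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.float16, device_map="auto"
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    if not fits_context(inputs["input_ids"].shape[1], max_new_tokens):
        raise ValueError("prompt plus max_new_tokens exceeds the context window")
    with torch.no_grad():
        output_ids = model.generate(
            **inputs, max_new_tokens=max_new_tokens, do_sample=False
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("안녕하세요, 오늘은")` would return the prompt followed by the model's continuation. Because this is a pre-trained base model, plain text continuation is the natural way to prompt it.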

Use Cases

Natural language processing
Korean text generation
Generate contextually appropriate Korean text content
Sentiment analysis
Analyze sentiment in Korean text
Achieved 0.896 accuracy on the NSMC test set (50-shot)
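
The 50-shot figure refers to few-shot prompting: labeled examples are placed in the prompt ahead of the review to classify. The sketch below shows one way to assemble such a prompt and to score predictions. The prompt template, the Korean label words, and the function names are assumptions, not the format used for the reported result, and the sample reviews are invented placeholders rather than NSMC data.

```python
from typing import Sequence, Tuple

# Assumed label wording: 0 = negative ("부정"), 1 = positive ("긍정").
LABELS = {0: "부정", 1: "긍정"}


def build_few_shot_prompt(shots: Sequence[Tuple[str, int]], query: str) -> str:
    """Join labeled reviews and one unlabeled review into a single prompt."""
    blocks = [f"리뷰: {text}\n감정: {LABELS[label]}" for text, label in shots]
    # The final block leaves the label empty for the model to complete.
    blocks.append(f"리뷰: {query}\n감정:")
    return "\n\n".join(blocks)


def accuracy(predictions: Sequence[int], gold: Sequence[int]) -> float:
    """Share of predictions that match the gold labels."""
    if len(predictions) != len(gold) or not gold:
        raise ValueError("predictions and gold must be non-empty and equal length")
    return sum(p == g for p, g in zip(predictions, gold)) / len(gold)


# Invented placeholder reviews, two shots instead of fifty for brevity.
shots = [("정말 재미있는 영화였어요", 1), ("시간이 아까웠습니다", 0)]
prompt = build_few_shot_prompt(shots, "배우들의 연기가 훌륭했다")
print(prompt)
```

With 50 shots the prompt grows long, so the more compact Korean tokenization helps keep it inside the 4k context window.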