L

Llama DNA 1.0 8B Instruct

Developed by dnotitia
A state-of-the-art bilingual language model based on the Llama architecture, specially optimized for Korean understanding and generation while maintaining strong English capabilities.
Downloads 661
Release Time : 12/6/2024

Model Overview

The DNA 1.0 8B Instruct Model was developed through a complex model merging process, including spherical linear interpolation (SLERP) with the Llama 3.1 8B Instruct Model and knowledge distillation (KD) using Llama 3.1 405B as the teacher model. It underwent extensive training via continual pre-training (CPT) on high-quality Korean datasets and completed the training process with supervised fine-tuning (SFT) and direct preference optimization (DPO).

Model Features

Optimized Korean Capabilities
Specially optimized for Korean understanding and generation while maintaining strong English capabilities.
Advanced Training Methods
Utilizes various advanced training techniques including spherical linear interpolation (SLERP), knowledge distillation (KD), continual pre-training (CPT), supervised fine-tuning (SFT), and direct preference optimization (DPO).
Long Context Support
Supports long context processing of up to 131,072 tokens (128k).
Human Preference Alignment
Outputs are more aligned with human preferences through the direct preference optimization (DPO) training process.

Model Capabilities

Korean text generation
English text generation
Multi-turn dialogue
Complex instruction understanding
Knowledge Q&A

Use Cases

Intelligent Assistants
Korean Chatbot
Intelligent conversational assistant for Korean environments
Excellent performance on Korean benchmarks such as KMMLU and KoBEST
Education
Language Learning Assistant
Helps learners practice Korean and English
Business Applications
Bilingual Customer Service System
Handles customer inquiries in Korean and English
Featured Recommended AI Models
ยฉ 2025AIbase