L

Llama 3.1 Swallow 8B Instruct V0.2

Developed by tokyotech-llm
Llama 3.1 Swallow is a series of large language models that are continuously pre - trained based on the Meta Llama 3.1 model, enhancing Japanese capabilities while retaining English capabilities.
Downloads 2,283
Release Time : 10/30/2024

Model Overview

Llama 3.1 Swallow is a series of large language models built through continuous pre - training on the Meta Llama 3.1 model, focusing on enhancing Japanese capabilities while maintaining English capabilities. Two parameter scales of 8B and 70B are available, suitable for multilingual text generation and understanding tasks.

Model Features

Enhanced multilingual capabilities
Significantly improved Japanese language processing capabilities while retaining English capabilities
Continuous pre - training
Conducted continuous pre - training of approximately 200 billion tokens based on the Meta Llama 3.1 model
Optimized instruction fine - tuning
Used specially constructed Japanese synthetic data for supervised fine - tuning to improve instruction - following capabilities
Performance balance
Maintained a high performance level in both Japanese and English tasks

Model Capabilities

Japanese text generation
English text generation
Multi - turn dialogue
Machine reading comprehension
Automatic summarization
Machine translation
Mathematical reasoning
Code generation

Use Cases

Content creation
Japanese story creation
Generate creative stories that conform to the Japanese cultural background
Can generate coherent stories rich in Japanese cultural characteristics
Technical document writing
Write technical documents in Japanese or English
Can generate well - structured technical content
Language services
Japanese - English machine translation
Perform text translation between Japanese and English
Performed well in the WMT20 evaluation
Japanese question - answering system
Build Japanese question - answering and customer service robots
Achieved high accuracy in Japanese question - answering tasks
Educational assistance
Japanese learning assistant
Help non - native Japanese speakers learn Japanese
Can explain grammar and cultural background
Mathematical problem solving
Solve mathematical problems in Japanese or English
Performed well in the MGSM mathematical reasoning evaluation
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase