L

Llama 3 Swallow 8B Instruct V0.1

Developed by tokyotech-llm
A Japanese-optimized large language model built on Meta Llama 3, enhancing Japanese capabilities through continuous pre-training and improving instruction-following abilities through supervised fine-tuning.
Downloads 13.88k
Release Time : 6/26/2024

Model Overview

Llama3 Swallow is a Japanese-optimized model that undergoes continuous pre-training based on the Llama 3 series. It mainly adds Japanese data and uses SFT fine-tuning to support multilingual task processing in Japanese and English.

Model Features

Japanese optimization
Enhance Japanese processing capabilities through continuous pre-training and perform excellently in Japanese benchmark tests.
Multilingual support
Support both Japanese and English and handle cross-lingual tasks.
Instruction fine-tuning
Use supervised fine-tuning (SFT) and chat vector technology to improve instruction-following abilities.
High performance
Achieve excellent results in various benchmark tests for Japanese and English.

Model Capabilities

Japanese text generation
English text generation
Machine translation
Question-answering system
Code generation
Text summarization
Mathematical reasoning

Use Cases

Content creation
Japanese story creation
Generate creative stories that conform to the Japanese cultural background.
Such as the heartwarming story of swallows and llamas generated in the example.
Education
Japanese learning assistance
Help learners understand and generate Japanese content.
Business application
Japanese customer service robot
Build an intelligent customer service system in a Japanese environment.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase