L

Llama 3.1 Swallow 70B Instruct V0.3

Developed by tokyotech-llm
Llama 3.1 Swallow is a series of large language models built on Meta Llama 3.1. It enhances Japanese language capabilities through continuous pre-training while retaining English language capabilities.
Downloads 1,659
Release Time : 12/25/2024

Model Overview

Llama 3.1 Swallow is a series of large language models (8B, 70B) built by continuously pre-training on the Meta Llama 3.1 model, enhancing Japanese language capabilities while retaining English language capabilities.

Model Features

Multilingual capabilities
Supports English and Japanese, enhancing Japanese language capabilities while retaining English language capabilities.
Continuous pre-training
Continuously pre-trained on the Meta Llama 3.1 model to improve model performance.
Instruction tuning
Instruction tuning is performed using synthetic data specifically built for Japanese, enabling the model to better understand and respond to user instructions.

Model Capabilities

Japanese text generation
English text generation
Multi-round dialogue
Instruction understanding and response

Use Cases

Dialogue system
Japanese dialogue assistant
Used to build a Japanese dialogue assistant that can understand and generate natural Japanese dialogues.
Performs excellently in the Japanese MT-Bench test.
Content generation
Japanese story generation
Generates Japanese stories or content, such as the story of Tokyo Momiji Park in the example.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase