L

Llama 3.1 Swallow 8B Instruct V0.3

Developed by tokyotech-llm
Llama 3.1 Swallow is a series of large language models built on Meta Llama 3.1. It enhances Japanese capabilities through continuous pre-training while retaining English capabilities.
Downloads 16.48k
Release Time : 12/18/2024

Model Overview

This model enhances Japanese capabilities on the basis of Llama 3.1 and is suitable for text generation tasks in Japanese and English, especially for scenarios that require Japanese support.

Model Features

Enhanced Japanese capabilities
Significantly improved Japanese processing capabilities through continuous pre-training with approximately 200 billion tokens.
Multilingual support
Retains the original English capabilities while enhancing Japanese capabilities.
Instruction fine-tuning
An instruction fine-tuned model built through supervised fine-tuning can better respond to instructions.

Model Capabilities

Japanese text generation
English text generation
Multi-round dialogue
Instruction response

Use Cases

Dialogue system
Japanese customer service assistant
Used in customer service dialogue systems in Japanese environments.
Performs well on the Japanese MT-Bench
Content creation
Japanese story creation
Generate Japanese short stories or creative writing.
Can generate coherent Japanese narratives
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase