L

Llama 3 Swallow 8B V0.1

Developed by tokyotech-llm
A large Japanese - enhanced language model built on Meta Llama 3, which improves Japanese processing capabilities through continuous pre - training and instruction fine - tuning.
Downloads 2,230
Release Time : 5/20/2024

Model Overview

Llama3 Swallow is a variant of the Meta Llama 3 series models trained with enhanced Japanese data. It comes in two parameter scales, 8B and 70B, and supports English and Japanese text generation tasks.

Model Features

Enhanced Japanese capabilities
Significantly improves performance on Japanese tasks through continuous pre - training with a large amount of Japanese data.
Bilingual support
Supports both English and Japanese processing and performs excellently in bilingual tasks.
Optimized instruction version
Provides an instruction version optimized through supervised fine - tuning (SFT) and chat vector technology.

Model Capabilities

Japanese text generation
English text generation
Machine translation
Question - answering system
Code generation
Mathematical reasoning
Summary generation

Use Cases

Natural language processing
Japanese question - answering system
Build an intelligent question - answering application for Japanese users.
Achieved an accuracy of 89.45% on the JCommonsenseQA benchmark.
English - Japanese machine translation
Achieve high - quality bidirectional English - Japanese translation.
BLEU score of 0.2758 on the WMT20 English - Japanese translation.
Educational applications
Japanese learning assistant
Assist Japanese learners in language practice and knowledge query.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase