Swallow 7b Instruct Hf

Developed by tokyotech-llm
A Japanese-enhanced large language model built on the Llama 2 series, with instruction-following improved through supervised fine-tuning
Downloads 1,938
Release date: 12/7/2023

Model Overview

Swallow is a Japanese-optimized large language model developed by the Tokyo Institute of Technology LLM team. It builds on Llama 2 and strengthens Japanese processing through continual pre-training and instruction fine-tuning, and it supports tasks in both Japanese and English.
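
As a rough illustration of how an instruction-tuned model like this is typically called, the sketch below wraps an instruction in a prompt and generates a reply with the Hugging Face transformers library. The Hub ID is inferred from the developer and model name on this page, and the prompt template is an assumed Alpaca-style layout, not something this page specifies; both should be checked against the official model card before use.

```python
# Assumption: Hub ID inferred from the developer and model name on this page.
MODEL_ID = "tokyotech-llm/Swallow-7b-instruct-hf"

# Assumption: an Alpaca-style instruction template ("指示" = instruction,
# "応答" = response). Verify the real template on the official model card.
PROMPT_TEMPLATE = "### 指示:\n{instruction}\n\n### 応答:\n"


def build_prompt(instruction: str) -> str:
    """Wrap a user instruction in the assumed instruction template."""
    return PROMPT_TEMPLATE.format(instruction=instruction.strip())


def generate(instruction: str, max_new_tokens: int = 128) -> str:
    """Load the model and generate a reply (downloads several GB of weights)."""
    # Imported lazily so build_prompt works without these packages installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype=torch.bfloat16, device_map="auto"
    )
    prompt = build_prompt(instruction)
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, not the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate("日本の首都はどこですか")` would load the weights and return the model's reply; `device_map="auto"` additionally requires the accelerate package to be installed.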

Model Features

Japanese-optimized vocabulary
Adds Japanese-specific tokens to the vocabulary, so Japanese text is encoded in fewer tokens
Bilingual support
Handles tasks in both Japanese and English
Instruction fine-tuning
Supervised fine-tuning (SFT) improves how well the model understands and follows instructions
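
One way to make the encoding-efficiency claim concrete is to measure how many characters each token covers on average. The sketch below is a minimal illustration under stated assumptions: `chars_per_token` is a hypothetical helper, and the two toy tokenizers are stand-ins that only mimic a small versus an expanded vocabulary; they are not the actual Llama 2 or Swallow tokenizers.

```python
from typing import Callable, Iterable, List


def chars_per_token(texts: Iterable[str], tokenize: Callable[[str], List]) -> float:
    """Average number of characters encoded by one token (higher = more efficient)."""
    total_chars = 0
    total_tokens = 0
    for text in texts:
        total_chars += len(text)
        total_tokens += len(tokenize(text))
    if total_tokens == 0:
        raise ValueError("tokenizer produced no tokens")
    return total_chars / total_tokens


# Toy stand-ins for real tokenizers, for illustration only.
def char_level(text: str) -> List[str]:
    """Worst case: one token per character."""
    return list(text)


def bigram_level(text: str) -> List[str]:
    """Pretend vocabulary that covers every two-character chunk."""
    return [text[i:i + 2] for i in range(0, len(text), 2)]


sample = ["日本語の文章", "東京工業大学"]
print(chars_per_token(sample, char_level))    # 1.0
print(chars_per_token(sample, bigram_level))  # 2.0
```

To compare real vocabularies, the same helper could be given the `tokenize` method of a Hugging Face tokenizer for each model and run over a representative Japanese corpus.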

Model Capabilities

Japanese text generation
English text generation
Common sense reasoning
Open-ended question answering
Reading comprehension
Summarization
Mathematical reasoning
Machine translation

Use Cases

Education
Japanese learning assistant
Helps students understand Japanese grammar and vocabulary
Achieved 48.08% accuracy on the JCommonsenseQA Japanese common sense test
Content creation
Japanese article generation
Generates coherent Japanese articles based on prompts
Scored 18.30% on the XL-Sum summarization task
Translation services
Japanese-English translation
Enables mutual translation between Japanese and English
Achieved a BLEU score of 25.10 on WMT20 English-Japanese translation