S

Swallow MS 7b Instruct V0.1

Developed by tokyotech-llm
Japanese-enhanced large language model continuously pre-trained based on Mistral-7B-v0.1
Downloads 48
Release Time : 3/29/2024

Model Overview

Swallow-MS-7b-v0.1 is a Japanese-enhanced large language model based on the Mistral-7B-v0.1 architecture, optimized for Japanese text processing through additional Japanese data training.

Model Features

Japanese-optimized tokenizer
Tokenizer expanded with Japanese vocabulary that can represent text more efficiently with fewer tokens, significantly speeding up inference
Bilingual support
Supports both Japanese and English processing, with special optimization for Japanese capabilities
Instruction-following capability
Provides good instruction-following performance through the instruction-tuned version (Swallow-MS-7b-instruct-v0.1)

Model Capabilities

Japanese text generation
English text generation
Instruction understanding and execution
Multi-turn dialogue

Use Cases

Intelligent assistant
Japanese Q&A system
Used to build Japanese intelligent Q&A assistants
Outperforms multiple Japanese models in the MT-Bench JA benchmark
Content generation
Japanese content creation
Generates Japanese articles, reports, and other text content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase