G

Gemma 2 Llama Swallow 27b It V0.1

Developed by tokyotech-llm
A Japanese-enhanced large language model based on the Gemma-2 architecture, significantly improving Japanese capabilities while retaining original English proficiency
Downloads 27
Release Time : 4/24/2025

Model Overview

This model is one of a series built through continued pretraining of Google Gemma-2, specifically optimized for Japanese processing capabilities, suitable for Japanese-English bilingual text generation and comprehension tasks

Model Features

Enhanced Bilingual Capabilities
Significantly improved Japanese processing while retaining the original Gemma 2 English capabilities
Large-scale Pretraining
Continued pretraining using approximately 200 billion tokens of mixed corpus, including specialized Japanese data
Instruction Fine-tuning Optimization
Employed supervised fine-tuning (SFT) with specially constructed synthetic data for Japanese

Model Capabilities

Japanese text generation
English text generation
Japanese-English bilingual comprehension
Multi-turn dialogue processing
Code generation

Use Cases

Language Services
Japanese Chat Assistant
Building fluent and natural Japanese dialogue systems
Excellent performance in Japanese MT-Bench evaluations
Japanese-English Translation
Achieving high-quality bidirectional translation
Competitive performance on WMT20 benchmark
Education
Japanese Learning Assistance
Helping non-native speakers learn Japanese
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase