D

Deepseek R1 Distill Qwen 7B Japanese

Developed by lightblue
This is the Japanese version of the DeepSeek R1 model, specifically fine-tuned for Japanese reasoning tasks and capable of reliably and accurately responding to prompts in Japanese.
Downloads 1,067
Release Time : 1/24/2025

Model Overview

This model is a fine-tuned version of DeepSeek-R1-Distill-Qwen-7B on a Japanese reasoning dataset, which solves the problem of inconsistent output of the original model under Japanese prompts.

Model Features

Japanese optimization
Specifically fine-tuned for Japanese, solving the problem of inconsistent output of the original model under Japanese prompts
Efficient training
Trained on Alibaba Cloud 8 x L20 instances in less than 10 minutes
Reasoning ability
Retains the excellent reasoning ability of the original model, especially suitable for solving mathematical and logical problems
Output consistency
More stable and reliable in Japanese output compared to the original model

Model Capabilities

Japanese text generation
Mathematical reasoning
Logical problem solving
Multi-round dialogue

Use Cases

Education
Mathematical problem solving
Solve Japanese mathematical problems, especially those requiring multi-step reasoning
Achieved 70% accuracy on the GSM8K Japanese test set
Customer service
Japanese customer consultation
Handle consultations and questions from Japanese customers
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase