DeepSeek Math 7B Instruct Open-source Mathematical Model - Freely Available to Aid in Solving Business Mathematical Problems

Deepseek Math 7b Instruct

Developed by deepseek-ai

DeepSeekMath is a model specialized in mathematics. Its code repository is released under the MIT License, and the model itself supports commercial use under a separate Model License.

Large Language Model

Transformers

Open Source License: Other #Mathematical Reasoning #Commercial License #MIT License

Downloads 26.24k

Release Time: 2/5/2024

Model Overview

DeepSeekMath is a specialized model for mathematics, designed to handle math-related tasks such as problem-solving and formula derivation.

Model Features

Commercial Use Support

DeepSeekMath explicitly supports commercial use, making it suitable for businesses and developers to integrate into commercial products.

Math-Specialized

The model focuses on mathematics and is built to handle math problems and formula derivation tasks.

Model Capabilities

Mathematical problem-solving

Formula derivation

Use Cases

Education

Math Tutoring

Helps students solve math problems by providing detailed step-by-step solutions and explanations.

Improves learning efficiency and enhances mathematical understanding.

Research

Formula Derivation Assistance

Assists researchers in complex mathematical formula derivation and verification.

Accelerates research progress and reduces manual calculation errors.

🚀 DeepSeekMath

DeepSeekMath is a model built for math-related tasks such as problem-solving and formula derivation, and it supports commercial use.

🚀 Quick Start

For more details about DeepSeekMath, see the Introduction.

💻 Usage Examples

Basic Usage

Here are some examples of how to use our model.

Chat Completion

⚠️ Important Note

Please use a chain-of-thought prompt to test DeepSeekMath-Instruct and DeepSeekMath-RL:

English questions: {question}\nPlease reason step by step, and put your final answer within \boxed{}.

Chinese questions: {question}\n请通过逐步推理来解答问题，并把最终答案放置于\boxed{}中。

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM, GenerationConfig

# Load the tokenizer and the model in bfloat16, placing weights on the available devices.
model_name = "deepseek-ai/deepseek-math-7b-instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16, device_map="auto")
# Use the model's published generation settings and pad with the EOS token.
model.generation_config = GenerationConfig.from_pretrained(model_name)
model.generation_config.pad_token_id = model.generation_config.eos_token_id

# A single user turn that ends with the chain-of-thought instruction.
messages = [
    {"role": "user", "content": "what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}."}
]
# Apply the chat template and append the generation prompt for the assistant turn.
input_tensor = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")
outputs = model.generate(input_tensor.to(model.device), max_new_tokens=100)

# Decode only the newly generated tokens, skipping the prompt.
result = tokenizer.decode(outputs[0][input_tensor.shape[1]:], skip_special_tokens=True)
print(result)
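
The chain-of-thought suffixes from the note above, and the \boxed{} convention for the final answer, can be wrapped in two small helpers. The following is only a minimal sketch: the names build_cot_prompt and extract_boxed are hypothetical and are not part of the DeepSeekMath repository or of transformers. It assumes the model follows the instruction and emits the final answer inside a \boxed{...} expression.

```python
from typing import Optional

# Suffixes copied from the chain-of-thought note above.
EN_SUFFIX = "\nPlease reason step by step, and put your final answer within \\boxed{}."
ZH_SUFFIX = "\n请通过逐步推理来解答问题，并把最终答案放置于\\boxed{}中。"


def build_cot_prompt(question: str, lang: str = "en") -> str:
    """Append the chain-of-thought instruction for the given language ("en" or "zh")."""
    suffixes = {"en": EN_SUFFIX, "zh": ZH_SUFFIX}
    if lang not in suffixes:
        raise ValueError(f"unsupported language: {lang!r}")
    return question.strip() + suffixes[lang]


def extract_boxed(text: str) -> Optional[str]:
    """Return the contents of the last boxed expression in text, or None.

    Braces are matched by depth so that nested LaTeX such as
    a fraction inside the box is returned whole.
    """
    marker = "\\boxed{"
    start = text.rfind(marker)
    if start == -1:
        return None
    depth = 1
    chars = []
    for ch in text[start + len(marker):]:
        if ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                return "".join(chars)
        chars.append(ch)
    return None  # the closing brace was never found (e.g. truncated output)
```

The string returned by build_cot_prompt can be used as the "content" of the user message, and extract_boxed can be applied to the decoded result. If generation is cut off by max_new_tokens before the box is closed, extract_boxed returns None.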

Advanced Usage

Instead of using the provided apply_chat_template function, you can also interact with our model by following the sample template below. Replace messages with your own input.

User: {messages[0]['content']}

Assistant: {messages[1]['content']}<｜end▁of▁sentence｜>User: {messages[2]['content']}

Assistant:

⚠️ Important Note

By default (add_special_tokens=True), our tokenizer automatically adds a bos_token (<｜begin▁of▁sentence｜>) before the input text. Additionally, since the system prompt is not compatible with this version of our models, we DO NOT RECOMMEND including the system prompt in your input.
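
The sample template above can be assembled by hand, as in the following minimal sketch. The function name build_prompt is hypothetical, and the separators are taken from the template as shown in this section, not from the tokenizer's internal configuration. Rejecting the system role is a choice made for this sketch, based on the recommendation above.

```python
from typing import Dict, List

# End-of-sentence marker as it appears in the sample template.
EOS_TOKEN = "<｜end▁of▁sentence｜>"


def build_prompt(messages: List[Dict[str, str]]) -> str:
    """Format alternating user/assistant messages following the sample template.

    The bos_token is intentionally NOT added here, because the tokenizer
    adds it by default (add_special_tokens=True).
    """
    parts = []
    for message in messages:
        role, content = message["role"], message["content"]
        if role == "user":
            parts.append(f"User: {content}\n\n")
        elif role == "assistant":
            parts.append(f"Assistant: {content}{EOS_TOKEN}")
        else:
            # System prompts are not recommended for this model version.
            raise ValueError(f"unsupported role: {role!r}")
    # Leave the final assistant turn open for the model to complete.
    parts.append("Assistant:")
    return "".join(parts)
```

The resulting string can then be passed to the tokenizer with its default settings, which prepend the bos_token as described in the note above.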

📄 License

This code repository is licensed under the MIT License. The use of DeepSeekMath models is subject to the Model License. DeepSeekMath supports commercial use.

See LICENSE-MODEL for more details.

📞 Contact

If you have any questions, please raise an issue or contact us at service@deepseek.com.
