🚀 DeepSeekMath
DeepSeekMath is a powerful math model. It provides effective solutions for math - related tasks and supports commercial use.
🚀 Quick Start
For more details about DeepSeekMath, see the Introduction.
💻 Usage Examples
Basic Usage
Here are some examples of how to use our model.
Chat Completion
⚠️ Important Note
Please use chain - of - thought prompt to test DeepSeekMath - Instruct and DeepSeekMath - RL:
- English questions: {question}\nPlease reason step by step, and put your final answer within \boxed{}.
- Chinese questions: {question}\n请通过逐步推理来解答问题,并把最终答案放置于\boxed{}中。
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM, GenerationConfig
model_name = "deepseek-ai/deepseek-math-7b-instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16, device_map="auto")
model.generation_config = GenerationConfig.from_pretrained(model_name)
model.generation_config.pad_token_id = model.generation_config.eos_token_id
messages = [
{"role": "user", "content": "what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}."}
]
input_tensor = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")
outputs = model.generate(input_tensor.to(model.device), max_new_tokens=100)
result = tokenizer.decode(outputs[0][input_tensor.shape[1]:], skip_special_tokens=True)
print(result)
Advanced Usage
Avoiding the use of the provided function apply_chat_template
, you can also interact with our model following the sample template. Note that messages
should be replaced by your input.
User: {messages[0]['content']}
Assistant: {messages[1]['content']}<|end▁of▁sentence|>User: {messages[2]['content']}
Assistant:
⚠️ Important Note
By default (add_special_tokens=True
), our tokenizer automatically adds a bos_token
(<|begin▁of▁sentence|>
) before the input text. Additionally, since the system prompt is not compatible with this version of our models, we DO NOT RECOMMEND including the system prompt in your input.
📄 License
This code repository is licensed under the MIT License. The use of DeepSeekMath models is subject to the Model License. DeepSeekMath supports commercial use.
See the LICENSE - MODEL for more details.
📞 Contact
If you have any questions, please raise an issue or contact us at service@deepseek.com.
[🏠Homepage] | [🤖 Chat with DeepSeek LLM] | [Discord] | [Wechat(微信)]
Paper Link👁️