Mistral Nemo 12B R1 V0.2
模型简介
该模型在1台Nvidia H100上进行了12小时的微调,擅长处理需要分步推理的任务,如数学问题解答和逻辑分析。
模型特点
系统性推理能力
采用<think>标签格式进行结构化思考,提供至少6个逻辑步骤的详细推理过程
高效微调
仅需12小时H100训练即可获得优质推理能力
数学逻辑处理
擅长数值比较、计数等需要逐步验证的任务
模型能力
分步逻辑推理
数学问题解答
文本生成
结构化思考
使用案例
教育辅助
数学问题解答
解决需要分步验证的数学比较问题
如示例中准确比较9.9和9.11的大小
逻辑测试
字母计数
执行精确的字母位置分析和计数
如示例中正确识别'strawberry'包含3个R
🚀 Mistral Nemo 12B R1
Mistral Nemo 12B R1 是一个文本生成模型,它基于 Mistral-Nemo-Instruct-2407 进行微调,在特定数据集上训练后,可用于生成具有推理过程的文本。
🚀 快速开始
本模型在 1 块英伟达 H100 显卡上进行了 12 小时 的微调,具体设置如下:
- 批次大小:26
- 梯度累积步数:1
- 训练轮数:1
- 学习率:2e - 5
- 热身比例:0.1
运行模型的代码如下:
import torch
from transformers import pipeline
model_id = "CreitinGameplays/Mistral-Nemo-12B-R1-v0.2"
pipe = pipeline(
"text-generation",
model=model_id,
torch_dtype=torch.bfloat16,
device_map="auto"
)
messages = [
{"role": "user", "content": "How many r's are in strawberry?"}
]
outputs = pipe(
messages,
max_new_tokens=2048
)
print(outputs[0]["generated_text"][-1])
💻 使用示例
基础用法
以下是使用该模型生成回复的示例:
- 用户输入:
how many r's are in the word strawberry
- 模型输出:
<think>
Okay, so the question is asking how many R's are in the word "strawberry." Hmm, let me think about this step by step.
First, I need to write down the word to make it easier to count. So, "strawberry" is the word. Now, I should go through each letter one by one to check for the letter R.
Starting from the beginning, the first letter is S. Then, the second letter is T, the third is R, the fourth is A, the fifth is W, the sixth is B, the seventh is E, the eighth is R, the ninth is R, the tenth is Y. Wait, so I've counted up to the tenth letter, which is Y.
Now, I need to count how many of those are R's. Let's see: the third letter is R, the eighth letter is R, and the ninth letter is R. So that's three R's in total.
Wait, hold on, let me double-check to make sure I didn't miss any. The word is "strawberry." S, T, R, A, W, B, E, R, R, Y. Yeah, that's three R's. So, the answer should be three.
I think that's it. I don't see any other R's in the word. So, the number of R's in "strawberry" is three.
**Final Answer**
The word "strawberry" contains three R's.
</think>
The word "strawberry" contains three R's.
**Answer:** Three R's.</s>
高级用法
- 用户输入:
which number has bigger value, 9.9 or 9.11? think step by step.
- 模型输出:
<think>
Okay, so I have this question here: which number is bigger, 9.9 or 9.11? Hmm, let me think about this step by step. I remember that when we compare numbers, we can look at their whole number parts first. So, both numbers start with 9. That means they are equal in terms of the whole number part, right? So, they are both 9.
Now, since the whole numbers are the same, the next thing to compare would be the decimal parts. The decimal part of 9.9 is 0.9, and for 9.11, it's 0.11. I need to figure out which of these decimal parts is larger.
I recall that when comparing decimals, we can line them up by the place value to make it easier. So, let's write them out:
9.9 is equal to 9.900000...
And 9.11 is equal to 9.110000...
Now, if I line them up by the tenths place, it looks like this:
9.900000...
9.110000...
Looking at the tenths place, the first number has 9 and the second has 1. Since 9 is larger than 1, the first number is bigger. Therefore, 9.9 is larger than 9.11.
Wait, but let me double-check to make sure I didn't make a mistake. Sometimes, when comparing decimals, it's easy to overlook the significance of the place value. So, let me consider the hundredths place as well. In 9.9, the hundredths place is 9, and in 9.11, it's 1. So, again, 9 is larger than 1, which confirms that 9.9 is bigger.
Is there another way to think about this? Maybe by converting them to fractions or something? Let me try that. 9.9 is equal to 9 and 9/10, and 9.11 is equal to 9 and 11/100. So, 9 and 9/10 is larger than 9 and 11/100 because 9/10 is larger than 11/100.
Alternatively, I can express both numbers as fractions with the same denominator to compare them. 9.9 is 99/10, and 9.11 is 911/100. If I find a common denominator, which is 100, then 99/10 is equal to 990/100 and 911/100 is just 911/100. Comparing 990 and 911, 990 is definitely larger than 911, so 9.9 is bigger.
Another way is to think about how many times 0.1 is in each number. 9.9 is 9 and a half, which is 9.5, and 9.11 is 9 and a bit more than half, which is less than 9.5. So, 9.9 is larger.
Wait, but maybe I should consider the context of the question. Is this a math problem or something else? If it's a math problem, then the answer is straightforward. But if it's in a different context, like comparing prices or something, maybe the answer is different? Hmm, but the question is just asking which number is bigger, so I think it's a straightforward math question.
Let me also think about the place value in a different way. The number 9.9 has a 9 in the tenths place, which is more significant than the 1 in the tenths place of 9.11. So, that means 9.9 is larger.
Is there a way to visualize this? Maybe by using a number line. On the number line, 9.9 is to the right of 9.11, so it's larger. That makes sense.
Another thought: if I add 0.1 to both numbers, 9.9 becomes 10, and 9.11 becomes 9.21. Clearly, 10 is larger than 9.21, so 9.9 is bigger. That seems to confirm my earlier conclusion.
Wait, let me try adding 0.01 to both numbers. 9.9 becomes 9.91, and 9.11 becomes 9.12. Now, 9.91 is still larger than 9.12. So, that also shows that 9.9 is bigger.
I think I've considered this from multiple angles, and each time, 9.9 comes out as the larger number. So, I'm confident that 9.9 is bigger than 9.11.
**Final Answer**
The number with the larger value is \boxed{9.9}.
</think>
To determine which number is larger, 9.9 or 9.11, we can follow these steps:
1. **Compare the whole number parts**: Both numbers start with 9, so they are equal in terms of the whole number part.
2. **Compare the decimal parts**: The decimal part of 9.9 is 0.9, and the decimal part of 9.11 is 0.11. We need to compare these two decimals.
3. **Line up the decimals by place value**:
- 9.9 is equal to 9.900000...
- 9.11 is equal to 9.110000...
4. **Compare the tenths place**: The first number has 9 and the second has 1. Since 9 is larger than 1, the first number is bigger.
5. **Compare the hundredths place**: The first number has 9 and the second has 1. Again, 9 is larger than 1, confirming that the first number is bigger.
6. **Convert to fractions for verification**:
- 9.9 is equal to 99/10.
- 9.11 is equal to 911/100.
- Comparing 99/10 and 911/100, we find that 99/10 is larger than 911/100.
7. **Consider the number line**: On the number line, 9.9 is to the right of 9.11, indicating that it is larger.
8. **Add small numbers to both**:
- Adding 0.1 to both: 9.9 becomes 10, and 9.11 becomes 9.21. Clearly, 10 is larger.
- Adding 0.01 to both: 9.9 becomes 9.91, and 9.11 becomes 9.12. Again, 9.91 is larger.
All methods lead to the conclusion that 9.9 is larger than 9.11.
\[
\boxed{9.9}
\]</s>
📚 详细文档
激活推理步骤的推荐系统提示如下:
You are an AI focused on providing systematic, well-reasoned responses. Response Structure: - Format: <think>{reasoning}</think>{answer} - Reasoning: Minimum 6 logical steps only when it required in <think> block - Process: Think first, then answer.
📄 许可证
本项目采用 MIT 许可证。
⚠️ 重要提示
- 该模型主要在英文数据集上进行微调,这意味着它在其他语言上的表现可能不佳。
- 根据你输入的最后一个提示,模型 可能 会丢失上下文或忽略你最后一轮的输入。
Phi 2 GGUF
其他
Phi-2是微软开发的一个小型但强大的语言模型,具有27亿参数,专注于高效推理和高质量文本生成。
大型语言模型 支持多种语言
P
TheBloke
41.5M
205
Roberta Large
MIT
基于掩码语言建模目标预训练的大型英语语言模型,采用改进的BERT训练方法
大型语言模型 英语
R
FacebookAI
19.4M
212
Distilbert Base Uncased
Apache-2.0
DistilBERT是BERT基础模型的蒸馏版本,在保持相近性能的同时更轻量高效,适用于序列分类、标记分类等自然语言处理任务。
大型语言模型 英语
D
distilbert
11.1M
669
Llama 3.1 8B Instruct GGUF
Meta Llama 3.1 8B Instruct 是一个多语言大语言模型,针对多语言对话用例进行了优化,在常见的行业基准测试中表现优异。
大型语言模型 英语
L
modularai
9.7M
4
Xlm Roberta Base
MIT
XLM-RoBERTa是基于100种语言的2.5TB过滤CommonCrawl数据预训练的多语言模型,采用掩码语言建模目标进行训练。
大型语言模型 支持多种语言
X
FacebookAI
9.6M
664
Roberta Base
MIT
基于Transformer架构的英语预训练模型,通过掩码语言建模目标在海量文本上训练,支持文本特征提取和下游任务微调
大型语言模型 英语
R
FacebookAI
9.3M
488
Opt 125m
其他
OPT是由Meta AI发布的开放预训练Transformer语言模型套件,参数量从1.25亿到1750亿,旨在对标GPT-3系列性能,同时促进大规模语言模型的开放研究。
大型语言模型 英语
O
facebook
6.3M
198
1
基于transformers库的预训练模型,适用于多种NLP任务
大型语言模型
Transformers

1
unslothai
6.2M
1
Llama 3.1 8B Instruct
Llama 3.1是Meta推出的多语言大语言模型系列,包含8B、70B和405B参数规模,支持8种语言和代码生成,优化了多语言对话场景。
大型语言模型
Transformers 支持多种语言

L
meta-llama
5.7M
3,898
T5 Base
Apache-2.0
T5基础版是由Google开发的文本到文本转换Transformer模型,参数规模2.2亿,支持多语言NLP任务。
大型语言模型 支持多种语言
T
google-t5
5.4M
702
精选推荐AI模型
Llama 3 Typhoon V1.5x 8b Instruct
专为泰语设计的80亿参数指令模型,性能媲美GPT-3.5-turbo,优化了应用场景、检索增强生成、受限生成和推理任务
大型语言模型
Transformers 支持多种语言

L
scb10x
3,269
16
Cadet Tiny
Openrail
Cadet-Tiny是一个基于SODA数据集训练的超小型对话模型,专为边缘设备推理设计,体积仅为Cosmo-3B模型的2%左右。
对话系统
Transformers 英语

C
ToddGoldfarb
2,691
6
Roberta Base Chinese Extractive Qa
基于RoBERTa架构的中文抽取式问答模型,适用于从给定文本中提取答案的任务。
问答系统 中文
R
uer
2,694
98