Mistral-Nemo-12B-R1-v0.2オープンソース大規模言語モデル - 無料デプロイで論理推論と段階的解決をサポート

ホーム

Mistral Nemo 12B R1 V0.2

CreitinGameplaysによって開発

Mistral-Nemo-Instruct-2407をファインチューニングした大規模言語モデルで、体系的な論理推論と段階的解答能力に特化しています。

大規模言語モデル

Transformers

英語オープンソースライセンス:MIT #段階的推論生成 #数学的論理分析 #英語CoTファインチューニング

ダウンロード数 197

リリース時間 : 4/18/2025

モデル概要

このモデルはNvidia H100 1台で12時間のファインチューニングを行い、数学問題の解答や論理分析など段階的推論を必要とするタスクに優れています。

モデル特徴

体系的な推論能力

<think>タグ形式で構造化思考を行い、少なくとも6つの論理ステップを含む詳細な推論プロセスを提供

効率的なファインチューニング

H100でわずか12時間の訓練で優れた推論能力を獲得

数学的論理処理

数値比較やカウントなど段階的検証が必要なタスクに強い

モデル能力

段階的論理推論

数学問題解答

テキスト生成

構造化思考

使用事例

教育補助

数学問題解答

段階的検証が必要な数学的比較問題を解決

例示通り9.9と9.11の大小を正確に比較

論理テスト

文字カウント

正確な文字位置分析とカウントを実行

例示通り'strawberry'に3つのRが含まれることを正しく認識

🚀 ミストラルネモ12B R1

ミストラルネモ12B R1は、高性能なテキスト生成モデルです。特定の設定でファインチューニングされ、様々な質問に論理的な回答を提供できます。

mistralthink

🚀 クイックスタート

このモデルは、1台のNvidia H100で以下の設定で12時間かけてファインチューニングされました。

バッチサイズ: 26
勾配累積ステップ: 1
エポック数: 1
学習率: 2e-5
ウォームアップ率: 0.1

モデルを実行するには、以下のコードを使用します。

import torch
from transformers import pipeline

model_id = "CreitinGameplays/Mistral-Nemo-12B-R1-v0.2"

pipe = pipeline(
    "text-generation",
    model=model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto"
)

messages = [
    {"role": "user", "content": "How many r's are in strawberry?"}
]

outputs = pipe(
    messages,
    max_new_tokens=2048
)

print(outputs[0]["generated_text"][-1])

💻 使用例

基本的な使用法

import torch
from transformers import pipeline

model_id = "CreitinGameplays/Mistral-Nemo-12B-R1-v0.2"

pipe = pipeline(
    "text-generation",
    model=model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto"
)

messages = [
    {"role": "user", "content": "How many r's are in strawberry?"}
]

outputs = pipe(
    messages,
    max_new_tokens=2048
)

print(outputs[0]["generated_text"][-1])

高度な使用法

このモデルを使用して、具体的な質問に対する回答例を示します。

ユーザーの質問:

how many r's are in the word strawberry

モデルの出力:

<think>
Okay, so the question is asking how many R's are in the word "strawberry." Hmm, let me think about this step by step.

First, I need to write down the word to make it easier to count. So, "strawberry" is the word. Now, I should go through each letter one by one to check for the letter R.

Starting from the beginning, the first letter is S. Then, the second letter is T, the third is R, the fourth is A, the fifth is W, the sixth is B, the seventh is E, the eighth is R, the ninth is R, the tenth is Y. Wait, so I've counted up to the tenth letter, which is Y.

Now, I need to count how many of those are R's. Let's see: the third letter is R, the eighth letter is R, and the ninth letter is R. So that's three R's in total.

Wait, hold on, let me double-check to make sure I didn't miss any. The word is "strawberry." S, T, R, A, W, B, E, R, R, Y. Yeah, that's three R's. So, the answer should be three.

I think that's it. I don't see any other R's in the word. So, the number of R's in "strawberry" is three.

**Final Answer**
The word "strawberry" contains three R's.
</think>

The word "strawberry" contains three R's.

**Answer:** Three R's.</s>

例2:

ユーザーの質問:

which number has bigger value, 9.9 or 9.11? think step by step.

モデルの出力:

<think>
Okay, so I have this question here: which number is bigger, 9.9 or 9.11? Hmm, let me think about this step by step. I remember that when we compare numbers, we can look at their whole number parts first. So, both numbers start with 9. That means they are equal in terms of the whole number part, right? So, they are both 9. 

Now, since the whole numbers are the same, the next thing to compare would be the decimal parts. The decimal part of 9.9 is 0.9, and for 9.11, it's 0.11. I need to figure out which of these decimal parts is larger. 

I recall that when comparing decimals, we can line them up by the place value to make it easier. So, let's write them out:

9.9 is equal to 9.900000...
And 9.11 is equal to 9.110000...

Now, if I line them up by the tenths place, it looks like this:

9.900000...
9.110000...

Looking at the tenths place, the first number has 9 and the second has 1. Since 9 is larger than 1, the first number is bigger. Therefore, 9.9 is larger than 9.11.

Wait, but let me double-check to make sure I didn't make a mistake. Sometimes, when comparing decimals, it's easy to overlook the significance of the place value. So, let me consider the hundredths place as well. In 9.9, the hundredths place is 9, and in 9.11, it's 1. So, again, 9 is larger than 1, which confirms that 9.9 is bigger.

Is there another way to think about this? Maybe by converting them to fractions or something? Let me try that. 9.9 is equal to 9 and 9/10, and 9.11 is equal to 9 and 11/100. So, 9 and 9/10 is larger than 9 and 11/100 because 9/10 is larger than 11/100. 

Alternatively, I can express both numbers as fractions with the same denominator to compare them. 9.9 is 99/10, and 9.11 is 911/100. If I find a common denominator, which is 100, then 99/10 is equal to 990/100 and 911/100 is just 911/100. Comparing 990 and 911, 990 is definitely larger than 911, so 9.9 is bigger.

Another way is to think about how many times 0.1 is in each number. 9.9 is 9 and a half, which is 9.5, and 9.11 is 9 and a bit more than half, which is less than 9.5. So, 9.9 is larger.

Wait, but maybe I should consider the context of the question. Is this a math problem or something else? If it's a math problem, then the answer is straightforward. But if it's in a different context, like comparing prices or something, maybe the answer is different? Hmm, but the question is just asking which number is bigger, so I think it's a straightforward math question.

Let me also think about the place value in a different way. The number 9.9 has a 9 in the tenths place, which is more significant than the 1 in the tenths place of 9.11. So, that means 9.9 is larger.

Is there a way to visualize this? Maybe by using a number line. On the number line, 9.9 is to the right of 9.11, so it's larger. That makes sense.

Another thought: if I add 0.1 to both numbers, 9.9 becomes 10, and 9.11 becomes 9.21. Clearly, 10 is larger than 9.21, so 9.9 is bigger. That seems to confirm my earlier conclusion.

Wait, let me try adding 0.01 to both numbers. 9.9 becomes 9.91, and 9.11 becomes 9.12. Now, 9.91 is still larger than 9.12. So, that also shows that 9.9 is bigger.

I think I've considered this from multiple angles, and each time, 9.9 comes out as the larger number. So, I'm confident that 9.9 is bigger than 9.11.

**Final Answer**
The number with the larger value is \boxed{9.9}.
</think>

To determine which number is larger, 9.9 or 9.11, we can follow these steps:

1. **Compare the whole number parts**: Both numbers start with 9, so they are equal in terms of the whole number part.

2. **Compare the decimal parts**: The decimal part of 9.9 is 0.9, and the decimal part of 9.11 is 0.11. We need to compare these two decimals.

3. **Line up the decimals by place value**:
   - 9.9 is equal to 9.900000...
   - 9.11 is equal to 9.110000...

4. **Compare the tenths place**: The first number has 9 and the second has 1. Since 9 is larger than 1, the first number is bigger.

5. **Compare the hundredths place**: The first number has 9 and the second has 1. Again, 9 is larger than 1, confirming that the first number is bigger.

6. **Convert to fractions for verification**:
   - 9.9 is equal to 99/10.
   - 9.11 is equal to 911/100.
   - Comparing 99/10 and 911/100, we find that 99/10 is larger than 911/100.

7. **Consider the number line**: On the number line, 9.9 is to the right of 9.11, indicating that it is larger.

8. **Add small numbers to both**:
   - Adding 0.1 to both: 9.9 becomes 10, and 9.11 becomes 9.21. Clearly, 10 is larger.
   - Adding 0.01 to both: 9.9 becomes 9.91, and 9.11 becomes 9.12. Again, 9.91 is larger.

All methods lead to the conclusion that 9.9 is larger than 9.11.

\[
\boxed{9.9}
\]</s>

推論ステップを有効にするための推奨システムプロンプト:

You are an AI focused on providing systematic, well-reasoned responses. Response Structure: - Format: <think>{reasoning}</think>{answer} - Reasoning: Minimum 6 logical steps only when it required in <think> block - Process: Think first, then answer.

📚 ドキュメント

属性	詳情
モデルタイプ	テキスト生成
ベースモデル	mistralai/Mistral-Nemo-Instruct-2407
学習データセット	CreitinGameplays/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B-filtered-mistral
ライブラリ名	transformers
ライセンス	MIT