LLaMA-3-8B-Instruct-Abliterated-TR開源模型 - 強制用土耳其語響應，便於土耳其語交流

首頁

Llama 3 8B Instruct Abliterated TR

由Metin開發

LLaMA-3-8B-Instruct的消融版本，通過特定技術手段強制模型用土耳其語響應

大型語言模型

Transformers

其他#土耳其語優化 #指令消融技術 #多輪對話支持

下載量 25

發布時間 : 4/25/2025

模型概述

該模型是基於Meta-Llama-3-8B-Instruct的消融版本，通過修改激活方向強制模型使用土耳其語回答問題。適用於土耳其語文本生成任務。

模型特點

土耳其語強制響應

通過消融技術強制模型使用土耳其語回答問題

基於LLaMA-3架構

繼承LLaMA-3的強大文本生成能力

指令優化

針對指令跟隨任務進行了優化

模型能力

土耳其語文本生成

指令理解與執行

問答系統

知識解釋

使用案例

教育

土耳其語學習輔助

幫助學生理解土耳其語問題和概念

提供準確的知識解釋

客服

土耳其語客服機器人

處理土耳其語客戶諮詢

提供流暢自然的土耳其語響應

🚀 🦙LLaMA-3-8B-Instruct-Abliterated-TR

LLaMA-3-8B-Instruct-Abliterated-TR 是 Meta-LLaMA-3-8B-Instruct 的消融版本，該模型通過特定方法消除了大語言模型中的拒絕機制，並強制其以土耳其語進行響應。

📦 模型信息

屬性	詳情
許可證	llama3
支持語言	土耳其語
任務類型	文本生成
基礎模型	meta-llama/Meta-Llama-3-8B-Instruct

📊 模型評估結果

數據集	評估指標	得分	驗證情況
MMLU_TR_V0.2	5-shot	49.08%	未驗證
Truthful_QA_TR_V0.2	0-shot	49.62%	未驗證
ARC_TR_V0.2	25-shot	43.77%	未驗證
HellaSwag_TR_V0.2	10-shot	44.86%	未驗證
GSM8K_TR_V0.2	5-shot	53.23%	未驗證
Winogrande_TR_V0.2	5-shot	55.13%	未驗證
平均得分	-	49.28%	-

🔍 什麼是消融（Abliteration）？

Arditi 等人在他們的博客文章中指出，大語言模型中的拒絕機制是由殘差流中的單一方向介導的。他們發現，阻止模型表示這個方向可以使其回答有害問題。若想深入瞭解這一概念，你可以參考 Maxime Labonne 的文章。

為了強制模型用土耳其語進行響應，研究人員使用了 LIMA 數據集的 stackexchange 子集製作了並行指令。這些指令隨後被翻譯成土耳其語，並在運行時附加了一個額外的句子，提示模型用土耳其語回答。

你可以通過以下鏈接找到本實驗中使用的數據集：

https://huggingface.co/datasets/Metin/abliteration_en
https://huggingface.co/datasets/Metin/abliteration_tr

💻 使用示例

你可以使用以下代碼片段來使用該模型：

from transformers import BitsAndBytesConfig
import transformers
import torch

bnb_config = BitsAndBytesConfig(
            load_in_4bit=True,
            bnb_4bit_use_double_quant=True,
            bnb_4bit_quant_type="nf4",
            bnb_4bit_compute_dtype=torch.bfloat16
)

model_id = "Metin/LLaMA-3-8B-Instruct-Abliterated-TR"

pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    model_kwargs={"torch_dtype": torch.bfloat16 ,'quantization_config': bnb_config},
    device_map="auto",
)

messages = [
    {"role": "system", "content": "You are a helpful assistant."}, # Ideally we should not have to tell the model to answer in Turkish after abliteration.
    {"role": "user", "content": "Python'da bir öğenin bir listede geçip geçmediğini nasıl kontrol edebilirim?"},
]

prompt = pipeline.tokenizer.apply_chat_template(
        messages,
        tokenize=False,
        add_generation_prompt=True
)

terminators = [
    pipeline.tokenizer.eos_token_id,
    pipeline.tokenizer.convert_tokens_to_ids("<|eot_id|>")
]

outputs = pipeline(
    prompt,
    max_new_tokens=512,
    eos_token_id=terminators,
    do_sample=True,
    temperature=0.2,
    top_p=0.9,
)

print(outputs[0]["generated_text"][len(prompt):])

📈 模型細節

使用 40 個樣本找出激活值之間的均值差異。
選擇第 7 層作為最有可能的土耳其語表達方向的層。

⚠️ 注意事項

這些得分可能與你運行相同基準測試時得到的結果不同，因為本實驗未使用任何推理引擎（如 vLLM、TensorRT-LLM 等）。

📋 輸出示例（消融模型與基礎模型對比）

雖然用單個示例測試模型並不準確，但這裡仍提供一個示例來展示模型的能力。

模型：LLaMA-3-8B-Instruct-Abliterated-TR

輸入

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Gökyüzü neden mavi?"},
]

輸出

A classic question!

Gökyüzü (the sky) neden mavi (blue) sorusu, Türkçe'de "Why is the sky blue?" olarak çevrilebilir. Bu soruya cevap vermek için, ilk olarak, gökyüzünün mavi olmasının nedenini açıklamak gerekir.

Gökyüzünün mavi olmasının nedeni, ışığın farklı uzunluklarındaki yayılımına dayanır. Işık, gökyüzüne ulaşmadan önce, atmosferde bulunan gaz moleküllerine çarpınır. Bu çarpma, ışığın farklı uzunluklarındaki yayılımına neden olur.

Gökyüzünün mavi olmasının nedeni, ışığın kısa dalga boyunda (yaklaşık 450-495 nm) yayılımına dayanır. Bu dalga boyunda ışığın, gökyüzüne ulaşmadan önce, atmosferde bulunan gaz moleküllerine çarpınır. Bu çarpma, ışığın mavi renkini oluşturur.

Diğer renkler, farklı dalga boyunda yayılımına neden olur. Örneğin, kırmızı renk, ışığın uzun dalga boyunda (yaklaşık 620-750 nm) yayılımına neden olur. Bu nedenle, gökyüzünün mavi olmasının nedeni, ışığın kısa dalga boyunda yayılımına dayanır.

Bu cevap, gökyüzünün mavi olmasının nedenini açıklamaktadır.

模型：LLaMA-3-8B-Instruct

輸入

A question that has puzzled humans for centuries!

The short answer is that the sky appears blue because of a phenomenon called Rayleigh scattering. This is the scattering of light by small particles or molecules in the atmosphere, such as nitrogen (N2) and oxygen (O2).

When sunlight enters the Earth's atmosphere, it encounters these tiny molecules and scatters in all directions. However, shorter (blue) wavelengths are scattered more than longer (red) wavelengths. This is because the smaller molecules are more effective at scattering the shorter wavelengths.

As a result, our eyes perceive the blue light as being more abundant, giving the sky its blue appearance. The color of the sky can vary depending on the time of day, atmospheric conditions, and other factors, but Rayleigh scattering is the main reason why the sky appears blue.

In Turkish, the phrase "Gökyüzü neden mavi" can be translated to "Why is the sky blue?" or "What makes the sky blue?"