🚀 DeepMount00/Murai-350M-v0.1-beta
这是一个用于文本生成的模型,基于transformers库构建,具备高效的参数架构和出色的文本生成能力。
🚀 快速开始
以下是使用该模型进行文本生成的示例代码:
from transformers import pipeline, AutoTokenizer, AutoModelForCausalLM
model_id = "DeepMount00/Murai-350M-v0.1-beta"
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(model_id)
t_pipeline = pipeline(
"text-generation",
model=model,
tokenizer=tokenizer,
device_map="auto",
return_full_text=True,
top_p = 0.95,
top_k = 50,
repetition_penalty=1.2
)
SYSTEM_PROMPT = """Sei un assistente utile."""
TEMPERATURE = 0.1
MAX_NEW_TOKENS = 250
messages = [
{"role": "system", "content": SYSTEM_PROMPT},
{"role": "user", "content": """Scrivi una funzione python che somma due numeri"""},
]
conv_template = tokenizer.apply_chat_template(
messages,
tokenize=False,
add_generation_prompt=True
)
outputs = t_pipeline(
conv_template,
max_new_tokens=MAX_NEW_TOKENS,
do_sample=True,
temperature=TEMPERATURE,
num_return_sequences=1,
)
print(outputs[0]["generated_text"])
💻 使用示例
基础用法
from transformers import pipeline, AutoTokenizer, AutoModelForCausalLM
model_id = "DeepMount00/Murai-350M-v0.1-beta"
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(model_id)
t_pipeline = pipeline(
"text-generation",
model=model,
tokenizer=tokenizer,
device_map="auto",
return_full_text=True,
top_p = 0.95,
top_k = 50,
repetition_penalty=1.2
)
SYSTEM_PROMPT = """Sei un assistente utile."""
TEMPERATURE = 0.1
MAX_NEW_TOKENS = 250
messages = [
{"role": "system", "content": SYSTEM_PROMPT},
{"role": "user", "content": """Scrivi una funzione python che somma due numeri"""},
]
conv_template = tokenizer.apply_chat_template(
messages,
tokenize=False,
add_generation_prompt=True
)
outputs = t_pipeline(
conv_template,
max_new_tokens=MAX_NEW_TOKENS,
do_sample=True,
temperature=TEMPERATURE,
num_return_sequences=1,
)
print(outputs[0]["generated_text"])
🔧 技术细节
该模型使用了针对参数效率进行优化的深度架构:
- 采用RMSNorm的预归一化架构
- 采用分组查询注意力机制以提高内存效率
- 采用SwiGLU激活函数以提升性能
- 采用RoPE位置编码以实现更好的长度泛化能力
📚 引用说明
@misc{deepmount_llm_2024,
title={Deep LLM: A 350M Parameter Language Model with 42 Layers},
author={MicheleMontebovi},
year={2025},
url={https://huggingface.co/DeepMount00/Murai-350M-v0.1-beta}
}
📄 许可证
Apache 2.0