FineMedLM-o1: an open-source medical large language model for advanced medical reasoning with multi-step deliberation

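Developed by hongzhouyu

FineMedLM-o1 is a specialized medical large language model designed for advanced medical reasoning. It uses a multi-step reasoning mechanism, repeatedly reviewing and refining its chain of thought before giving a final answer.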

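Large language model

Transformers

Supports multiple languages

Open-source license: MIT

#medical-reasoning #multi-step-thinking #Chinese-English-bilingual

Downloads: 55

Released: 1/23/2025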

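Model overview

FineMedLM-o1 is a medical large language model built on Llama-3.1-8B and FineMedLM. It focuses on medical reasoning tasks and supports interaction in both Chinese and English.

Model features

Multi-step reasoning mechanism

Uses a slow-thinking mode: the model repeatedly reviews and refines its reasoning before giving a final answer.

Medical specialization

Optimized specifically for the medical domain, so it can handle complex medical reasoning problems.

Bilingual support

Supports interaction in Chinese and English to serve users of either language.

Model capabilities

Medical question answering

Medical reasoning

Explanation of medical knowledge

Chinese and English interaction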

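Use cases

Medical research

Analysis of medical mechanisms

Analyze how interactions between neuronal activity, gonadal hormones, and neurotrophins influence axon regeneration after injury.

Provides a detailed mechanistic analysis and its potential therapeutic implications.

Clinical assistance

Answering medical questions

Answer specialized medical questions with a detailed reasoning process.

Helps doctors and researchers obtain professional insights.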

🚀 FineMedLM-o1

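FineMedLM-o1 is a medical large language model designed specifically for advanced medical reasoning. It uses a multi-step reasoning process, repeatedly reflecting on and refining its reasoning before giving a final answer.

🚀 Quick start

FineMedLM-o1 can be used in the same way as Llama-3.1-8B-Instruct: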

from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the model and its tokenizer from the Hugging Face Hub.
model = AutoModelForCausalLM.from_pretrained("hongzhouyu/FineMedLM-o1")
tokenizer = AutoTokenizer.from_pretrained("hongzhouyu/FineMedLM-o1")

prompt = "How do the interactions between neuronal activity, gonadal hormones, and neurotrophins influence axon regeneration post-injury, and what are the potential therapeutic implications of this research? Please think step by step."
messages = [
    {"role": "system", "content": "You are a helpful professional doctor."},
    {"role": "user", "content": prompt}
]
# Render the conversation with the chat template and append the assistant header.
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
# Keep the inputs on the same device as the model.
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

# Pass input_ids and attention_mask together so generate() does not have to infer the mask.
generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=4096
)
# Strip the prompt tokens so that only the newly generated tokens are decoded.
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]

print(response)
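
When asking several questions, it can help to build the `messages` list in one place. The sketch below is a hypothetical helper (`build_messages` is not part of `transformers` or of the FineMedLM-o1 repository). It reuses the system prompt and the "Please think step by step." suffix from the quick-start snippet. Whether the model needs that suffix is an assumption; the source only shows it in the example prompt.

```python
from typing import Dict, List

# Both defaults are copied from the quick-start snippet.
DEFAULT_SYSTEM_PROMPT = "You are a helpful professional doctor."
STEP_BY_STEP_SUFFIX = "Please think step by step."


def build_messages(
    question: str,
    system_prompt: str = DEFAULT_SYSTEM_PROMPT,
    step_by_step: bool = True,
) -> List[Dict[str, str]]:
    """Build a chat-style message list (hypothetical helper, for illustration only)."""
    question = question.strip()
    # Append the suffix once, unless the caller already included it.
    if step_by_step and not question.endswith(STEP_BY_STEP_SUFFIX):
        question = f"{question} {STEP_BY_STEP_SUFFIX}"
    return [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": question},
    ]


messages = build_messages("What are common causes of iron-deficiency anemia?")
for message in messages:
    print(f"{message['role']}: {message['content']}")
```

The resulting list can be passed to `tokenizer.apply_chat_template` exactly as in the quick-start snippet.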

FineMedLM-o1 uses a "slow-thinking" approach and produces output in the following format:

**Thought**
[reasoning process]

**Summarization**
[final output]
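
Because the response has two labeled parts, downstream code may want to separate the reasoning from the final answer. The following is a minimal sketch that assumes the markers appear literally as `**Thought**` and `**Summarization**`, in that order. `ParsedResponse` and `split_response` are hypothetical names introduced here, not part of any library.

```python
import re
from typing import NamedTuple


class ParsedResponse(NamedTuple):
    thought: str        # text found under the **Thought** marker
    summarization: str  # text found under the **Summarization** marker


# Assumption: both markers appear literally and in this order.
_PATTERN = re.compile(
    r"\*\*Thought\*\*\s*(?P<thought>.*?)\s*\*\*Summarization\*\*\s*(?P<summary>.*)",
    re.DOTALL,
)


def split_response(response: str) -> ParsedResponse:
    """Split a response into its reasoning and final-answer parts.

    If the expected markers are missing, the whole text is returned as the
    summarization and the thought is left empty.
    """
    match = _PATTERN.search(response)
    if match is None:
        return ParsedResponse(thought="", summarization=response.strip())
    return ParsedResponse(
        thought=match.group("thought").strip(),
        summarization=match.group("summary").strip(),
    )


example = "**Thought**\nStep 1: consider X.\n\n**Summarization**\nX is likely."
parsed = split_response(example)
print(parsed.summarization)
```

Falling back to the full text when the markers are absent keeps the helper usable if the model occasionally deviates from the format.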

📚 Documentation

For more information, see our GitHub repository and paper.

📄 License

This project is released under the MIT License.

📦 Related information

Property	Details
Base models	meta-llama/Llama-3.1-8B, hongzhouyu/FineMedLM
Training datasets	hongzhouyu/FineMed-SFT, hongzhouyu/FineMed-DPO
Library	transformers
Tags	medical

📖 Citation

@misc{yu2025finemedlmo1enhancingmedicalreasoning,
    title={FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training}, 
    author={Hongzhou Yu and Tianhao Cheng and Ying Cheng and Rui Feng},
    year={2025},
    eprint={2501.09213},
    archivePrefix={arXiv},
    primaryClass={cs.CL},
    url={https://arxiv.org/abs/2501.09213}, 
}