qwen25-05b-multiclinsum-distil开源文本模型 - 免费支持多语言临床报告摘要生成

Home

Qwen25 05b Multiclinsum Distil

Developed by nicolay-r

本模型是基于 Qwen2.5-0.5B-Instruct 微调的文本生成模型，专注于多语言临床报告摘要生成任务。

大型语言模型

Transformers

Supports Multiple LanguagesOpen Source License:MIT #临床报告摘要生成 #多语言医疗文本处理 #蒸馏微调模型

Downloads 147

Release Time : 6/2/2025

Model Overview

该模型在 MultiClinSum 数据集上进行微调，专门用于生物医学领域的临床报告摘要生成，支持英语、法语、葡萄牙语和西班牙语。

Model Features

多语言支持

专门针对英语、法语、葡萄牙语和西班牙语的临床报告进行优化

知识蒸馏

使用 Qwen2.5-72B-Instruct 生成 rationale 进行知识蒸馏

高效微调

在 A100 GPU 上仅需约1小时即可完成微调

Model Capabilities

临床报告摘要生成

多语言文本处理

生物医学信息提取

Use Cases

医疗健康

临床报告自动摘要

自动生成患者临床报告的简明摘要

提高医疗专业人员处理信息的效率

跨语言医疗信息处理

处理不同语言的临床报告并生成统一格式的摘要

促进国际医疗信息交流

🚀 文本生成模型

本模型专注于文本生成领域，特别是临床报告摘要生成。它基于 Qwen/Qwen2.5-0.5B-Instruct 模型进行微调，在多语言临床报告摘要任务中表现出色，为生物医学领域的信息处理提供了高效解决方案。

🚀 快速开始

本模型是 Qwen/Qwen2.5-0.5B-Instruct 在 MultiClinSum 训练数据及其 rationale 上的蒸馏微调版本。该模型的结果用于提交 BioASQ-2025 研讨会 / CLEF 2025 的相关成果。

模型图片

我们首先采用 Qwen/Qwen2.5-72B-Instruct 为训练数据推断 rationale（更多细节请继续阅读）。

基线版本：https://huggingface.co/nicolay-r/qwen25-05b-multiclinsum-standard

✨ 主要特性

模型类型：基于解码器的模型
支持语言（NLP）：Qwen2.5 原生支持语言，并在 en、fr、pt、es 语言的摘要上进行了微调
许可证：MIT
微调基础模型：https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct

属性	详情
模型类型	基于解码器的模型
支持语言（NLP）	Qwen2.5 原生支持语言，并在 `en`、`fr`、`pt`、`es` 语言的摘要上进行了微调
许可证	MIT
微调基础模型	https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct

📦 安装指南

暂未提供相关安装步骤。

💻 使用示例

基础用法

from bulk_chain.api import iter_content
from bulk_chain.core.utils import dynamic_init

content_it = iter_content(
  schema={"schema": [
      {"prompt": "Summarize: {input}", "out": "summary"}]
  },
  llm=dynamic_init(
    class_filepath="huggingface_qwen.py",
    class_name="Qwen2")(
      api_token="YOUR_HF_API_KEY_GOES_HERE",
      model_name="nicolay-r/qwen25-05b-multiclinsum-distil",
      temp=0.1,
      use_bf16=True,
      max_new_tokens=args.max_tokens,
      device=args.device
  ),
  infer_mode="batch",
  batch_size=4,
  return_mode="record",
  # INPUT TEXTS:
  input_dicts_it=[
     {"input": "A patient 62 years old with ..."}
  ],
)

for record in content_it:
  # here is the result dictionary that includes summary.
  print(record["summary"])

高级用法

暂未提供相关高级用法示例。

📚 详细文档

模型来源

代码仓库：https://github.com/nicolay-r/distil-tuning-llm
论文：待公布
演示：https://colab.research.google.com/drive/1TXGaz39o73nBucEQw12gbad7Tw11j2Ol?usp=sharing

🔧 技术细节

训练数据

MultiClinSum
- 我们使用以下脚本下载数据集。
- 官网：https://temu.bsc.es/multiclinsum
- 数据：https://zenodo.org/records/15463353
- BioASQ：http://bioasq.org/

训练过程

训练过程包括：

为摘要蒸馏准备 rationale。
启动微调过程。

准备工作：我们采用 Qwen/Qwen2.5-72B-Instruct 通过以下脚本来推断 rationale：

https://github.com/nicolay-r/distil-tuning-llm/blob/master/predict/annotate_train_rationale.py
上述脚本依赖 open-router 作为远程 API 提供者：https://openrouter.ai/qwen/qwen-2.5-72b-instruct

微调：请遵循此脚本，在 GoogleColab A100（40GB VRAM）+ 80GB RAM 上使用 MultiClinSum 数据集进行微调：

https://github.com/nicolay-r/distil-tuning-llm/blob/master/distil_ft_qwen25_05b_A100-40GB_80GB_dis.sh

预处理

参考以下脚本进行 微调 预处理：

https://github.com/nicolay-r/distil-tuning-llm/blob/master/resources/make_dataset_mult.py

训练超参数

我们参考原始参数：

https://github.com/QwenLM/Qwen2.5-VL/tree/main/qwen-vl-finetune 并使用以下脚本：
https://github.com/nicolay-r/distil-tuning-llm/blob/master/distil_ft_qwen25_05b_A100-40GB_80GB_dis.sh