llmlingua-2-bert-base-multilingual-cased-meetingbank開源模型

首頁

Llmlingua 2 Bert Base Multilingual Cased Meetingbank

由microsoft開發

基於多語言BERT基礎模型微調的提示壓縮標記分類模型，用於任務無關的提示壓縮

大型語言模型

Transformers

開源協議:Apache-2.0 #會議記錄壓縮 #多語言提示優化 #任務無關壓縮

下載量 28.67k

發布時間 : 3/17/2024

模型概述

該模型用於執行任務無關的提示壓縮標記分類，每個標記的保留概率將作為壓縮度量指標。特別適用於會議記錄等文本的壓縮處理。

模型特點

任務無關提示壓縮

能夠在不依賴特定下游任務的情況下進行有效的提示壓縮

多語言支持

基於多語言BERT模型，支持多種語言的文本壓縮

數據蒸餾訓練

採用LLMLingua-2提出的數據蒸餾方法訓練，提高壓縮質量

模型能力

文本壓縮

標記分類

會議記錄處理

多語言文本處理

使用案例

會議記錄處理

會議記錄壓縮

壓縮冗長的會議記錄，保留關鍵信息

可顯著減少文本長度同時保持關鍵信息

下游任務預處理

為問答和摘要生成等下游任務預處理輸入文本

提高下游任務效率而不顯著影響準確性

🚀 LLMLingua-2-Bert-base-Multilingual-Cased-MeetingBank

本模型在論文 LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression (Pan 等人, 2024) 中被提出。它是一個基於 BERT 多語言基礎模型（區分大小寫）微調得到的模型，用於執行與任務無關的提示壓縮的標記分類任務。每個標記 $x_i$ 的保留概率 $p_{preserve}$ 被用作壓縮的度量標準。該模型在抽取式文本壓縮數據集上進行訓練，此數據集是使用 LLMLingua-2 中提出的方法構建的，以 MeetingBank (Hu 等人, 2023) 中的訓練示例作為種子數據。

你可以使用此數據集在下游任務（如問答（QA）和會議記錄壓縮後的摘要生成）上評估該模型。

更多詳細信息，請查看 LLMLingua-2 和 LLMLingua 系列的項目頁面。

🚀 快速開始

本模型基於 BERT 多語言基礎模型微調，用於任務無關的提示壓縮，可在問答、摘要生成等下游任務中評估使用。

💻 使用示例

基礎用法

from llmlingua import PromptCompressor

compressor = PromptCompressor(
    model_name="microsoft/llmlingua-2-bert-base-multilingual-cased-meetingbank",
    use_llmlingua2=True
)

original_prompt = """John: So, um, I've been thinking about the project, you know, and I believe we need to, uh, make some changes. I mean, we want the project to succeed, right? So, like, I think we should consider maybe revising the timeline.
Sarah: I totally agree, John. I mean, we have to be realistic, you know. The timeline is, like, too tight. You know what I mean? We should definitely extend it.
"""
results = compressor.compress_prompt_llmlingua2(
    original_prompt,
    rate=0.6,
    force_tokens=['\n', '.', '!', '?', ','],
    chunk_end_tokens=['.', '\n'],
    return_word_label=True,
    drop_consecutive=True
)

print(results.keys())
print(f"Compressed prompt: {results['compressed_prompt']}")
print(f"Original tokens: {results['origin_tokens']}")
print(f"Compressed tokens: {results['compressed_tokens']}")
print(f"Compression rate: {results['rate']}")

# get the annotated results over the original prompt
word_sep = "\t\t|\t\t"
label_sep = " "
lines = results["fn_labeled_original_prompt"].split(word_sep)
annotated_results = []
for line in lines:
    word, label = line.split(label_sep)
    annotated_results.append((word, '+') if label == '1' else (word, '-')) # list of tuples: (word, label)
print("Annotated results:")
for word, label in annotated_results[:10]:
    print(f"{word} {label}")