LongCite-llama3.1-8b開源模型 - 免費實現長上下文問答與細粒度引用生成

首頁

Longcite Llama3.1 8b

由THUDM開發

LongCite-llama3.1-8b 是基於 Meta-Llama-3.1-8B 訓練的模型，專注於長上下文問答並生成細粒度引用，支持最大 128K 標記的上下文窗口。

大型語言模型

Transformers

支持多種語言#長上下文問答 #細粒度引用生成 #128K標記窗口

下載量 4,469

發布時間 : 9/2/2024

模型概述

該模型專為長上下文問答設計，能夠在回答問題時提供細粒度的引用，適用於需要處理大量文本信息的場景。

模型特點

長上下文支持

支持最大 128K 標記的上下文窗口，能夠處理超長文本輸入。

細粒度引用生成

在回答問題時能夠生成詳細的引用，幫助用戶追溯信息來源。

高效推理

基於 Llama-3.1 架構優化，提供高效的推理性能。

模型能力

長文本理解

問答生成

引用生成

多輪對話

使用案例

學術研究

文獻綜述

幫助研究人員快速理解大量文獻內容並生成引用。

提高文獻綜述的效率和準確性。

知識問答

長文檔問答

從長文檔中提取信息並生成帶有引用的回答。

提供準確且可追溯的答案。

🚀 LongCite-llama3.1-8b

LongCite-llama3.1-8b 是基於 Meta-Llama-3.1-8B 訓練的模型，能夠在長上下文問答中生成細粒度的引用。該模型支持最大達 128K 標記的上下文窗口。

📚 [LongCite 數據集] • 💻 [GitHub 倉庫] • 📃 [LongCite 論文]

🚀 快速開始

環境要求

環境：transforemrs>=4.43.0。

模型部署示例

以下是一個簡單的模型部署示例：

import json
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained('THUDM/LongCite-llama3.1-8b', trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained('THUDM/LongCite-llama3.1-8b', torch_dtype=torch.bfloat16, trust_remote_code=True, device_map='auto')

context = '''
W. Russell Todd, 94, United States Army general (b. 1928). February 13. Tim Aymar, 59, heavy metal singer (Pharaoh) (b. 1963). Marshall \"Eddie\" Conway, 76, Black Panther Party leader (b. 1946). Roger Bonk, 78, football player (North Dakota Fighting Sioux, Winnipeg Blue Bombers) (b. 1944). Conrad Dobler, 72, football player (St. Louis Cardinals, New Orleans Saints, Buffalo Bills) (b. 1950). Brian DuBois, 55, baseball player (Detroit Tigers) (b. 1967). Robert Geddes, 99, architect, dean of the Princeton University School of Architecture (1965–1982) (b. 1923). Tom Luddy, 79, film producer (Barfly, The Secret Garden), co-founder of the Telluride Film Festival (b. 1943). David Singmaster, 84, mathematician (b. 1938).
'''
query = "What was Robert Geddes' profession?"
result = model.query_longcite(context, query, tokenizer=tokenizer, max_input_length=128000, max_new_tokens=1024)

print("Answer:\n{}\n".format(result['answer']))
print("Statement with citations:\n{}\n".format(
  json.dumps(result['statements_with_citations'], indent=2, ensure_ascii=False)))
print("Context (divided into sentences):\n{}\n".format(result['splited_context']))

你也可以使用 vllm 來部署該模型。具體代碼示例請參考 vllm_inference.py。

📄 許可證

Llama-3.1 許可證

📚 引用

如果你覺得我們的工作有幫助，請考慮引用 LongCite：

@article{zhang2024longcite,
  title = {LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA} 
  author={Jiajie Zhang and Yushi Bai and Xin Lv and Wanjun Gu and Danqing Liu and Minhao Zou and Shulin Cao and Lei Hou and Yuxiao Dong and Ling Feng and Juanzi Li},
  journal={arXiv preprint arXiv:2409.02897},
  year={2024}
}