LongCite-llama3.1-8bオープンソースモデル - 長文脈の質問応答と細粒度な引用生成を無料で実現

ホーム

Longcite Llama3.1 8b

THUDMによって開発

LongCite-llama3.1-8bはMeta-Llama-3.1-8Bをベースに訓練されたモデルで、長文脈質問応答と細粒度な引用生成に特化し、最大128Kトークンのコンテキストウィンドウをサポートします。

大規模言語モデル

Transformers

複数言語対応#长文脈質問応答 #細粒度な引用生成 #128Kトークンのウィンドウ

ダウンロード数 4,469

リリース時間 : 9/2/2024

モデル概要

このモデルは長文脈質問応答に特化して設計されており、質問に回答する際に細粒度な引用を提供でき、大量のテキスト情報を処理する必要があるシナリオに適しています。

モデル特徴

長文脈サポート

最大128Kトークンのコンテキストウィンドウをサポートし、超長テキスト入力を処理できます。

細粒度な引用生成

質問に回答する際に詳細な引用を生成し、ユーザーが情報源を追跡できるようにします。

効率的な推論

Llama-3.1アーキテクチャを最適化し、効率的な推論性能を提供します。

モデル能力

長文理解

質問応答生成

引用生成

多輪対話

使用事例

学術研究

文献レビュー

研究者が大量の文献内容を迅速に理解し、引用を生成するのを支援します。

文献レビューの効率と精度を向上させます。

知識質問応答

長文書質問応答

長文書から情報を抽出し、引用付きの回答を生成します。

正確で追跡可能な回答を提供します。

🚀 LongCite-llama3.1-8b

LongCite-llama3.1-8bは、Meta-Llama-3.1-8Bをベースに学習されたモデルで、長文コンテキストの質問応答において、細粒度の引用を生成することができます。このモデルは最大128Kトークンのコンテキストウィンドウをサポートしています。

📚 [LongCiteデータセット] • 💻 [Githubリポジトリ] • 📃 [LongCite論文]

🚀 クイックスタート

環境: transforemrs>=4.43.0。

このモデルをデプロイするための簡単なデモは以下の通りです。

基本的な使用法

import json
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained('THUDM/LongCite-llama3.1-8b', trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained('THUDM/LongCite-llama3.1-8b', torch_dtype=torch.bfloat16, trust_remote_code=True, device_map='auto')

context = '''
W. Russell Todd, 94, United States Army general (b. 1928). February 13. Tim Aymar, 59, heavy metal singer (Pharaoh) (b. 1963). Marshall \"Eddie\" Conway, 76, Black Panther Party leader (b. 1946). Roger Bonk, 78, football player (North Dakota Fighting Sioux, Winnipeg Blue Bombers) (b. 1944). Conrad Dobler, 72, football player (St. Louis Cardinals, New Orleans Saints, Buffalo Bills) (b. 1950). Brian DuBois, 55, baseball player (Detroit Tigers) (b. 1967). Robert Geddes, 99, architect, dean of the Princeton University School of Architecture (1965–1982) (b. 1923). Tom Luddy, 79, film producer (Barfly, The Secret Garden), co-founder of the Telluride Film Festival (b. 1943). David Singmaster, 84, mathematician (b. 1938).
'''
query = "What was Robert Geddes' profession?"
result = model.query_longcite(context, query, tokenizer=tokenizer, max_input_length=128000, max_new_tokens=1024)

print("Answer:\n{}\n".format(result['answer']))
print("Statement with citations:\n{}\n".format(
  json.dumps(result['statements_with_citations'], indent=2, ensure_ascii=False)))
print("Context (divided into sentences):\n{}\n".format(result['splited_context']))

また、vllmを使ってモデルをデプロイすることもできます。詳細なコード例は、vllm_inference.pyを参照してください。

📄 ライセンス

Llama-3.1 License

引用

もしこの研究が役に立った場合は、LongCiteを引用していただけると幸いです。

@article{zhang2024longcite,
  title = {LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA} 
  author={Jiajie Zhang and Yushi Bai and Xin Lv and Wanjun Gu and Danqing Liu and Minhao Zou and Shulin Cao and Lei Hou and Yuxiao Dong and Ling Feng and Juanzi Li},
  journal={arXiv preprint arXiv:2409.02897},
  year={2024}
}