finetuned-t5-xsumオープンソーステキスト要約モデル - 無料で迅速かつ正確な内容要点の抽出をサポート

ホーム

Finetuned T5 Xsum

Lakshan2003によって開発

T5-smallモデルをベースに、XSumデータセットでLoRA技術を使用してファインチューニングしたテキスト要約モデル

テキスト生成

Safetensors

英語オープンソースライセンス:Apache-2.0 #LoRAファインチューニング #英語要約 #軽量T5

ダウンロード数 22

リリース時間 : 2/9/2025

モデル概要

このモデルはLoRA（低ランク適応）技術を使用してT5-smallをファインチューニングし、特にニュース記事などの抽象的な要約タスクに適した高品質なテキスト要約を生成するために設計されています。

モデル特徴

LoRAファインチューニング技術

低ランク適応技術を使用した効率的なファインチューニングにより、モデル性能を維持しながらトレーニングパラメータを大幅に削減

専門的な要約能力

XSumニュース要約データセットで最適化されており、簡潔で正確な抽象的要約を生成するのに優れている

軽量デプロイメント

T5-smallアーキテクチャに基づき、リソースが限られた環境でのデプロイに適している

モデル能力

テキスト要約生成

ニュースコンテンツの抽出

長文テキストの圧縮

使用事例

ニュースメディア

ニュース自動要約

ニュース機関向けに記事の要点を自動生成

簡潔で正確なニュース要約を生成し、編集時間を節約

コンテンツ分析

レポート自動要約

長文の研究レポートから実行概要を生成

キー情報を迅速に抽出し、読解効率を向上

🚀 ローラー微調整済みXSum-T5要約器

このモデルは、xsumデータセットで t5-small を微調整したバージョンです。テキスト要約に最適化された、T5-smallのLoRA（低ランク適応）微調整版です。

✨ 主な機能

このモデルは、XSumデータセットを使用して抽象的な要約のために訓練された、T5-smallのLoRA（低ランク適応）微調整版です。テキスト要約タスクに最適化されています。

📦 インストール

このREADMEには具体的なインストール手順が記載されていないため、このセクションは省略されます。

💻 使用例

基本的な使用法

from peft import PeftModel
from transformers import AutoModelForSeq2SeqLM
from transformers import AutoTokenizer
import torch

base_model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")
my_model = PeftModel.from_pretrained(base_model, "Lakshan2003/finetuned-t5-xsum")

def test_peft_summarizer(text, model, max_length=128, min_length=30):
    """
    Test the PEFT-loaded summarization model
    
    Args:
        text (str): Input text to summarize
        model: The loaded PEFT model
        max_length (int): Maximum length of the summary
        min_length (int): Minimum length of the summary
    """
    # Load tokenizer for t5-small (base model)
    tokenizer = AutoTokenizer.from_pretrained("Lakshan2003/finetuned-t5-xsum")
    
    # Move model to GPU if available
    device = "cuda" if torch.cuda.is_available() else "cpu"
    model = model.to(device)
    
    # Prepare the input text
    prefix = "summarize: "
    input_text = prefix + text
    
    # Tokenize
    inputs = tokenizer(input_text, return_tensors="pt", max_length=512, truncation=True)
    inputs = {k: v.to(device) for k, v in inputs.items()}
    
    # Generate summary
    with torch.no_grad():
        output_ids = model.generate(
            input_ids=inputs["input_ids"],
            attention_mask=inputs["attention_mask"],
            max_length=max_length,
            min_length=min_length,
            num_beams=4,
            length_penalty=2.0,
            early_stopping=True,
            no_repeat_ngram_size=3
        )
    
    # Decode the summary
    summary = tokenizer.decode(output_ids[0], skip_special_tokens=True)
    
    return summary

# Test text
test_text = """
The United Nations has warned that climate change poses an unprecedented threat to human civilization. In a landmark report, scientists detailed how rising temperatures are affecting everything from weather patterns to food production. The report emphasizes that without immediate and substantial action to reduce greenhouse gas emissions, the world faces severe consequences including rising sea levels, more frequent extreme weather events, and widespread ecosystem collapse. Many countries have pledged to reduce their carbon emissions, but experts say current commitments fall short of what's needed to prevent the worst impacts of climate change. The report also highlights the disproportionate effect of climate change on developing nations, which often lack the resources to adapt to changing conditions.
"""

# Generate summary
summary = test_peft_summarizer(test_text, my_model)

print("Original Text:")
print(test_text)
print("\nGenerated Summary:")
print(summary)