Llama-3.1-8B-VaaniSetu-EN2PA: a free, open-source model for high-accuracy English-to-Punjabi translation

Llama 3.1 8B VaaniSetu EN2PA

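Developed by partex-nv

An English-to-Punjabi translation model fine-tuned from the LLaMA 3.1 8B architecture and trained on the Bharat parallel corpus.

Machine Translation

Safetensors

Multilingual #English-Punjabi translation #Judicial document translation #LLaMA 3.1 fine-tuning

Downloads: 48

Released: 9/25/2024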

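Model Overview

This model is designed specifically for English-to-Punjabi translation. It is suited to documents such as judicial documents and government orders, and is intended to serve Punjabi speakers.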

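Model Features

High-quality translation

Trained on 10 million English<>Punjabi parallel sentence pairs to provide high-quality translations.

Open-source model

Fills a gap in open-source English-to-Punjabi translation models.

Adapted to specialized domains

Particularly suited to translating specialized documents such as judicial documents and government orders.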

Model Capabilities

English-to-Punjabi translation

Text generation

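Use Cases

Document translation

Judicial document translation

Translates English judicial documents into Punjabi.

Government order translation

Translates English government orders into Punjabi.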

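🚀 🦙📝 LLAMA-VaaniSetu-EN2PA: English-to-Punjabi Translation with Large Language Models

LLAMA-VaaniSetu-EN2PA is a fine-tuned version of the LLaMA 3.1 8B architecture designed specifically for English-to-Punjabi translation. It was trained on the Bharat Parallel Corpus Collection (BPCC), published by AI4Bharat, which contains about 10 million English<>Punjabi pairs.

The model aims to address the lack of open-source English-to-Punjabi translation models. Its intended application is translating judicial documents, government orders, court judgments, and similar documents for Punjabi-speaking people.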

🚀 Quick Start

To use this model, first install the required dependencies. You can then translate English to Punjabi with the code example provided in the usage section.

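✨ Key Features

English-to-Punjabi translation: a purpose-built fine-tuned model that provides high-quality translations.
Large-scale training data: trained on BPCC, which contains about 10 million English<>Punjabi pairs.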

📦 Installation

Install the following dependencies to use this model.

pip install torch transformers accelerate huggingface_hub
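
Before loading an 8B model, it can help to confirm that the packages are importable. The following is a minimal sketch using only the Python standard library. The helper name `missing_packages` is hypothetical, and `accelerate` is included on the assumption that the model is loaded with `device_map="auto"`, as in the usage example.

```python
import importlib.util


def missing_packages(names):
    """Return the subset of `names` that cannot be found on the import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Package list is an assumption based on the install command and device_map="auto".
required = ["torch", "transformers", "accelerate", "huggingface_hub"]
missing = missing_packages(required)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages are installed.")
```

This checks only that each package can be located; it does not verify versions or GPU support.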

💻 Usage Examples

Basic usage

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch


# Load model and tokenizer
def load_model():
    tokenizer = AutoTokenizer.from_pretrained("partex-nv/Llama-3.1-8B-VaaniSetu-EN2PA")
    model = AutoModelForCausalLM.from_pretrained(
        "partex-nv/Llama-3.1-8B-VaaniSetu-EN2PA",
        torch_dtype=torch.bfloat16,
        device_map="auto",  # Place weights on available devices (requires accelerate)
    )
    return model, tokenizer

model, tokenizer = load_model()

# Translate English text to Punjabi
def translate_to_punjabi(english_text):
    # Create the prompt
    translate_prompt = """Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.
    
    ### Instruction:
    {}
    
    ### Input:
    {}
    
    ### Response:
    {}"""
    
    # Format the prompt
    formatted_input = translate_prompt.format(
        "You are given the english text, read it and understand it. After reading translate the english text to Punjabi and provide the output strictly",  # Instruction
        english_text,  # Input text to be translated
        ""  # Output - leave blank for generation
    )
    
    # Tokenize the input
    # Move inputs to the device the model was placed on rather than hard-coding "cuda"
    inputs = tokenizer([formatted_input], return_tensors="pt").to(model.device)

    # Generate the translation output
    output_ids = model.generate(**inputs, max_new_tokens=500)

    # Decode the output (the decoded text includes the prompt)
    translated_text = tokenizer.decode(output_ids[0], skip_special_tokens=True)
    # Keep only the text after the final "Response:" marker
    return translated_text.split("Response:")[-1].strip()


english_text = """
Delhi is a beautiful place
"""

punjabi_translation = translate_to_punjabi(english_text)

print(punjabi_translation)
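
The example above translates one short string and caps generation at 500 new tokens, so longer inputs such as judgments or government orders would need to be split before translation. The following is a minimal sketch under stated assumptions: paragraphs are separated by blank lines, and the 1000-character budget is an arbitrary placeholder, not a value from the model card. The names `split_into_chunks` and `translate_document` are hypothetical, and `translate_fn` stands for any function that maps an English string to its translation, such as `translate_to_punjabi`.

```python
def split_into_chunks(text, max_chars=1000):
    """Group blank-line-separated paragraphs into chunks of at most max_chars.

    A single paragraph longer than max_chars is kept whole as its own chunk.
    """
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks, current = [], ""
    for para in paragraphs:
        candidate = f"{current}\n\n{para}" if current else para
        if not current or len(candidate) <= max_chars:
            # Either the chunk is empty or the paragraph still fits: extend it.
            current = candidate
        else:
            # The paragraph does not fit: close the chunk and start a new one.
            chunks.append(current)
            current = para
    if current:
        chunks.append(current)
    return chunks


def translate_document(text, translate_fn, max_chars=1000):
    """Translate each chunk independently and rejoin with blank lines."""
    chunks = split_into_chunks(text, max_chars)
    return "\n\n".join(translate_fn(chunk) for chunk in chunks)
```

With the model loaded, a call might look like `translate_document(long_english_text, translate_to_punjabi)`. Translating chunks independently loses context across chunk boundaries, so the budget is a trade-off between context and output length.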

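📚 Documentation

Model and Data Information

Property	Details
Training data	10 million English<>Punjabi parallel sentences from AI4Bharat's Bharat Parallel Corpus Collection (BPCC)
Evaluation data	1503 samples from the IN22-Conv dataset, available through IndicTrans2
Model architecture	LLaMA 3.1 8B base in BF16 precision
Score (chrF++)	28.1 chrF++ on the IN22-Conv dataset, reported as a strong score for an open-source model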
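
For readers unfamiliar with the metric, chrF is an F-score over character n-grams, and chrF++ additionally includes word n-grams. The following is a simplified sentence-level sketch of plain chrF only (character n-grams up to length 6, beta = 2, precision and recall averaged over n-gram orders). It is meant to illustrate the idea and will not reproduce the 28.1 figure or the output of standard evaluation tools; the function names are hypothetical.

```python
from collections import Counter


def char_ngrams(text, n):
    """Count character n-grams of length n, ignoring spaces."""
    chars = text.replace(" ", "")
    return Counter(chars[i:i + n] for i in range(len(chars) - n + 1))


def simple_chrf(hypothesis, reference, max_n=6, beta=2.0):
    """Simplified chrF on a 0-100 scale (no word n-grams, so not chrF++)."""
    precisions, recalls = [], []
    for n in range(1, max_n + 1):
        hyp = char_ngrams(hypothesis, n)
        ref = char_ngrams(reference, n)
        if not hyp or not ref:
            continue  # One side is too short to have n-grams of this order.
        overlap = sum((hyp & ref).values())  # Clipped n-gram matches
        precisions.append(overlap / sum(hyp.values()))
        recalls.append(overlap / sum(ref.values()))
    if not precisions:
        return 0.0
    p = sum(precisions) / len(precisions)
    r = sum(recalls) / len(recalls)
    if p + r == 0:
        return 0.0
    # F-beta with beta = 2 weights recall more heavily than precision.
    return 100 * (1 + beta ** 2) * p * r / (beta ** 2 * p + r)
```

An identical hypothesis and reference score 100, and strings with no characters in common score 0.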

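GPU Requirements for Inference

Running inference with this model requires at least the following GPU resources.

Memory: inference in BF16 (BFloat16) precision requires 16-18 GB of VRAM.
Recommended GPUs:
- NVIDIA A100 (20GB): well suited to BF16 precision and able to handle large models such as LLaMA 8B efficiently.
- Other GPUs with at least 16 GB of VRAM may also work, but performance will vary with available memory.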
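
The 16-18 GB figure is consistent with a back-of-the-envelope estimate: BF16 stores each parameter in 2 bytes, so the weights of a model with roughly 8 billion parameters take about 16 GB before activations and the KV cache are counted. The following is a sketch of that arithmetic, assuming a round 8 billion parameters (the exact count is not stated here) and decimal gigabytes; the function name is hypothetical.

```python
def estimate_weight_memory_gb(num_params, bytes_per_param=2):
    """Estimate memory for model weights alone, in decimal gigabytes.

    bytes_per_param=2 corresponds to BF16 or FP16; use 4 for FP32.
    Activations and the KV cache are not included.
    """
    return num_params * bytes_per_param / 1e9


# Assumption: a round 8 billion parameters for an "8B" model.
weights_gb = estimate_weight_memory_gb(8_000_000_000)
print(f"{weights_gb:.1f} GB")  # 16.0 GB
```

The remaining 1-2 GB in the stated range would then be headroom for activations and the KV cache, which grow with input and output length.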