survey-finetuned-TinyLlama-1.1B-Chat-v1.0オープンソースモデル - 異業種のアンケート回答を簡単に生成

ホーム

Survey Finetuned TinyLlama 1.1B Chat V1.0

aryashah00によって開発

TinyLlamaをファインチューニングしたアンケート回答生成モデルで、複数分野にわたる合成調査回答生成に最適化されています

大規模言語モデル

Transformers

英語オープンソースライセンス:MIT #アンケート生成 #ロールプレイング回答 #合成データ生成

ダウンロード数 99

リリース時間 : 4/10/2025

モデル概要

このモデルはTinyLlama-1.1B-Chatをファインチューニングしたバージョンで、特定の人物ロールを含むカスタムアンケート回答データセットを用いて命令ファインチューニングされており、様々な人物ロールの合成調査回答生成に適しています

モデル特徴

多分野アンケート回答生成

医療健康、教育など10分野をカバーし、特定の人物ロールに合ったアンケート回答を生成可能

ロールプレイング能力

詳細な人物ロール記述に基づき、そのロール特性に合ったアンケート回答を生成可能

パラメータ効率的なファインチューニング

LoRA手法を採用したファインチューニング（r=16, alpha=32, dropout=0.05）

モデル能力

テキスト生成

対話システム

アンケート生成

合成データ生成

使用事例

市場調査

医療満足度調査

様々な医療従事者ロールの医療サービスに対する評価フィードバックを生成

特定医療ロールの視点に合った詳細な回答を生成可能

教育評価

教育効果評価

学生、教師など異なるロールの教育効果に対するフィードバックを生成

様々な教育ロールの回答パターンをシミュレート可能

🚀 aryashah00/survey-finetuned-TinyLlama-1.1B-Chat-v1.0

このモデルは、複数のドメインにわたる合成アンケート回答の生成に最適化された、TinyLlama/TinyLlama-1.1B-Chat-v1.0 のファインチューニング版です。独自のアンケート回答データセットを使用して命令調整されており、各回答は特定の人物像を反映しています。

🚀 クイックスタート

このモデルは、異なる人物像からの合成アンケート回答を生成するために特別に設計されています。以下の情報を提供すると最適に動作します。

詳細な人物像の説明
特定のアンケートの質問

✨ 主な機能

複数ドメインの合成アンケート回答生成に最適化
独自データセットを用いた命令調整により、特定の人物像を反映した回答を生成

📦 インストール

このモデルを使用するには、transformers ライブラリが必要です。以下のコマンドでインストールできます。

pip install transformers

💻 使用例

基本的な使用法

from transformers import AutoModelForCausalLM, AutoTokenizer

# Load model and tokenizer
model = AutoModelForCausalLM.from_pretrained("aryashah00/survey-finetuned-TinyLlama-1.1B-Chat-v1.0", device_map="auto", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("aryashah00/survey-finetuned-TinyLlama-1.1B-Chat-v1.0", trust_remote_code=True)

# Define persona and question
persona = "A nurse who educates the child about modern medical treatments and encourages a balanced approach to healthcare"
question = "How often was your pain well controlled during this hospital stay?"

# Prepare prompts
system_prompt = f"You are embodying the following persona: {persona}"
user_prompt = f"Survey Question: {question}\n\nPlease provide your honest and detailed response to this question."

# Create message format
messages = [
    {"role": "system", "content": system_prompt},
    {"role": "user", "content": user_prompt}
]

# Apply chat template
input_text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

# Tokenize
input_ids = tokenizer(input_text, return_tensors="pt").input_ids.to(model.device)

# Generate response
import torch
with torch.no_grad():
    output_ids = model.generate(
        input_ids=input_ids,
        max_new_tokens=256,
        temperature=0.7,
        top_p=0.9,
        do_sample=True
    )

# Decode
output = tokenizer.decode(output_ids[0], skip_special_tokens=True)

# Extract just the generated response
response_start = output.find(input_text) + len(input_text)
generated_response = output[response_start:].strip()

print(f"Generated response: {generated_response}")

高度な使用法

import requests

API_URL = "https://api-inference.huggingface.co/models/aryashah00/survey-finetuned-TinyLlama-1.1B-Chat-v1.0"
headers = {"Authorization": "Bearer YOUR_API_KEY"}

def query(payload):
    response = requests.post(API_URL, headers=headers, json=payload)
    return response.json()

messages = [
    {"role": "system", "content": "You are embodying the following persona: A nurse who educates the child about modern medical treatments and encourages a balanced approach to healthcare"},
    {"role": "user", "content": "Survey Question: How often was your pain well controlled during this hospital stay?\n\nPlease provide your honest and detailed response to this question."}
]

output = query({"inputs": messages})
print(output)

📚 ドキュメント

モデルの詳細

Property	Details
モデルタイプ	ファインチューニングされた言語モデル
訓練データ	約3,000件のサンプル、10ドメイン（医療、教育など）、ChatML命令形式
訓練方法	LoRAを用いたパラメータ効率的ファインチューニング
LoRAパラメータ	r=16, alpha=32, dropout=0.05
訓練設定	バッチサイズ: 8、学習率: 0.0002、エポック数: 5