crush - smol - v0オープンソースモデル - 油圧プレスで物体を押しつぶす面白い動画を無料で生成

ホーム

Crush Smol V0

finetrainersによって開発

THUDM/CogVideoX-5bモデルをcrush-smolデータセットで微調整したバージョンで、物体が油圧プレスで粉砕される動画コンテンツの生成に特化

テキスト生成ビデオオープンソースライセンス:その他 #油圧プレス効果生成 #物体粉砕シミュレーション #高ダイナミックレンジ動画生成

ダウンロード数 94

リリース時間 : 1/27/2025

モデル概要

これはテキストから動画を生成するモデルで、特に物体が大型金属シリンダーで粉砕される高品質な動画クリップの生成に優れています

モデル特徴

油圧プレス粉砕効果

物体が油圧プレスで粉砕されるシーンに最適化された動画生成能力

高品質動画出力

512x768解像度、25fpsの滑らかな動画を生成可能

LoRAサポート

64ランクのLoRAバリアントを提供し、軽量な展開と使用が容易

モデル能力

テキストから動画変換

特定シーン動画生成

物理現象シミュレーション

使用事例

特殊効果動画制作

油圧プレス粉砕効果

様々な物体が油圧プレスで粉砕される特殊効果動画を生成

例ではキャンドル、電球、ハンバーガーが粉砕されるリアルな効果を展示

教育デモンストレーション

物理現象デモ

物体が圧力で変形する過程を展示する物理教材用

🚀 THUDM/CogVideoX-5b モデルのファインチューニング

このプロジェクトは、THUDM/CogVideoX-5b モデルを finetrainers/crush-smol データセットでファインチューニングしたものです。また、パラメータのLoRAバリアントも提供しています。こちらで確認できます。

🚀 クイックスタート

このモデルは、特定のデータセットでファインチューニングされたもので、テキストから動画を生成することができます。以下のコードを使って、モデルを使用することができます。

推論コード

from diffusers import CogVideoXTransformer3DModel, DiffusionPipeline 
from diffusers.utils import export_to_video
import torch 

transformer = CogVideoXTransformer3DModel.from_pretrained(
    "finetrainers/crush-smol-v0", torch_dtype=torch.bfloat16
)
pipeline = DiffusionPipeline.from_pretrained(
    "THUDM/CogVideoX-5b", transformer=transformer, torch_dtype=torch.bfloat16
).to("cuda")

prompt = """
DIFF_crush A thick burger is placed on a dining table, and a large metal cylinder descends from above, crushing the burger as if it were under a hydraulic press. The bulb is crushed, leaving a pile of debris around it.
"""
negative_prompt = "inconsistent motion, blurry motion, worse quality, degenerate outputs, deformed outputs"

video = pipeline(
    prompt=prompt, 
    negative_prompt=negative_prompt, 
    num_frames=81, 
    height=512,
    width=768,
    num_inference_steps=50
).frames[0]
export_to_video(video, "output.mp4", fps=25)

トレーニングログ

トレーニングログは、WandB こちらで確認できます。

✨ 主な機能

テキストから動画生成：入力されたテキストに基づいて動画を生成します。
LoRAバリアント：ファインチューニングされたチェックポイントからLoRAを抽出し、同様の効果をエミュレートすることができます。

📦 インストール

このモデルを使用するには、必要なライブラリをインストールする必要があります。以下のコードを参考にしてください。

pip install diffusers torch

📚 ドキュメント

モデル情報

属性	詳情
モデルタイプ	THUDM/CogVideoX-5b のファインチューニングモデル
トレーニングデータ	finetrainers/crush-smol
ライブラリ名	diffusers
ライセンス	other

LoRAの使用方法

ファインチューニングされたチェックポイントから64ランクのLoRAを抽出しました（スクリプトはこちら）。このLoRA を使って、同様の効果をエミュレートすることができます。

コード

from diffusers import DiffusionPipeline 
from diffusers.utils import export_to_video
import torch 

pipeline = DiffusionPipeline.from_pretrained("THUDM/CogVideoX-5b", torch_dtype=torch.bfloat16).to("cuda")
pipeline.load_lora_weights("finetrainers/cakeify-v0", weight_name="extracted_crush_smol_lora_64.safetensors")

prompt = """
DIFF_crush A thick burger is placed on a dining table, and a large metal cylinder descends from above, crushing the burger as if it were under a hydraulic press. The bulb is crushed, leaving a pile of debris around it.
"""
negative_prompt = "inconsistent motion, blurry motion, worse quality, degenerate outputs, deformed outputs"

video = pipeline(
    prompt=prompt, 
    negative_prompt=negative_prompt, 
    num_frames=81, 
    height=512,
    width=768,
    num_inference_steps=50
).frames[0]
export_to_video(video, "output_lora.mp4", fps=25)

重要な注意事項

⚠️ 重要提示

これは実験的なチェックポイントであり、汎化性能が低いことはよく知られています。

コードリポジトリ

コードはこちらで公開されています。

ウィジェット

入力テキスト: DIFF_crush A red candle is placed on a metal platform, and a large metal cylinder descends from above, flattening the candle as if it were under a hydraulic press. The candle is crushed into a flat, round shape, leaving a pile of debris around it.
- 出力動画: ./assets/output_0.mp4
入力テキスト: DIFF_crush A bulb is placed on a wooden platform, and a large metal cylinder descends from above, crushing the bulb as if it were under a hydraulic press. The bulb is crushed into a flat, round shape, leaving a pile of debris around it.
- 出力動画: ./assets/output_1.mp4
入力テキスト: DIFF_crush A thick burger is placed on a dining table, and a large metal cylinder descends from above, crushing the burger as if it were under a hydraulic press. The bulb is crushed, leaving a pile of debris around it.
- 出力動画: ./assets/output_2.mp4