Open-source CogVideoX-5b and pika-dissolve-v0: convert text to video and generate object-dissolve effects for free

Pika Dissolve V0

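Developed by finetrainers

pika-dissolve-v0 is a fine-tuned version of CogVideoX-5b, a text-to-video diffusion model. The fine-tuning gives it the ability to generate object-dissolve effects.

Text-to-video | Open source | License: other | #object-dissolve-effect #high-speed-photography-simulation #fine-particle-dynamics

Downloads: 75

Release date: 1/14/2025

Model overview

This model generates high-quality video from text descriptions and is particularly good at dynamic effects in which an object gradually dissolves or disintegrates.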

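Model features

High-resolution video generation

Generates video at a resolution of 512×768 (height × width, matching the example code)

Precise effect rendering

Particularly good at fine dynamic effects such as objects dissolving or disintegrating

Long-sequence generation

Supports generating sequences of 81 frames (about 3.2 seconds at the 25 fps used in the examples)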
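To make these numbers concrete, the sketch below derives the clip length and an approximate latent size from the settings used in this card's examples (81 frames, 512×768, 25 fps). The compression factors (4× temporal, 8× spatial) are assumptions based on how the CogVideoX VAE is commonly described, and `clip_stats` is a hypothetical helper, not part of diffusers.

```python
def clip_stats(num_frames=81, height=512, width=768, fps=25,
               temporal_factor=4, spatial_factor=8):
    """Return the clip duration and approximate latent dimensions.

    temporal_factor and spatial_factor are assumed VAE compression
    ratios; adjust them if the actual model config differs.
    """
    duration_s = num_frames / fps
    # Assumption: the first frame is encoded on its own and the
    # remaining frames are compressed in groups of temporal_factor.
    latent_frames = (num_frames - 1) // temporal_factor + 1
    return {
        "duration_s": duration_s,
        "latent_frames": latent_frames,
        "latent_height": height // spatial_factor,
        "latent_width": width // spatial_factor,
    }


stats = clip_stats()
print(stats)
```

Under these assumptions, frame counts of the form 4k + 1 (such as 81) divide evenly in time, and heights and widths that are multiples of 8 divide evenly in space.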

Model capabilities

Text-to-video generation

Effects video production

Dynamic scene simulation

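Use cases

Creative content production

Object-dissolve effects

Generates dynamic videos in which various objects gradually dissolve or disintegrate

The examples show objects such as a glass vase and a paper crane dissolving gracefully

Advertising effects production

Creates unique object-transformation effects for ad creatives

Art creation

Digital art expression

Provides digital artists with a tool for visualizing creative ideas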

🚀 PIKA_DISSOLVE model

This model is [THUDM/CogVideoX-5b](https://huggingface.co/THUDM/CogVideoX-5b) fine-tuned on the [modal-labs/dissolve](https://huggingface.co/datasets/modal-labs/dissolve) dataset. It generates video from text and can render dissolve scenes for a variety of objects.

🚀 Quick start

To use this model, follow the steps below: import the required libraries and load the model, then set a prompt and generate the video.

from diffusers import CogVideoXTransformer3DModel, DiffusionPipeline
from diffusers.utils import export_to_video
import torch

# Load the fine-tuned transformer weights in bfloat16
transformer = CogVideoXTransformer3DModel.from_pretrained(
    "sayakpaul/pika-dissolve-v0", torch_dtype=torch.bfloat16
)
# Build the CogVideoX-5b pipeline around the fine-tuned transformer and move it to the GPU
pipeline = DiffusionPipeline.from_pretrained(
    "THUDM/CogVideoX-5b", transformer=transformer, torch_dtype=torch.bfloat16
).to("cuda")

prompt = """
PIKA_DISSOLVE A slender glass vase, brimming with tiny white pebbles, stands centered on a polished ebony dais. Without warning, the glass begins to dissolve from the edges inward. Wisps of translucent dust swirl upward in an elegant spiral, illuminating each pebble as they drop onto the dais. The gently drifting dust eventually settles, leaving only the scattered stones and faint traces of shimmering powder on the stage.
"""
negative_prompt = "inconsistent motion, blurry motion, worse quality, degenerate outputs, deformed outputs"

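# Generate 81 frames at 512x768 (height x width) with 50 denoising steps
video = pipeline(
    prompt=prompt,
    negative_prompt=negative_prompt,
    num_frames=81,
    height=512,
    width=768,
    num_inference_steps=50
).frames[0]
# Write the frames to an MP4 file at 25 frames per second
export_to_video(video, "output_vase.mp4", fps=25)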
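If the pipeline does not fit in GPU memory, or if repeatable results are needed, the quick start can be adapted along these lines. This is only a sketch: `load_pipeline`, `generation_kwargs`, and `generate` are hypothetical wrappers, the default seed is arbitrary, and whether CPU offload and VAE tiling help depends on your hardware. `enable_model_cpu_offload`, `vae.enable_tiling`, and the `generator` argument are existing diffusers features.

```python
def load_pipeline(offload=True):
    """Load the fine-tuned transformer into the CogVideoX-5b pipeline."""
    # Imported lazily so the helpers below can be used without a GPU stack.
    import torch
    from diffusers import CogVideoXTransformer3DModel, DiffusionPipeline

    transformer = CogVideoXTransformer3DModel.from_pretrained(
        "sayakpaul/pika-dissolve-v0", torch_dtype=torch.bfloat16
    )
    pipeline = DiffusionPipeline.from_pretrained(
        "THUDM/CogVideoX-5b", transformer=transformer, torch_dtype=torch.bfloat16
    )
    if offload:
        # Keep submodules on the CPU until they are needed (requires accelerate).
        pipeline.enable_model_cpu_offload()
        # Decode the video in tiles to lower peak VAE memory.
        pipeline.vae.enable_tiling()
    else:
        pipeline.to("cuda")
    return pipeline


def generation_kwargs(prompt, negative_prompt):
    """Collect the call arguments used in the quick start."""
    return {
        "prompt": prompt,
        "negative_prompt": negative_prompt,
        "num_frames": 81,
        "height": 512,
        "width": 768,
        "num_inference_steps": 50,
    }


def generate(pipeline, prompt, negative_prompt, seed=0):
    """Run the pipeline with a fixed seed so the same seed repeats the same noise."""
    import torch

    kwargs = generation_kwargs(prompt, negative_prompt)
    kwargs["generator"] = torch.Generator(device="cpu").manual_seed(seed)
    return pipeline(**kwargs).frames[0]
```

A call such as `generate(load_pipeline(), prompt, negative_prompt, seed=42)` would then stand in for the direct `pipeline(...)` call, and the returned frames can be passed to `export_to_video` as before.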

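✨ Key features

Text-to-video generation: generates a video of an object dissolving from the text you provide.
Varied scenes: can render dissolve scenes for many kinds of objects, such as a snowball, a teacup, a wooden mask, and a glass vase.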
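Every example prompt in this card starts with the token `PIKA_DISSOLVE`. Assuming that token acts as the trigger phrase for the fine-tuned effect (the card does not state this explicitly), a small hypothetical helper can make sure it is always present:

```python
TRIGGER = "PIKA_DISSOLVE"


def with_trigger(prompt, trigger=TRIGGER):
    """Prefix the prompt with the trigger token unless it is already there."""
    text = prompt.strip()
    if text.startswith(trigger):
        return text
    return f"{trigger} {text}"


print(with_trigger("A snowball on a wooden table slowly crumbles into fine powder."))
```

The helper also strips leading and trailing whitespace, which is convenient for triple-quoted prompts like the one in the quick start.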

📦 Installation

This model requires the diffusers library, which can be installed with the following command. The quick start code also imports torch.

pip install diffusers
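In practice the CogVideoX pipeline usually needs a few packages beyond diffusers. The list below is an assumption, not something this card specifies: `torch`, `transformers` and `sentencepiece` for the T5 text encoder, `accelerate` for model loading and offloading, and `imageio` with `imageio_ffmpeg` for `export_to_video`. A short check such as the following, built around a hypothetical `missing_packages` helper, reports which of them cannot be imported:

```python
import importlib.util

# Assumed dependency list (import names, which may differ from pip names).
ASSUMED_PACKAGES = [
    "diffusers",
    "torch",
    "transformers",
    "sentencepiece",
    "accelerate",
    "imageio",
    "imageio_ffmpeg",
]


def missing_packages(names=ASSUMED_PACKAGES):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_packages()
if missing:
    print("Not installed:", ", ".join(missing))
else:
    print("All assumed dependencies are importable.")
```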

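💻 Usage examples

Basic usage

The basic usage is the quick start code above, which generates a glass vase dissolving. To generate a different scene, such as the teacup, replace the prompt.

# See the quick start code above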

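Advanced usage

You can use multiple prompts to generate different scenes.

# Define one prompt per scene (reuses `pipeline` and `negative_prompt` from the quick start)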
prompts = [
    "PIKA_DISSOLVE A meticulously detailed, tea cup, sits centrally on a dark brown circular pedestal. The cup, seemingly made of clay, begins to dissolve from the bottom up. The disintegration process is rapid but not explosive, with a cloud of fine, light tan dust forming and rising in a swirling, almost ethereal column that expands outwards before slowly descending. The dust particles are individually visible as they float, and the overall effect is one of delicate disintegration rather than shattering. Finally, only the empty pedestal and the intricately patterned marble floor remain.",
    "PIKA_DISSOLVE Resting quietly atop an ancient stone altar, a delicately carved wooden mask starts to crumble from its outer edges. The intricate patterns crack and give way, releasing a fine, smoke-like plume of mahogany-hued particles that dance upwards, then disperse gradually into the hushed atmosphere. As the dust descends, the once captivating mask is reduced to an outline on the weathered altar."
]

for prompt in prompts:
    video = pipeline(
        prompt=prompt, 
        negative_prompt=negative_prompt, 
        num_frames=81, 
        height=512,
        width=768,
        num_inference_steps=50
    ).frames[0]
    # Save each video, named after the first 20 characters of its prompt
    video_name = prompt[:20].replace(" ", "_") + ".mp4"
    export_to_video(video, video_name, fps=25)
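Taking the first 20 characters of the prompt works for the two prompts above, but because every prompt starts with `PIKA_DISSOLVE`, only about six characters distinguish one file from another, and punctuation can end up in the file name. One possible alternative, using a hypothetical `video_filename` helper that is not part of diffusers, combines an index with a cleaned-up slug:

```python
import re


def video_filename(index, prompt, trigger="PIKA_DISSOLVE", max_words=4):
    """Build a file name from an index and the first few words of the prompt."""
    text = prompt.strip()
    # Drop the shared trigger token so the slug reflects the scene itself.
    if text.startswith(trigger):
        text = text[len(trigger):]
    # Keep only lowercase alphanumeric words; punctuation is discarded.
    words = re.findall(r"[a-z0-9]+", text.lower())
    slug = "_".join(words[:max_words]) or "video"
    return f"{index:02d}_{slug}.mp4"


print(video_filename(0, "PIKA_DISSOLVE A slender glass vase, brimming with tiny white pebbles"))
```

In the loop above this would be used as `for i, prompt in enumerate(prompts):` followed by `export_to_video(video, video_filename(i, prompt), fps=25)`. The index keeps names unique even when two prompts begin with the same words.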

📚 Documentation

Base model: THUDM/CogVideoX-5b
Dataset: modal-labs/dissolve
Library: diffusers
License: [other](https://huggingface.co/THUDM/CogVideoX-5b/blob/main/LICENSE)

Widget output examples

Input text	Output video
PIKA_DISSOLVE A meticulously detailed, tea cup, sits centrally on a dark brown circular pedestal. The cup, seemingly made of clay, begins to dissolve from the bottom up. The disintegration process is rapid but not explosive, with a cloud of fine, light tan dust forming and rising in a swirling, almost ethereal column that expands outwards before slowly descending. The dust particles are individually visible as they float, and the overall effect is one of delicate disintegration rather than shattering. Finally, only the empty pedestal and the intricately patterned marble floor remain.	output_cup.mp4
PIKA_DISSOLVE Resting quietly atop an ancient stone altar, a delicately carved wooden mask starts to crumble from its outer edges. The intricate patterns crack and give way, releasing a fine, smoke-like plume of mahogany-hued particles that dance upwards, then disperse gradually into the hushed atmosphere. As the dust descends, the once captivating mask is reduced to an outline on the weathered altar.	output_altar.mp4
PIKA_DISSOLVE A slender glass vase, brimming with tiny white pebbles, stands centered on a polished ebony dais. Without warning, the glass begins to dissolve from the edges inward. Wisps of translucent dust swirl upward in an elegant spiral, illuminating each pebble as they drop onto the dais. The gently drifting dust eventually settles, leaving only the scattered stones and faint traces of shimmering powder on the stage.	output_vase.mp4
PIKA_DISSOLVE On a narrow marble ledge, a gracefully folded paper crane rests, its surface marked by delicate ink lines. It starts to fragment from the tail feathers outward, releasing a cloud of feather-light pulp fibers. Suspended for a moment in a magical swirl, the fibers drift back down, cloaking the ledge in a near-transparent veil of white. Then the ledge stands empty, the crane’s faint silhouette lingering in memory.	output_marble.mp4