vfusion3d Open-Source 3D Generation Model - Achieve Scalable 3D Generation and Reconstruction with Minimal 3D Data

Vfusion3d

Developed by jadechoghari

VFusion3D is a large-scale feed-forward 3D generation model trained with limited 3D data and extensive synthetic multi-view data, exploring scalable 3D generation/reconstruction models.

3D Vision

Transformers

#Video to 3D #Multi-view Synthesis #Scalable 3D Generation

Downloads 249

Release Time : 7/31/2024

Model Overview

VFusion3D is a large 3D generation model learned from video diffusion models, supporting 3D content generation from a single image, marking an important step toward building a 3D foundation model.

Model Features

Scalable 3D Generation

Trained with limited 3D data and extensive synthetic multi-view data to achieve scalable 3D generation capabilities.

Multi-format Output

Supports output of 3D planar data, mesh files (.obj), and multi-view rendered videos.

Efficient Inference

Feed-forward architecture enables fast 3D content generation.

Model Capabilities

Single-image 3D Reconstruction

3D Mesh Generation

Multi-view Video Rendering

3D Content Generation

Use Cases

3D Content Creation

Virtual Character Modeling

Generate 3D models from a single character image.

Produces editable 3D meshes and rotation showcase videos.

Product Showcase

Convert product photos into 3D models.

Supports multi-angle viewing of product details.

Game Development

Rapid Prototyping

Quickly generate 3D assets for games.

Shortens the 3D modeling workflow.

🚀 [ECCV 2024] VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models

VFusion3D is a large, feed - forward 3D generative model. It is trained with a small amount of 3D data and a large volume of synthetic multi - view data, exploring scalable 3D generative/reconstruction models as a step towards a 3D foundation.

Porject page, Paper link

VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Junlin Han, Filippos Kokkinos, Philip Torr
GenAI, Meta and TVG, University of Oxford
European Conference on Computer Vision (ECCV), 2024

🚀 Quick Start

Getting started with VFusion3D is super easy! 🤗 Here’s how you can use the model with Hugging Face:

📦 Installation

Install Dependencies (Optional)

Depending on your needs, you may want to enable specific features like mesh generation or video rendering. We've got you covered with these additional packages:

!pip --quiet install imageio[ffmpeg] PyMCubes trimesh rembg[gpu,cli] kiui

💻 Usage Examples

Basic Usage

import torch
from transformers import AutoModel, AutoProcessor

# load the model and processor
model = AutoModel.from_pretrained("jadechoghari/vfusion3d", trust_remote_code=True)
processor = AutoProcessor.from_pretrained("jadechoghari/vfusion3d")

# download and preprocess the image
import requests
from PIL import Image
from io import BytesIO

image_url = 'https://sm.ign.com/ign_nordic/cover/a/avatar-gen/avatar-generations_prsz.jpg'
response = requests.get(image_url)
image = Image.open(BytesIO(response.content))

# preprocess the image and get the source camera 
image, source_camera = processor(image)


# generate planes (default output)
output_planes = model(image, source_camera)
print("Planes shape:", output_planes.shape)

# generate a 3D mesh
output_planes, mesh_path = model(image, source_camera, export_mesh=True)
print("Planes shape:", output_planes.shape)
print("Mesh saved at:", mesh_path)

# Generate a video
output_planes, video_path = model(image, source_camera, export_video=True)
print("Planes shape:", output_planes.shape)
print("Video saved at:", video_path)

Default (Planes): By default, VFusion3D outputs planes—ideal for further 3D operations.
Export Mesh: Want a 3D mesh? Just set export_mesh=True, and you'll get a .obj file ready to roll. You can also customize the mesh resolution by adjusting the mesh_size parameter.
Export Video: Fancy a 3D video? Set export_video=True, and you'll receive a beautifully rendered video from multiple angles. You can tweak render_size and fps to get the video just right.

Check out our demo app to see VFusion3D in action! 🤗

✨ Features

3D Generation Results

User Study Results

📚 Documentation

Acknowledgement

This inference code of VFusion3D heavily borrows from OpenLRM.

Citation

If you find this work useful, please cite us:

@article{han2024vfusion3d,
  title={VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models},
  author={Junlin Han and Filippos Kokkinos and Philip Torr},
  journal={European Conference on Computer Vision (ECCV)},
  year={2024}
}

📄 License

The majority of VFusion3D is licensed under CC - BY - NC, however portions of the project are available under separate license terms: OpenLRM as a whole is licensed under the Apache License, Version 2.0, while certain components are covered by NVIDIA's proprietary license.
The model weights of VFusion3D is also licensed under CC - BY - NC.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご