Hunyuan3D 2.0 is a large-scale 3D synthesis system comprising the shape generation model Hunyuan3D-DiT and the texture synthesis model Hunyuan3D-Paint. It can generate high-quality textured 3D models and supports editing and animation
Model Features
High-resolution Texture Generation
Capable of generating high-quality 4K resolution texture maps with excellent detail representation
Precise Condition Following
Strong understanding of input image conditions, ensuring highly consistent generation results with inputs
Modular Design
Decoupled shape and texture generation, supporting texture mapping for both generated and manually modeled meshes
User-friendly Platform
Provides the Hunyuan3D-Studio production platform, enabling both professional and non-professional users to edit and animate 3D assets
Model Capabilities
Generate 3D models from single images
Generate 3D models from text descriptions
High-resolution texture synthesis
3D model editing
3D model animation
Use Cases
Game Development
Rapid Game Asset Generation
Automatically generate textured 3D models from concept art
Significantly shortens art asset production cycles
Virtual Reality
VR Scene Construction
Quickly generate 3D objects for virtual environments
Improves scene construction efficiency
E-commerce
Product 3D Display
Convert product images into interactive 3D models
Enhances user experience
🚀 Hunyuan3D-2
Living out everyone’s imagination on creating and manipulating 3D assets.
“ Living out everyone’s imagination on creating and manipulating 3D assets.”
You may follow the next steps to use Hunyuan3D 2.0 via code or the Gradio App.
📦 Installation
Please install Pytorch via the official site. Then install the other requirements via
pip install -r requirements.txt
# for texturecd hy3dgen/texgen/custom_rasterizer
python3 setup.py install
cd ../../..
cd hy3dgen/texgen/differentiable_renderer
bash compile_mesh_painter.sh OR python3 setup.py install (on Windows)
💻 Usage Examples
Basic Usage
We designed a diffusers-like API to use our shape generation model - Hunyuan3D-DiT and texture synthesis model - Hunyuan3D-Paint.
You could assess Hunyuan3D-DiT via:
from hy3dgen.shapegen import Hunyuan3DDiTFlowMatchingPipeline
pipeline = Hunyuan3DDiTFlowMatchingPipeline.from_pretrained('tencent/Hunyuan3D-2')
mesh = pipeline(image='assets/demo.png')[0]
The output mesh is a trimesh object, which you could save to glb/obj (or other format) file.
For Hunyuan3D-Paint, do the following:
from hy3dgen.texgen import Hunyuan3DPaintPipeline
from hy3dgen.shapegen import Hunyuan3DDiTFlowMatchingPipeline
# let's generate a mesh first
pipeline = Hunyuan3DDiTFlowMatchingPipeline.from_pretrained('tencent/Hunyuan3D-2')
mesh = pipeline(image='assets/demo.png')[0]
pipeline = Hunyuan3DPaintPipeline.from_pretrained('tencent/Hunyuan3D-2')
mesh = pipeline(mesh, image='assets/demo.png')
Advanced Usage
Please visit minimal_demo.py for more advanced usage, such as text to 3D and texture generation for handcrafted mesh.
Gradio App
You could also host a Gradio App in your own computer via:
pip3 install gradio==3.39.0
python3 gradio_app.py
Don't forget to visit Hunyuan3D for quick use, if you don't want to host yourself.
✨ Features
Abstract
We present Hunyuan3D 2.0, an advanced large-scale 3D synthesis system for generating high-resolution textured 3D assets.
This system includes two foundation components: a large-scale shape generation model - Hunyuan3D-DiT, and a large-scale
texture synthesis model - Hunyuan3D-Paint.
The shape generative model, built on a scalable flow-based diffusion transformer, aims to create geometry that properly
aligns with a given condition image, laying a solid foundation for downstream applications.
The texture synthesis model, benefiting from strong geometric and diffusion priors, produces high-resolution and vibrant
texture maps for either generated or hand-crafted meshes.
Furthermore, we build Hunyuan3D-Studio - a versatile, user-friendly production platform that simplifies the re-creation
process of 3D assets. It allows both professional and amateur users to manipulate or even animate their meshes
efficiently.
We systematically evaluate our models, showing that Hunyuan3D 2.0 outperforms previous state-of-the-art models,
including the open-source models and closed-source models in geometry details, condition alignment, texture quality, and
e.t.c.
Hunyuan3D 2.0
Architecture
Hunyuan3D 2.0 features a two-stage generation pipeline, starting with the creation of a bare mesh, followed by the
synthesis of a texture map for that mesh. This strategy is effective for decoupling the difficulties of shape and
texture generation and also provides flexibility for texturing either generated or handcrafted meshes.
Performance
We have evaluated Hunyuan3D 2.0 with other open-source as well as close-source 3d-generation methods.
The numerical results indicate that Hunyuan3D 2.0 surpasses all baselines in the quality of generated textured 3D assets
and the condition following ability.
This repository contains the models of the paper Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation. The system includes a large-scale shape generation model - Hunyuan3D-DiT and a large-scale texture synthesis model - Hunyuan3D-Paint. The shape generative model is built on a scalable flow-based diffusion transformer, and the texture synthesis model benefits from strong geometric and diffusion priors.