Dust3r ViTLarge BaseDecoder 512 Dpt

Developed by naver
DUSt3R is a model for geometric 3D vision that reconstructs 3D scenes from one or more images.
Downloads 46.93k
Release Time: 6/24/2024

Model Overview

DUSt3R is a deep learning model that reconstructs 3D geometry from 2D images. It uses an asymmetric CroCo3DStereo architecture that pairs a ViT-Large encoder with a ViT-Base decoder, and it accepts input images at several resolutions.

Model Features

Multi-resolution Support
Supports several input resolutions (512x384, 512x336, etc.), so inputs with different aspect ratios can be handled
Efficient 3D Reconstruction
Quickly reconstructs 3D scene geometric structures from single or multiple images
Advanced Architecture
Asymmetric CroCo3DStereo architecture combining a ViT-Large encoder and a ViT-Base decoder
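
As a rough illustration of the multi-resolution support, the sketch below picks the supported size whose aspect ratio is closest to that of an input photo. It is not part of the DUSt3R codebase: the helper name `pick_resolution` and the `SUPPORTED_SIZES` list are assumptions made for this example, and the list contains only the two sizes named above (the "etc." is left unfilled).

```python
from typing import List, Tuple

# Assumption: only the two sizes named in the text are listed here.
# The model supports more sizes; they are deliberately not guessed.
SUPPORTED_SIZES: List[Tuple[int, int]] = [(512, 384), (512, 336)]


def pick_resolution(
    width: int,
    height: int,
    sizes: List[Tuple[int, int]] = SUPPORTED_SIZES,
) -> Tuple[int, int]:
    """Return the (width, height) entry whose aspect ratio best matches the input.

    Sizes are stored in landscape orientation; for portrait inputs the
    chosen size is returned with width and height swapped.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")

    landscape = width >= height
    long_side, short_side = max(width, height), min(width, height)
    ratio = long_side / short_side

    # Choose the size with the smallest aspect-ratio difference.
    best = min(sizes, key=lambda s: abs(s[0] / s[1] - ratio))
    return best if landscape else (best[1], best[0])


if __name__ == "__main__":
    print(pick_resolution(4000, 3000))  # a 4:3 landscape photo
    print(pick_resolution(1080, 1920))  # a 16:9 portrait photo
```

The helper only chooses a target size. Resizing and cropping the image to that size would still be done by whatever image library the pipeline already uses.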

Model Capabilities

Single-image 3D Reconstruction
Multi-view 3D Reconstruction
Depth Estimation
Point Cloud Generation
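
The depth and point cloud capabilities are closely related. The sketch below assumes the model's output for an image is a per-pixel map of 3D points in a camera frame plus a per-pixel confidence score. Under that assumption, a depth map is the z coordinate of each point, and a point cloud is the list of points that pass a confidence filter. The function names and the `min_conf` default are hypothetical and chosen for illustration; plain Python lists stand in for the tensors a real pipeline would use.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def depth_from_pointmap(pointmap: Sequence[Sequence[Point]]) -> List[List[float]]:
    """Read depth as the z coordinate of every pixel's 3D point.

    Assumes the points are expressed in the frame of the camera that
    took the image, with z pointing along the viewing direction.
    """
    return [[p[2] for p in row] for row in pointmap]


def point_cloud_from_pointmap(
    pointmap: Sequence[Sequence[Point]],
    confidence: Sequence[Sequence[float]],
    min_conf: float = 1.5,  # hypothetical threshold, tune per dataset
) -> List[Point]:
    """Flatten the H x W grid of points, keeping confident pixels only."""
    cloud: List[Point] = []
    for point_row, conf_row in zip(pointmap, confidence):
        for point, conf in zip(point_row, conf_row):
            if conf >= min_conf:
                cloud.append((point[0], point[1], point[2]))
    return cloud


if __name__ == "__main__":
    # A made-up 2 x 2 pointmap and confidence map, for illustration only.
    pointmap = [
        [(0.0, 0.0, 1.0), (0.1, 0.0, 1.2)],
        [(0.0, 0.1, 2.0), (0.1, 0.1, 2.5)],
    ]
    confidence = [[3.0, 1.0], [2.0, 0.5]]
    print(depth_from_pointmap(pointmap))
    print(point_cloud_from_pointmap(pointmap, confidence))
```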

Use Cases

Computer Vision
Scene Reconstruction
Reconstructs 3D models of indoor and outdoor scenes from one or more photos
Produces 3D scenes that can be used in AR/VR applications
Object Modeling
Generates 3D models from photos of objects
Results can be used for 3D printing or digital content creation
Augmented Reality
AR Scene Understanding
Provides 3D geometric information of scenes for AR applications
Enhances realism and interactivity of AR objects