
Dust3r ViTLarge BaseDecoder 512 Linear

Developed by NAVER
DUSt3R is a deep learning model that reconstructs 3D geometry from images and is designed to simplify geometric 3D vision tasks.
Downloads 313
Release Time: 6/21/2024

Model Overview

DUSt3R is a Vision Transformer (ViT) based deep learning model that predicts 3D geometric structure from 2D images. It uses an asymmetric CroCo3DStereo architecture and accepts input images at several resolutions.

Model Features

Multi-resolution Support
Accepts several input resolutions with a 512-pixel long side, from 512x384 down to 512x160, to suit images of different aspect ratios.
Efficient 3D Reconstruction
Capable of rapidly reconstructing 3D geometric structures from single or multiple 2D images.
Hybrid ViT Architecture
Combines a ViT-Large encoder with a ViT-Base decoder to balance accuracy and efficiency.
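
To illustrate the multi-resolution feature, the following minimal sketch picks the target size whose aspect ratio best matches an input image. The intermediate sizes in SUPPORTED_RESOLUTIONS are an assumption taken from the upstream project's checkpoint listing (this page only states the 512x384 to 512x160 range), and pick_resolution is a hypothetical helper, not part of the DUSt3R code base.

```python
# Assumed resolution list for the 512 checkpoint; this page only states
# the end points of the range (512x384 and 512x160).
SUPPORTED_RESOLUTIONS = [(512, 384), (512, 336), (512, 288), (512, 256), (512, 160)]


def pick_resolution(width, height, candidates=SUPPORTED_RESOLUTIONS):
    """Return the candidate size whose aspect ratio is closest to the image's.

    Hypothetical helper for illustration only.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    long_side, short_side = max(width, height), min(width, height)
    aspect = long_side / short_side
    # Candidates are stored as (long side, short side).
    best = min(candidates, key=lambda wh: abs(wh[0] / wh[1] - aspect))
    # Preserve orientation: portrait inputs get a portrait target.
    return best if width >= height else (best[1], best[0])


if __name__ == "__main__":
    print(pick_resolution(4000, 3000))  # 4:3 landscape photo
    print(pick_resolution(1080, 1920))  # 9:16 portrait video frame
```

Matching on aspect ratio rather than pixel count keeps cropping or padding to a minimum when the image is resized to the chosen target.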

Model Capabilities

Single-image 3D Reconstruction
Multi-view 3D Reconstruction
Geometric Structure Estimation
Depth Estimation
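
Depth estimation follows from geometric structure estimation when the model output is a pointmap, that is, one 3D point per pixel. The sketch below assumes such a pointmap is available as a nested list of (x, y, z) tuples expressed in that image's own camera frame, so depth is simply the z coordinate. The function names and the data layout are hypothetical and chosen for illustration; the real model returns tensors.

```python
def depth_from_pointmap(pointmap):
    """Return an H x W depth map (z per pixel) from an H x W grid of (x, y, z) points.

    Assumes the points are expressed in the camera frame of the same image.
    """
    return [[point[2] for point in row] for row in pointmap]


def depth_range(depth_map):
    """Return (min, max) depth over all pixels."""
    values = [d for row in depth_map for d in row]
    if not values:
        raise ValueError("depth map is empty")
    return min(values), max(values)


if __name__ == "__main__":
    # Tiny 2 x 2 pointmap, made up for the example.
    pointmap = [
        [(-0.5, -0.5, 2.0), (0.5, -0.5, 2.5)],
        [(-0.5, 0.5, 3.0), (0.5, 0.5, 4.0)],
    ]
    depth = depth_from_pointmap(pointmap)
    print(depth, depth_range(depth))
```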

Use Cases

Computer Vision
Scene Reconstruction
Reconstructs 3D geometric structures of indoor/outdoor scenes from single or multiple photos.
Generates 3D point clouds or mesh representations of scenes.
Object Modeling
Generates 3D models from object photos.
Useful for AR/VR content creation or 3D printing.
Robotic Vision
Environmental Perception
Provides robots with 3D geometric understanding of environments.
Supports robot navigation and obstacle avoidance.
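
For the point-cloud use cases above, a common post-processing step is to drop low-confidence points and export the rest for a viewer or a downstream planner. The sketch below assumes the predicted points and their per-point confidence scores have already been flattened into plain Python lists; to_ascii_ply and the default threshold of 1.5 are hypothetical choices for illustration, not values from the model's documentation.

```python
def to_ascii_ply(points, confidences, min_confidence=1.5):
    """Serialize points whose confidence meets the threshold as an ASCII PLY string.

    The threshold is an illustrative assumption; tune it for your data.
    """
    if len(points) != len(confidences):
        raise ValueError("points and confidences must have the same length")
    kept = [p for p, c in zip(points, confidences) if c >= min_confidence]
    header = [
        "ply",
        "format ascii 1.0",
        f"element vertex {len(kept)}",
        "property float x",
        "property float y",
        "property float z",
        "end_header",
    ]
    body = [f"{x:.6f} {y:.6f} {z:.6f}" for x, y, z in kept]
    return "\n".join(header + body) + "\n"


if __name__ == "__main__":
    # Made-up points and confidence scores for the example.
    points = [(0.0, 0.0, 1.0), (1.0, 2.0, 3.0), (0.2, -0.1, 2.4)]
    confidences = [1.0, 2.0, 3.5]
    print(to_ascii_ply(points, confidences))
```

The resulting string can be written to a .ply file and opened in common point-cloud viewers.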