Openlrm-obj-large-1.1 Open-Source Model - Easily Generate 3D Models from a Single Image for Free

Openlrm Obj Large 1.1

Developed by zxhezexin

OpenLRM is an open-source implementation of the LRM paper, designed for generating 3D models from a single image.

3D Vision

Transformers

#Single-image 3D Reconstruction #Tri-plane Decoding #DINOv2 Encoding

Downloads 21

Release Time: 3/4/2024

Model Overview

OpenLRM is a Transformer-based image-to-3D model capable of generating high-quality 3D reconstructions from a single input image.

Model Features

Multi-size Model Selection

Offers small/base/large models of different scales to accommodate varying computational resource needs.

Dual-data Training

Some models are trained on a mixed dataset of Objaverse and MVImgNet to enhance generalization capabilities.

Improved Image Encoder

Utilizes a DINOv2 model with register tokens as the image encoder.

Model Capabilities

Single-view 3D Reconstruction

3D Model Generation

Tri-plane Representation Learning

Use Cases

3D Content Creation

3D Asset Generation

Quickly generate 3D model assets from product photos

Computer Vision Research

3D Reconstruction Benchmarking

Serves as a benchmark model for 3D reconstruction tasks

🚀 Model Card for OpenLRM V1.1

This model card provides detailed information about the OpenLRM project, an open-source implementation of the LRM paper. The information corresponds to Version 1.1.

📚 Documentation

Overview

This model card is for the OpenLRM project, which is an open-source implementation of the LRM paper.
Information contained in this model card corresponds to Version 1.1.

Model Details

Training data

Model	Training Data
[openlrm-obj-small-1.1](https://huggingface.co/zxhezexin/openlrm-obj-small-1.1)	Objaverse
[openlrm-obj-base-1.1](https://huggingface.co/zxhezexin/openlrm-obj-base-1.1)	Objaverse
[openlrm-obj-large-1.1](https://huggingface.co/zxhezexin/openlrm-obj-large-1.1)	Objaverse
[openlrm-mix-small-1.1](https://huggingface.co/zxhezexin/openlrm-mix-small-1.1)	Objaverse + MVImgNet
[openlrm-mix-base-1.1](https://huggingface.co/zxhezexin/openlrm-mix-base-1.1)	Objaverse + MVImgNet
[openlrm-mix-large-1.1](https://huggingface.co/zxhezexin/openlrm-mix-large-1.1)	Objaverse + MVImgNet
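The naming scheme in this table is regular enough to build checkpoint identifiers programmatically. The following is a minimal sketch, not part of the OpenLRM code base: `pick_model`, `repo_id`, and `model_url` are hypothetical helper names, and the owner `zxhezexin` is taken from the links in the table.

```python
# Training data per checkpoint, as listed in the training data table.
TRAINING_DATA = {
    "openlrm-obj-small-1.1": "Objaverse",
    "openlrm-obj-base-1.1": "Objaverse",
    "openlrm-obj-large-1.1": "Objaverse",
    "openlrm-mix-small-1.1": "Objaverse + MVImgNet",
    "openlrm-mix-base-1.1": "Objaverse + MVImgNet",
    "openlrm-mix-large-1.1": "Objaverse + MVImgNet",
}

HF_OWNER = "zxhezexin"  # owner segment of the Hugging Face links in the table


def pick_model(size: str, mixed_data: bool) -> str:
    """Hypothetical helper: build a checkpoint name from size and data family."""
    family = "mix" if mixed_data else "obj"
    name = f"openlrm-{family}-{size}-1.1"
    if name not in TRAINING_DATA:
        raise ValueError(f"unknown checkpoint: {name}")
    return name


def repo_id(model_name: str) -> str:
    """Return the 'owner/name' identifier used on the Hugging Face Hub."""
    if model_name not in TRAINING_DATA:
        raise ValueError(f"unknown checkpoint: {model_name}")
    return f"{HF_OWNER}/{model_name}"


def model_url(model_name: str) -> str:
    """Return the model page URL that the table links to."""
    return f"https://huggingface.co/{repo_id(model_name)}"


if __name__ == "__main__":
    name = pick_model("large", mixed_data=False)
    print(name, "|", TRAINING_DATA[name], "|", model_url(name))
```

The `obj` checkpoints are the choice when Objaverse-only training is wanted; the `mix` checkpoints add MVImgNet.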

Model architecture (version==1.1)

Type	Layers	Feat. Dim	Attn. Heads	Triplane Dim.	Input Res.	Image Encoder	Size
small	12	512	8	32	224	dinov2_vits14_reg	446M
base	12	768	12	48	336	dinov2_vitb14_reg	1.04G
large	16	1024	16	80	448	dinov2_vitb14_reg	1.81G
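To make the architecture table easier to compare across sizes, the sketch below restates it as a small configuration object and derives two quantities from it. The class and field names are illustrative assumptions, not OpenLRM's actual configuration schema. The patch size of 14 is read from the encoder names (`dinov2_vits14_reg`, `dinov2_vitb14_reg`), and the patch count excludes the class and register tokens.

```python
from dataclasses import dataclass

DINOV2_PATCH_SIZE = 14  # the "14" in dinov2_vits14_reg / dinov2_vitb14_reg


@dataclass(frozen=True)
class ArchConfig:
    """One row of the architecture table (field names are illustrative)."""
    layers: int
    feat_dim: int
    attn_heads: int
    triplane_dim: int
    input_res: int
    image_encoder: str

    @property
    def head_dim(self) -> int:
        # Feature dimension split evenly across attention heads.
        return self.feat_dim // self.attn_heads

    @property
    def image_patches(self) -> int:
        # Number of patch tokens the encoder sees for a square input image.
        side = self.input_res // DINOV2_PATCH_SIZE
        return side * side


ARCH = {
    "small": ArchConfig(12, 512, 8, 32, 224, "dinov2_vits14_reg"),
    "base": ArchConfig(12, 768, 12, 48, 336, "dinov2_vitb14_reg"),
    "large": ArchConfig(16, 1024, 16, 80, 448, "dinov2_vitb14_reg"),
}

if __name__ == "__main__":
    for name, cfg in ARCH.items():
        print(name, cfg.head_dim, cfg.image_patches)
```

With these numbers every size uses 64 features per attention head, while the number of image patches grows from 256 (small) to 576 (base) to 1024 (large).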

Training settings

Type	Rend. Res.	Rend. Patch	Ray Samples
small	192	64	96
base	288	96	96
large	384	128	128
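The training settings can be turned into a rough per-patch workload estimate. This sketch assumes one reading of the column names, which the model card does not spell out: that supervision is computed on a square crop of side "Rend. Patch" taken from a square rendering of side "Rend. Res.", with "Ray Samples" points sampled along each ray. The function name `patch_budget` is hypothetical.

```python
from fractions import Fraction

# (rend_res, rend_patch, ray_samples), as listed in the training settings table.
TRAINING_SETTINGS = {
    "small": (192, 64, 96),
    "base": (288, 96, 96),
    "large": (384, 128, 128),
}


def patch_budget(model_type: str) -> dict:
    """Estimate rays and sample points per rendered patch (assumed semantics)."""
    rend_res, rend_patch, ray_samples = TRAINING_SETTINGS[model_type]
    rays = rend_patch * rend_patch  # one ray per pixel of the patch
    return {
        "rays_per_patch": rays,
        "points_per_patch": rays * ray_samples,
        # Share of the full rendering covered by one patch.
        "patch_area_fraction": Fraction(rays, rend_res * rend_res),
    }


if __name__ == "__main__":
    for model_type in TRAINING_SETTINGS:
        print(model_type, patch_budget(model_type))
```

Under this reading, one patch covers one ninth of the rendered image at every model size, and the large model evaluates about 2.1 million sample points per patch versus about 0.39 million for the small model.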

Notable Differences from the Original Paper

We do not use the deferred back-propagation technique from the original paper.
We used random background colors during training.
The image encoder is based on the DINOv2 model with register tokens.
The triplane decoder contains 4 layers in our implementation.
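The model card states only that random background colors were used during training, not how they were sampled or applied. As an illustration of the general idea, the sketch below alpha-composites RGBA pixels over a uniformly sampled RGB color. The function names and the uniform sampling are assumptions, not OpenLRM's implementation.

```python
import random


def random_background(rng: random.Random) -> tuple:
    """Sample one RGB background color with channels in [0, 1] (assumed uniform)."""
    return (rng.random(), rng.random(), rng.random())


def composite_over_background(rgba_pixels, background):
    """Alpha-blend RGBA pixels (floats in [0, 1]) over a single RGB color.

    Each output channel is alpha * foreground + (1 - alpha) * background.
    """
    blended = []
    for r, g, b, a in rgba_pixels:
        blended.append(tuple(
            a * fg + (1.0 - a) * bg
            for fg, bg in zip((r, g, b), background)
        ))
    return blended


if __name__ == "__main__":
    rng = random.Random(0)  # seeded so repeated runs pick the same color
    bg = random_background(rng)
    pixels = [
        (1.0, 0.0, 0.0, 1.0),  # opaque red: unaffected by the background
        (0.0, 0.0, 0.0, 0.0),  # fully transparent: becomes the background
    ]
    print(composite_over_background(pixels, bg))
```

Varying the background in this way keeps a model from relying on one fixed background color when separating the object from empty space.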

📄 License

The model weights are released under the Creative Commons Attribution-NonCommercial 4.0 International License.
They are provided for research purposes only, and CANNOT be used commercially.

Disclaimer

This model is an open-source implementation and is NOT the official release of the original research paper. While it aims to reproduce the original results as faithfully as possible, there may be variations due to model implementation, training data, and other factors.

Ethical Considerations

This model should be used responsibly and ethically, and should not be used for malicious purposes.
Users should be aware of potential biases in the training data.
The model should not be used in circumstances that could lead to harm or unfair treatment of individuals or groups.

Usage Considerations

The model is provided "as is" without warranty of any kind.
Users are responsible for ensuring that their use complies with all relevant laws and regulations.
The developers and contributors of this model are not liable for any damages or losses arising from the use of this model.

This model card is subject to updates and modifications. Users are advised to check for the latest version regularly.
