Theia (theia-base-patch16-224-cdiv) Open-Source Model - A Visual Representation Artifact Empowering Robot Learning

Theia Base Patch16 224 Cdiv

Developed by theaiinstitute

Theia is a vision foundation model designed for robot learning, constructed by distilling multiple off-the-shelf vision foundation models, possessing rich visual representation capabilities.

Image Classification

Transformers

Open Source License:Other #Robot Vision #Multi-Model Distillation #Few-Shot Learning

Downloads 7,621

Release Time : 7/29/2024

Model Overview

Theia is a vision foundation model specifically designed for robot learning. By distilling knowledge from multiple vision foundation models such as CLIP, DINOv2, and ViT, it builds diverse visual representations that can enhance the performance of downstream robot learning tasks.

Model Features

Multi-Model Distillation

Constructs diverse visual representations by distilling knowledge from multiple vision foundation models such as CLIP, DINOv2, and ViT.

Efficient Learning

Outperforms its teacher models and existing robot learning models with less training data and smaller model size.

Rich Visual Representations

Encodes diverse visual knowledge to enhance downstream robot learning performance.

Model Capabilities

Visual Representation Learning

Robot Vision Task Enhancement

Multimodal Visual Understanding

Use Cases

Robot Learning

Robot Visual Navigation

Leverages Theia's visual representation capabilities to enhance robot navigation in complex environments.

Experiments show that Theia outperforms existing models with less training data and smaller model size.

Object Recognition and Grasping

Improves robot accuracy in object recognition and grasping through Theia's diverse visual knowledge.

🚀 Theia

Theia is a vision foundation model for robot learning that distills multiple off - the - shelf vision foundation models. Its rich visual representations enhance downstream robot learning.

🚀 Quick Start

Theia is a vision foundation model tailored for robot learning. It distills multiple off - the - shelf vision foundation models trained on various vision tasks. The rich visual representations of Theia encode diverse visual knowledge, which significantly enhances downstream robot learning. The model was introduced in the paper Theia: Distilling Diverse Vision Foundation Models for Robot Learning. Experiments in the paper show that Theia outperforms its teacher models and prior robot learning models, requiring less training data and smaller model sizes. You can find demo videos on the project page.

Theia Overview

✨ Features

The theia-base-patch16-224-cdiv model uses DeiT-Base as a backbone. It simultaneously distills CLIP, DINOv2, and ViT. For more usage information, please visit the Theia repository.

📚 Documentation

Model Details

The theia-base-patch16-224-cdiv model leverages DeiT-Base as its backbone and distills knowledge from CLIP, DINOv2, and ViT. For more details on usage, visit the Theia repository.

Citation

If you use Theia in your research, please use the following BibTeX entry:

@article{shang2024theia,
  author    = {Shang, Jinghuan and Schmeckpeper, Karl and May, Brandon B. and Minniti, Maria Vittoria and Kelestemur, Tarik and Watkins, David and Herlant, Laura},
  title     = {Theia: Distilling Diverse Vision Foundation Models for Robot Learning},
  journal   = {arXiv},
  year      = {2024},
}

Usage

The pre - trained model weights and code released with Theia are available for use under The AI Institute License, which is fully reproduced below:

Copyright (c) 2024 Boston Dynamics AI Institute LLC

Redistribution and use in source and binary forms, with or without
modification, are permitted provided that the following conditions are met:
1. Redistributions of source code must retain the copyright notice included
with the software, this list of conditions and the following disclaimer.
2. Redistributions in binary form must reproduce the copyright notice, this
list of conditions and the following disclaimer in the documentation and/or
other materials provided with the distribution.
3. Modified versions of the software must be conspicuously marked as such.
4. The software may only be used for non - commercial research purposes.
For profit enterprises may use the software, subject to this limitation.

THIS SOFTWARE IS PROVIDED BY THE AI INSTITUTE AND CONTRIBUTORS "AS IS" AND
ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, NON -
INFRINGEMENT,TITLE, MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
DISCLAIMED. IN NO EVENT SHALL THE AI INSTITUTE OR CONTRIBUTORS BE LIABLE FOR
ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, PUNITIVE OR CONSEQUENTIAL
DAMAGES (INCLUDING, BUT NOT LIMITED TO, DAMAGES ARISING OUT OF CLAIMS OF
INTELLECTUAL PROPERTY RIGHTS INFRINGEMENT; PROCUREMENT OF SUBSTITUTE GOODS OR
SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

📄 License

The Theia project is released under The AI Institute License.

Copyright (c) 2024 Boston Dynamics AI Institute LLC

Redistribution and use in source and binary forms, with or without
modification, are permitted provided that the following conditions are met:
1. Redistributions of source code must retain the copyright notice included
with the software, this list of conditions and the following disclaimer.
2. Redistributions in binary form must reproduce the copyright notice, this
list of conditions and the following disclaimer in the documentation and/or
other materials provided with the distribution.
3. Modified versions of the software must be conspicuously marked as such.
4. The software may only be used for non - commercial research purposes.
For profit enterprises may use the software, subject to this limitation.

THIS SOFTWARE IS PROVIDED BY THE AI INSTITUTE AND CONTRIBUTORS "AS IS" AND
ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, NON -
INFRINGEMENT,TITLE, MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
DISCLAIMED. IN NO EVENT SHALL THE AI INSTITUTE OR CONTRIBUTORS BE LIABLE FOR
ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, PUNITIVE OR CONSEQUENTIAL
DAMAGES (INCLUDING, BUT NOT LIMITED TO, DAMAGES ARISING OUT OF CLAIMS OF
INTELLECTUAL PROPERTY RIGHTS INFRINGEMENT; PROCUREMENT OF SUBSTITUTE GOODS OR
SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご