llava-1.5-7b-llara-D-inBC-Aux-B-VIMA-80k

Developed by variante
LLaRA is an open-source visuomotor policy model, fine-tuned from LLaVA-v1.5-7B on instruction-following data and auxiliary datasets, intended primarily for robotics research.
Downloads: 390
Release Date: 7/13/2024

Model Overview

LLaRA is a large multimodal model for robotics that processes visual and language inputs and produces robot actions, acting as a visuomotor policy.
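
As a reference point, here is a minimal inference sketch. It assumes the checkpoint can be loaded through the Hugging Face transformers LLaVA interface; the upstream LLaRA codebase builds on the original LLaVA implementation, so the checkpoint may need conversion before this runs as-is, and the image file and instruction are placeholders.

```python
# Minimal inference sketch (assumption: the checkpoint loads via the
# transformers LLaVA interface; conversion from the original LLaVA
# format may be required first).
import torch
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "variante/llava-1.5-7b-llara-D-inBC-Aux-B-VIMA-80k"
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# Placeholder camera frame and instruction in the LLaVA-1.5 prompt format.
image = Image.open("tabletop_scene.png")
prompt = "USER: <image>\nPut the red block on the green pad. ASSISTANT:"

inputs = processor(images=image, text=prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(processor.decode(output[0], skip_special_tokens=True))
```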

Model Features

Multimodal Processing Capability
Processes visual and text inputs jointly to produce corresponding robot actions.
Specialized for Robotics
Designed and optimized specifically for robotic applications, suited to vision-language policy research.
Open-Source Model
Released under the Apache 2.0 license, facilitating research and extension.

Model Capabilities

Vision-Language Understanding
Robot Action Generation (actions are emitted as text; see the parsing sketch after this list)
Multimodal Instruction Following
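
Because the policy's actions come back as text, downstream robot control needs a parsing step. The sketch below illustrates that idea with a hypothetical pick-and-place action grammar; the model's actual output format is determined by its inBC training data and is not documented here, so the pattern and example reply are illustrative only.

```python
# Illustrative post-processing: turn the policy's text reply into a
# structured robot command. The action grammar (pick/place with
# normalized 2D image coordinates and a rotation) is a hypothetical
# example, not the model's documented output format.
import re
from dataclasses import dataclass

@dataclass
class PickPlaceAction:
    pick_xy: tuple[float, float]
    place_xy: tuple[float, float]
    rotation_deg: float

_PATTERN = re.compile(
    r"pick up .* at \(([\d.]+), ([\d.]+)\)"
    r".* rotate \[?(-?[\d.]+)\]? degrees?"
    r".* place .* at \(([\d.]+), ([\d.]+)\)",
    re.IGNORECASE | re.DOTALL,
)

def parse_action(text: str) -> PickPlaceAction | None:
    """Extract a pick-and-place command from the model's reply."""
    m = _PATTERN.search(text)
    if m is None:
        return None  # reply was not a recognizable action
    return PickPlaceAction(
        pick_xy=(float(m.group(1)), float(m.group(2))),
        place_xy=(float(m.group(4)), float(m.group(5))),
        rotation_deg=float(m.group(3)),
    )

reply = ("Pick up the object at (0.32, 0.45), rotate [-45] degrees, "
         "then place it at (0.78, 0.12).")
print(parse_action(reply))
```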

Use Cases

Robotics
Visual Instruction Following
Generates robot actions from visual observations and language instructions.
Multimodal Task Planning
Performs complex task planning by combining visual and language inputs.