O

Openvla 7b Oft Finetuned Libero Goal

Developed by moojink
OpenVLA-OFT is an optimized vision-language-action model that significantly improves the performance and speed of the basic OpenVLA model through fine-tuning technology.
Downloads 579
Release Time : 2/25/2025

Model Overview

This model combines vision, language, and action generation capabilities, and is specifically optimized for robot tasks. It can generate continuous action sequences based on visual input and task descriptions.

Model Features

Optimized fine-tuning technology
Adopts OFT (Optimized Fine-Tuning) technology, with significant performance improvement compared to the basic model
Multimodal input processing
Can simultaneously process visual images, language descriptions, and proprioceptive state inputs
Continuous action generation
Generates continuous robot action sequences through an MLP action head

Model Capabilities

Vision-language understanding
Continuous action prediction
Robot task execution
Multimodal data fusion

Use Cases

Robot control
Spatial task execution
Completes spatial operation tasks based on visual input and task descriptions
Performs better than the basic model on the LIBERO-Goal task
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase