OpenVLA 7B OFT Finetuned LIBERO-Goal
OpenVLA-OFT is a vision-language-action model that applies an optimized fine-tuning recipe to the base OpenVLA model, significantly improving both its task performance and its speed.
Downloads 579
Release Time: 2/25/2025
Model Overview
This model combines vision, language, and action generation capabilities, and is specifically optimized for robot tasks. It can generate continuous action sequences based on visual input and task descriptions.
Model Features
Optimized fine-tuning technology
Uses OFT (Optimized Fine-Tuning), which yields a significant performance improvement over the base OpenVLA model
Multimodal input processing
Can simultaneously process visual images, language descriptions, and proprioceptive state inputs
Continuous action generation
Generates continuous robot action sequences through an MLP action head
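To make the idea of an MLP action head concrete, the following is a minimal sketch in plain Python, not the model's actual implementation. It assumes the backbone has already produced one hidden-state vector per future time step, and it maps each vector to a continuous action through a two-layer MLP. All names (`mlp_action_head`, `init_layer`) and all sizes (hidden width 16, MLP width 32, 7-dimensional actions, 8 actions per chunk) are hypothetical and chosen only for illustration. The weights are random rather than trained.

```python
import random


def linear(x, weights, bias):
    """Affine map: one output value per row of `weights`."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def relu(x):
    """Element-wise ReLU non-linearity."""
    return [max(0.0, v) for v in x]


def init_layer(n_in, n_out, rng):
    """Random weights and zero biases (stand-ins for trained parameters)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    bias = [0.0] * n_out
    return weights, bias


def mlp_action_head(hidden_states, layer1, layer2):
    """Map each hidden-state vector to one continuous action vector."""
    actions = []
    for h in hidden_states:
        z = relu(linear(h, *layer1))        # hidden -> MLP width
        actions.append(linear(z, *layer2))  # MLP width -> action dims
    return actions


# Illustrative sizes only; these are assumptions, not the model's real ones.
HIDDEN_DIM, MLP_DIM, ACTION_DIM, CHUNK_LEN = 16, 32, 7, 8

rng = random.Random(0)
layer1 = init_layer(HIDDEN_DIM, MLP_DIM, rng)
layer2 = init_layer(MLP_DIM, ACTION_DIM, rng)

# Stand-in for backbone outputs: one hidden vector per action in the chunk.
hidden_states = [[rng.gauss(0, 1) for _ in range(HIDDEN_DIM)]
                 for _ in range(CHUNK_LEN)]

action_chunk = mlp_action_head(hidden_states, layer1, layer2)
print(len(action_chunk), len(action_chunk[0]))  # prints "8 7"
```

The point of the sketch is that the outputs are real-valued vectors produced directly by regression, so a sequence of actions comes out in one pass rather than being decoded token by token.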
Model Capabilities
Vision-language understanding
Continuous action prediction
Robot task execution
Multimodal data fusion
Use Cases
Robot control
Spatial task execution
Completes spatial manipulation tasks based on visual input and task descriptions
Outperforms the base OpenVLA model on the LIBERO-Goal task suite