Openvla 7b Oft Finetuned Libero Spatial
OpenVLA-OFT is an optimized vision-language-action model that significantly improves the running speed and task success rate of the basic OpenVLA model through fine-tuning technology.
Downloads 2,513
Release Time : 2/25/2025
Model Overview
This project focuses on the fine-tuning of vision-language-action models, aiming to optimize the model's running speed and improve the task success rate. It is suitable for LIBERO-Spatial tasks and uses optimized fine-tuning technology to enhance performance.
Model Features
Optimized Fine-tuning Technology
Adopt optimized fine-tuning technology to significantly improve the performance of the basic OpenVLA model
Efficient Action Generation
Capable of generating continuous action blocks, suitable for robot control tasks
Multimodal Input Processing
Support multimodal inputs of vision (images), language (task descriptions), and proprioceptive states
Model Capabilities
Vision-Language-Action Multimodal Processing
Robot Action Sequence Generation
Continuous Action Prediction
Task-oriented Control
Use Cases
Robot Control
LIBERO-Spatial Task Execution
Generate robot action sequences based on visual and language inputs
Improve task execution speed and success rate
Featured Recommended AI Models
Š 2025AIbase