Openvla 7b Finetuned Libero Object
This is a vision-language-action model fine-tuned using LoRA on the LIBERO-Object dataset, specifically designed for robotics.
Downloads 959
Release Time : 9/3/2024
Model Overview
This model is a vision-language-action model obtained by fine-tuning the OpenVLA 7B model with LoRA (r=32) on the LIBERO-Object dataset from the LIBERO simulation benchmark, suitable for the field of robotics.
Model Features
LoRA Fine-tuning
Efficient fine-tuning using LoRA (r=32), adapting to specific tasks while maintaining the model's original performance.
Multimodal Processing
Capable of processing both visual and linguistic information to achieve image-to-text conversion.
Robotics Optimization
Specifically optimized for robotic tasks in the LIBERO simulation benchmark.
Model Capabilities
Vision-Language Joint Understanding
Image-to-Text Conversion
Robot Action Command Generation
Multimodal Task Processing
Use Cases
Robotics
Object Manipulation in LIBERO Simulation Environment
Generates robot operation commands based on visual input in the LIBERO simulation environment.
Optimized model performance
Featured Recommended AI Models
Š 2025AIbase