F

Flower Libero 10

Developed by mbreuss
FlowerVLA is a pre-trained vision-language-action flow policy model for robotic manipulation tasks, trained on the LIBERO 10 dataset with only 1 billion parameters.
Downloads 14
Release Time : 3/17/2025

Model Overview

FlowerVLA adopts an innovative architecture, utilizing half the parameters of the Florence-2 model for multimodal vision-language encoding and employing a novel Transformer-based flow matching architecture to deliver an efficient and versatile VLA strategy with approximately 1 billion parameters.

Model Features

Efficient Multimodal Encoding
Achieves multimodal vision-language encoding with half the parameters of the Florence-2 model
Flow Matching Architecture
Employs a novel Transformer-based flow matching architecture
Efficient Parameter Scale
Contains only 1 billion parameters, providing an efficient and versatile VLA strategy
High Performance
Achieves high success rates in the LIBERO 10 challenge

Model Capabilities

Vision-Language-Action Model
Robotic Manipulation Tasks
Multimodal Encoding
Flow Matching

Use Cases

Robotic Manipulation
Place Items into Basket
Place alphabet soup and ketchup into the basket
Success Rate 0.9791666666666666
Turn on Stove and Place Moka Pot
Kitchen Scene 3: Turn on stove and place moka pot
Success Rate 0.9791666666666666
Place Black Bowl into Bottom Cabinet Drawer and Close
Kitchen Scene 4: Place black bowl into bottom cabinet drawer and close
Success Rate 1.0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase