Octo Small
Octo Small is a robot control model trained based on diffusion policy, capable of predicting 7-dimensional actions for the next 4 steps, suitable for multi-source robot datasets.
Downloads 335
Release Time : 12/13/2023
Model Overview
This model adopts a Transformer architecture, preprocesses image data through a lightweight convolutional encoder, and combines a T5 language encoder to process text instructions for robot action prediction and control.
Model Features
Multi-source Data Training
Integrates training data from 25 different robot datasets, covering a wide range of robot operation scenarios
Lightweight Architecture
Adopts a compact Transformer architecture with 27 million parameters, suitable for real-time robot control
Multimodal Input
Supports input from main camera and wrist camera images, combined with language instructions for action prediction
Diffusion Policy
Trained using diffusion policy, capable of predicting 7-dimensional actions for the next 4 steps
Model Capabilities
Robot Action Prediction
Multimodal Data Processing
Real-time Control
Language Instruction Understanding
Use Cases
Industrial Robots
Assembly Line Operations
Completes product assembly tasks based on visual input and language instructions
Service Robots
Object Grasping
Identifies and grasps specified items based on user instructions
Featured Recommended AI Models
Š 2025AIbase