O

Octo Small

Developed by rail-berkeley
Octo Small is a robot control model trained based on diffusion policy, capable of predicting 7-dimensional actions for the next 4 steps, suitable for multi-source robot datasets.
Downloads 335
Release Time : 12/13/2023

Model Overview

This model adopts a Transformer architecture, preprocesses image data through a lightweight convolutional encoder, and combines a T5 language encoder to process text instructions for robot action prediction and control.

Model Features

Multi-source Data Training
Integrates training data from 25 different robot datasets, covering a wide range of robot operation scenarios
Lightweight Architecture
Adopts a compact Transformer architecture with 27 million parameters, suitable for real-time robot control
Multimodal Input
Supports input from main camera and wrist camera images, combined with language instructions for action prediction
Diffusion Policy
Trained using diffusion policy, capable of predicting 7-dimensional actions for the next 4 steps

Model Capabilities

Robot Action Prediction
Multimodal Data Processing
Real-time Control
Language Instruction Understanding

Use Cases

Industrial Robots
Assembly Line Operations
Completes product assembly tasks based on visual input and language instructions
Service Robots
Object Grasping
Identifies and grasps specified items based on user instructions
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase