R

Rdt 1b

Developed by robotics-diffusion-transformer
A 1-billion-parameter imitation learning diffusion Transformer model pretrained on 1M+ multi-robot operation data, supporting multi-view visual-language-action prediction
Downloads 2,644
Release Time : 8/27/2024

Model Overview

This model can predict future 64 robot actions based on language instructions and multi-view RGB images, compatible with various modern mobile robotic arm systems

Model Features

Multimodal Input Support
Simultaneously processes language instructions and up to three-view RGB image inputs
Universal Robot Compatibility
Supports various robotic platforms including single/dual arms, joint/end-effector space, position/velocity control
Large-scale Pretraining
Trained on 1M+ robot operation data and 46 public datasets
Long-sequence Action Prediction
Capable of predicting future 64 continuous robot actions

Model Capabilities

Vision-language understanding
Robot action sequence prediction
Multi-view image processing
Cross-platform robot control

Use Cases

Industrial Automation
Assembly Line Operation
Complete part grasping and assembly tasks based on language instructions
Achieves precise continuous motion control
Service Robots
Home Organization
Identify and organize household items based on voice commands
Completes complex multi-step operation sequences
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase