# Robot Manipulation Control
CogACT Small
MIT
CogACT is an advanced Vision-Language-Action (VLA) architecture derived from Vision-Language Models (VLMs) and designed specifically for robot manipulation.
Multimodal Fusion
Transformers English

CogACT
405
4
CogACT Large
MIT
CogACT is an advanced Vision-Language-Action (VLA) architecture derived from Vision-Language Models (VLMs) and designed specifically for robot manipulation.
Multimodal Fusion
Transformers English

CogACT
122
3
CogACT Base
MIT
CogACT is a novel Vision-Language-Action (VLA) architecture that combines vision-language models with specialized action modules for robotic manipulation tasks.
Multimodal Fusion
Transformers English

CogACT
6,589
12