Videomaev2 Huge
VideoMAEv2-Huge is a self-supervised learning-based video feature extraction model, pre-trained for 1200 epochs on the UnlabeledHybrid-1M dataset.
Downloads 1,145
Release Time : 1/14/2025
Model Overview
This model is primarily used for video feature extraction, employing a dual-masking strategy for pre-training, effectively capturing spatiotemporal features in videos.
Model Features
Dual-Masking Pre-Training Strategy
Employs a dual-masking strategy for self-supervised learning, enhancing the model's understanding of spatiotemporal features in videos.
Large-Scale Pre-Training
Pre-trained for 1200 epochs on the UnlabeledHybrid-1M dataset, learning rich video feature representations.
Efficient Feature Extraction
Capable of extracting meaningful spatiotemporal features from videos, suitable for downstream video understanding tasks.
Model Capabilities
Video Feature Extraction
Video Classification
Video Understanding
Use Cases
Video Analysis
Video Content Classification
Classify video content, such as action recognition, scene recognition, etc.
Video Retrieval
Extract video features for similar video retrieval.
Featured Recommended AI Models
Š 2025AIbase