V

Videomaev2 Huge

Developed by OpenGVLab
VideoMAEv2-Huge is a self-supervised learning-based video feature extraction model, pre-trained for 1200 epochs on the UnlabeledHybrid-1M dataset.
Downloads 1,145
Release Time : 1/14/2025

Model Overview

This model is primarily used for video feature extraction, employing a dual-masking strategy for pre-training, effectively capturing spatiotemporal features in videos.

Model Features

Dual-Masking Pre-Training Strategy
Employs a dual-masking strategy for self-supervised learning, enhancing the model's understanding of spatiotemporal features in videos.
Large-Scale Pre-Training
Pre-trained for 1200 epochs on the UnlabeledHybrid-1M dataset, learning rich video feature representations.
Efficient Feature Extraction
Capable of extracting meaningful spatiotemporal features from videos, suitable for downstream video understanding tasks.

Model Capabilities

Video Feature Extraction
Video Classification
Video Understanding

Use Cases

Video Analysis
Video Content Classification
Classify video content, such as action recognition, scene recognition, etc.
Video Retrieval
Extract video features for similar video retrieval.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase