Videomae Base Finetuned
A video understanding model fine-tuned on an unknown dataset based on the VideoMAE base model, achieving 86.41% accuracy on the evaluation set
Downloads 15
Release Time : 2/8/2023
Model Overview
This model is a fine-tuned version of the VideoMAE base architecture, primarily used for video content understanding tasks. Specific application scenarios require further details
Model Features
Efficient Video Representation Learning
Utilizes a masked autoencoder architecture to effectively learn spatiotemporal feature representations of videos
Excellent Fine-tuning Performance
Achieves 86.41% accuracy on the evaluation set, demonstrating strong performance
Lightweight Training
Can be effectively trained with a batch size of 8
Model Capabilities
Video Feature Extraction
Video Content Classification
Spatiotemporal Pattern Recognition
Use Cases
Video Content Analysis
Action Recognition
Identify human actions or behaviors in videos
86.41% accuracy (based on evaluation set)
Scene Classification
Classify video scene content
Featured Recommended AI Models
Š 2025AIbase