V

Videomae Base Short Finetuned Ssv2 Finetuned Rwf2000 Epochs8 Batch8 Fp16

Developed by lmazzon70
Video action recognition model based on VideoMAE architecture, pre-trained on SSv2 dataset and further fine-tuned on RWF-2000 dataset
Downloads 14
Release Time : 1/11/2023

Model Overview

This is a deep learning model for video action recognition, based on the VideoMAE architecture, pre-trained through self-supervised learning and fine-tuned for specific action recognition tasks.

Model Features

Efficient Video Understanding
Adopts VideoMAE architecture to achieve efficient video representation learning through masked autoencoder
Two-stage Training
Pre-trained on SSv2 dataset first, then fine-tuned on RWF-2000 dataset to enhance task-specific performance
Mixed Precision Training
Uses FP16 mixed precision training to improve training efficiency

Model Capabilities

Video Action Recognition
Temporal Action Analysis
Video Content Understanding

Use Cases

Security Surveillance
Violence Detection
Identify violent behaviors in surveillance videos
Sports Analysis
Athlete Action Recognition
Identify specific movements and techniques of athletes
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase