Timesformer Large Finetuned K400
T
Timesformer Large Finetuned K400
Developed by fcakyon
TimeSformer is a video classification model based on spatio-temporal attention mechanism, specifically designed for video understanding tasks.
Downloads 254
Release Time : 12/10/2022
Model Overview
This model is pre-trained on the Kinetics-400 dataset and can classify videos into one of 400 possible categories. It employs a pure attention mechanism to process spatio-temporal information in videos.
Model Features
Pure Attention Mechanism
Completely based on Transformer architecture to process spatio-temporal information in videos, without the need for convolutional operations.
Efficient Video Understanding
Capable of effectively capturing spatio-temporal features in videos, suitable for long video understanding.
Large-scale Pretraining
Pre-trained on the large-scale Kinetics-400 video dataset.
Model Capabilities
Video Classification
Spatio-Temporal Feature Extraction
Video Content Understanding
Use Cases
Video Content Analysis
Action Recognition
Identify human actions and behaviors in videos
Can recognize 400 action categories from the Kinetics-400 dataset
Video Content Classification
Classify and label video content
Featured Recommended AI Models