Timesformer Base Finetuned K600
TimeSformer is a video classification model based on spatio-temporal attention mechanisms, fine-tuned on the Kinetics-600 dataset.
Downloads 20
Release Time : 12/10/2022
Model Overview
This model is primarily used for video classification tasks, capable of classifying videos into one of 600 possible Kinetics-600 labels.
Model Features
Spatio-temporal attention mechanism
Utilizes an innovative spatio-temporal attention mechanism to process video data without traditional 3D convolution operations.
Efficient video understanding
Effectively captures spatio-temporal features in videos for efficient video classification.
Large-scale pre-training
Pre-trained and fine-tuned on the large-scale Kinetics-600 video dataset.
Model Capabilities
Video classification
Spatio-temporal feature extraction
Action recognition
Use Cases
Video content analysis
Action recognition
Identifies human actions and behaviors in videos
Can classify 600 different action categories
Video content classification
Automatically classifies and tags video content
Featured Recommended AI Models
Š 2025AIbase