T

Timesformer Large Finetuned K400

Developed by fcakyon
TimeSformer is a video classification model based on spatio-temporal attention mechanism, specifically designed for video understanding tasks.
Downloads 254
Release Time : 12/10/2022

Model Overview

This model is pre-trained on the Kinetics-400 dataset and can classify videos into one of 400 possible categories. It employs a pure attention mechanism to process spatio-temporal information in videos.

Model Features

Pure Attention Mechanism
Completely based on Transformer architecture to process spatio-temporal information in videos, without the need for convolutional operations.
Efficient Video Understanding
Capable of effectively capturing spatio-temporal features in videos, suitable for long video understanding.
Large-scale Pretraining
Pre-trained on the large-scale Kinetics-400 video dataset.

Model Capabilities

Video Classification
Spatio-Temporal Feature Extraction
Video Content Understanding

Use Cases

Video Content Analysis
Action Recognition
Identify human actions and behaviors in videos
Can recognize 400 action categories from the Kinetics-400 dataset
Video Content Classification
Classify and label video content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase