T

Timesformer Base Finetuned K400

Developed by facebook
TimeSformer is a video classification model pre-trained on the Kinetics-400 dataset, utilizing a spatiotemporal attention mechanism for video understanding.
Downloads 108.61k
Release Time : 10/7/2022

Model Overview

This model is primarily used for video classification tasks, capable of categorizing videos into one of the 400 possible classes in the Kinetics-400 dataset.

Model Features

Spatiotemporal Attention Mechanism
Employs an innovative spatiotemporal attention mechanism to process video data, eliminating the need for traditional 3D convolution operations.
Efficient Video Understanding
Directly models spatiotemporal relationships in videos through attention mechanisms, improving video understanding efficiency.
Fine-tuned on Kinetics-400
Fine-tuned on the large-scale Kinetics-400 video dataset, delivering excellent classification performance.

Model Capabilities

Video Classification
Spatiotemporal Feature Extraction
Video Content Understanding

Use Cases

Video Analysis
Action Recognition
Identify human actions and behaviors in videos
Can classify 400 different human actions
Video Content Classification
Classify and label video content
Supports 400 categories from the Kinetics-400 dataset
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase