Timesformer-large-finetuned-k400 Open-source Video Classification Model

Timesformer Large Finetuned K400

Developed by fcakyon

TimeSformer is a video classification model built on a spatio-temporal attention mechanism and designed for video understanding tasks.

Video Processing

Transformers

#Video Action Recognition #Spatio-Temporal Attention Mechanism #Kinetics-400 Pretraining

Downloads 254

Release Time: 12/10/2022

Model Overview

This model is fine-tuned on the Kinetics-400 dataset and classifies a video into one of 400 possible categories. It relies purely on attention to process spatio-temporal information in videos.

Model Features

Pure Attention Mechanism

Built entirely on the Transformer architecture, it processes spatio-temporal information in videos without convolutional operations.
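To make the attention-only design concrete, the sketch below counts how many tokens a single patch attends to. It assumes the "divided space-time attention" scheme and the per-patch comparison counts given in the TimeSformer paper, which this page does not describe. The frame size (224), patch size (16) and frame count (8) are illustrative values, not confirmed settings of this checkpoint.

```python
def patches_per_frame(image_size: int, patch_size: int) -> int:
    """Number of non-overlapping square patches in one square frame."""
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    side = image_size // patch_size
    return side * side


def comparisons_per_patch(num_patches: int, num_frames: int) -> dict:
    """Keys one query patch is compared against (counts assumed from the paper)."""
    return {
        # Joint attention: every patch of every frame, plus the classification token.
        "joint": num_patches * num_frames + 1,
        # Divided attention: one temporal pass over the frames and one spatial
        # pass over the patches, each pass also seeing the classification token.
        "divided": num_patches + num_frames + 2,
    }


# Illustrative values only (assumptions, see the text above).
n = patches_per_frame(image_size=224, patch_size=16)
print(n, comparisons_per_patch(n, num_frames=8))
```

Under these assumed values each frame yields 196 patches, and a patch is compared against 1569 keys with joint attention but only 206 with divided attention, which is the kind of saving that lets attention replace convolution at video scale.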

Efficient Video Understanding

Captures spatio-temporal features in videos effectively and is suited to long-video understanding.
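A video classifier of this kind consumes a fixed number of frames per clip, so a long video has to be reduced to that many frames first. The helper below is a minimal sketch of one common approach, uniform sampling. The function name `sample_frame_indices` is hypothetical, and the page does not state which sampling strategy the model's authors used.

```python
def sample_frame_indices(total_frames: int, num_samples: int) -> list:
    """Return num_samples frame indices spread evenly across the clip.

    The clip is split into num_samples equal segments and the frame at the
    centre of each segment is picked. If the clip is shorter than
    num_samples, some indices repeat.
    """
    if total_frames <= 0 or num_samples <= 0:
        raise ValueError("total_frames and num_samples must be positive")
    step = total_frames / num_samples
    return [
        min(total_frames - 1, int(step * i + step / 2))
        for i in range(num_samples)
    ]


# Example: reduce a 300-frame clip to 8 evenly spaced frames.
print(sample_frame_indices(300, 8))
```

Sampling segment centres rather than segment starts keeps the selected frames away from the very first frame, which is often a fade-in or title card.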

Large-scale Pretraining

Pre-trained on the large-scale Kinetics-400 video dataset.

Model Capabilities

Video Classification

Spatio-Temporal Feature Extraction

Video Content Understanding

Use Cases

Video Content Analysis

Action Recognition

Identify human actions and behaviors in videos

Recognizes the 400 action categories of the Kinetics-400 dataset
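Action recognition with this model could look roughly like the sketch below, which uses the Hugging Face `transformers` classes `AutoImageProcessor` and `TimesformerForVideoClassification`. The Hub id in `MODEL_ID` is an assumption inferred from the page title and developer name, and `softmax`, `top_k` and `classify` are hypothetical helper names. The heavy imports sit inside `classify` so the ranking helpers can be used without `torch` installed.

```python
import math

# Assumed Hub id, inferred from the page title and the developer name.
MODEL_ID = "fcakyon/timesformer-large-finetuned-k400"


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k(logits, id2label, k=5):
    """Return the k most probable (label, probability) pairs, best first."""
    probs = softmax(logits)
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return [(id2label[i], probs[i]) for i in ranked[:k]]


def classify(frames, k=5):
    """Classify one clip given as a list of frames (numpy arrays or PIL images).

    The number of frames should match what the checkpoint expects; that value
    is not stated on this page.
    """
    import torch
    from transformers import AutoImageProcessor, TimesformerForVideoClassification

    processor = AutoImageProcessor.from_pretrained(MODEL_ID)
    model = TimesformerForVideoClassification.from_pretrained(MODEL_ID)
    model.eval()

    # The processor resizes and normalizes the frames into one batched tensor.
    inputs = processor(frames, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0]

    # id2label maps class indices to Kinetics-400 action names.
    return top_k(logits.tolist(), model.config.id2label, k)
```

Calling `classify(frames)` downloads the checkpoint on first use and returns the highest-scoring action labels with their probabilities.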

Video Content Classification

Classify and label video content
