Open-source TEMPURA-Qwen2.5-VL-3B-s1 Model - Enhance Video Event Understanding and Temporal Segmentation Capabilities

TEMPURA Qwen2.5 VL 3B S1

Developed by andaba

TEMPURA is a video temporal understanding framework combining causal reasoning with fine-grained temporal segmentation, enhancing video event comprehension through two-stage training

Video-to-Text

Transformers

#Video Temporal Reasoning #Causal Event Prediction #Dense Event Segmentation

Downloads 16

Release Time : 5/4/2025

Model Overview

This model achieves temporal understanding and causal reasoning of video events through masked event prediction and video segmentation techniques, supporting video-to-text generation tasks

Model Features

Two-stage Training Paradigm

Stage one reconstructs missing events through masked event prediction, stage two learns video segmentation and dense description techniques

Temporal Understanding Capability

Deconstructs videos into non-overlapping events and generates timestamp-aligned detailed descriptions

Large-scale Training Data

Trained on VER dataset (containing 1 million training instances, 500k videos)

Model Capabilities

Video temporal understanding

Event causal reasoning

Video-to-text generation

Timestamp-aligned description generation

Use Cases

Video Analysis

Video Event Reasoning

Analyzing causal relationships and temporal sequences of events in videos

Outperforms existing strong baseline models

Temporal Localization

Accurately locating specific event timestamps in videos

Demonstrates excellent performance in benchmark tests

Property	Details
Base Model	Qwen/Qwen2.5-VL-3B-Instruct
Datasets	andaba/TEMPURA-VER
Library Name	transformers
Tags	text-generation-inference
Pipeline Tag	video-text-to-text

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

TEMPURA Qwen2.5 VL 3B S1

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action

🚀 Quick Start

✨ Features

Model Information

Model Weights

📄 License

Citing TEMPURA