T

Timezero Charades 7B

Developed by wwwyyy
TimeZero is a reasoning-guided large vision-language model (LVLM) specifically designed for temporal video grounding (TVG) tasks. It identifies temporal segments in videos corresponding to natural language queries through reinforcement learning methods.
Downloads 183
Release Time : 3/18/2025

Model Overview

TimeZero is a reasoning-guided large vision-language model (LVLM) adept at identifying temporal segments in videos that correspond to natural language queries. It is entirely trained via reinforcement learning, enabling the model to reason about video-language relationships during inference.

Model Features

Reinforcement Learning Training
Fully trained with reinforcement learning to enhance the accuracy of temporal boundary prediction.
Reasoning During Inference
Demonstrates emergent reasoning capabilities during inference, generating chains of thought to support segment predictions.
SOTA Performance
Sets a new record on the Charades-STA benchmark.

Model Capabilities

Temporal Video Grounding
Video-Language Relationship Reasoning
Temporal Segment Identification

Use Cases

Video Analysis
Video Segment Retrieval
Locate specific segments in videos based on natural language queries.
Achieves 83.3% R1@0.3 accuracy on the Charades-STA benchmark.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase