TinyLLaVA-Video-R1: An Open-Source Video Reasoning Model Strengthened by Reinforcement Learning, with Emergent Reasoning Behavior

TinyLLaVA-Video-R1

Developed by Zhang199

TinyLLaVA-Video-R1 is a small-scale video reasoning model built on TinyLLaVA-Video, a traceably trained base model. Reinforcement learning significantly improves its reasoning and thinking abilities, and the resulting model exhibits emergent 'epiphany moments'.

Video-to-Text

Transformers

Open Source License: Apache-2.0 #Video Question Answering and Reasoning #Small-scale and Efficient #Emergent Epiphany

Downloads 123

Release Time: 4/13/2025

Model Overview

This model focuses on video-to-text generation: it analyzes video content to generate text descriptions or answer questions about it.

Model Features

Reinforcement Learning Optimization

Through reinforcement learning on general video question-answering datasets, the model's reasoning and thinking abilities have been significantly improved.
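To make the idea concrete, here is a minimal sketch of a rule-based reward for multiple-choice video question answering. It is an illustration under stated assumptions, not the model's documented training recipe: the `<think>`/`<answer>` output format, the function name `reward`, and the 0.5/1.0 weights are all hypothetical and do not come from this page.

```python
import re

# Hypothetical output format (an assumption, not taken from the model card):
# reasoning inside <think>...</think>, final option inside <answer>...</answer>.
PATTERN = re.compile(
    r"^<think>(.+?)</think>\s*<answer>(.+?)</answer>$",
    re.DOTALL,
)


def reward(completion: str, correct_option: str) -> float:
    """Score one sampled completion for a multiple-choice question.

    Returns 0.0 if the completion ignores the expected format,
    0.5 for a well-formed but wrong answer, and 1.5 for a
    well-formed correct answer. The weights are illustrative.
    """
    match = PATTERN.match(completion.strip())
    if match is None:
        return 0.0  # malformed output earns nothing

    format_reward = 0.5  # hypothetical bonus for following the format
    answer = match.group(2).strip().upper()
    is_correct = answer == correct_option.strip().upper()
    accuracy_reward = 1.0 if is_correct else 0.0
    return format_reward + accuracy_reward
```

A reward like this is cheap to compute and needs no learned reward model, which is one reason rule-based rewards are a common choice when the training data has a single verifiable answer per question.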

Emergent Properties

The model exhibits the emergent property of 'epiphany moments,' enabling better understanding and analysis of complex video content.

Small-scale and Efficient

As a small-scale model, TinyLLaVA-Video-R1 provides excellent video understanding capabilities while maintaining efficiency.

Model Capabilities

Video Content Understanding

Video Question Answering

Video-text Generation

Use Cases

Video Analysis

Video Question Answering System

Used to build intelligent systems capable of answering questions about video content.

Evaluated on several video benchmarks, scoring 46.6 on Video-MME (without subtitles) and 49.5 on MVBench; see the results table for the full set.

Video Content Summarization

Automatically generates text summaries of video content.

Education

Educational Video Understanding

Helps students understand educational video content and answer related questions.

Property	Details
Model Type	Video-text-to-text
Library Name	transformers
License	Apache-2.0

Model (HF Path)	Video-MME (w/o sub)	MVBench	MLVU	MMVU (mc)
Zhang199/TinyLLaVA-Video-R1	46.6	49.5	52.4	46.9
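
For readers who want to work with these numbers programmatically, the following sketch parses the tab-separated result row and derives the mean score and the strongest benchmark. The helper name `parse_row` is hypothetical; the scores are copied from the table above, and the unweighted mean is only a convenience figure, not a metric reported by the authors.

```python
# Benchmark names and the result row, copied from the table above.
header = ["Video-MME (w/o sub)", "MVBench", "MLVU", "MMVU (mc)"]
row = "Zhang199/TinyLLaVA-Video-R1\t46.6\t49.5\t52.4\t46.9"


def parse_row(line, benchmarks):
    """Split one tab-separated row into (model path, {benchmark: score})."""
    model, *cells = line.split("\t")
    return model, dict(zip(benchmarks, map(float, cells)))


model, scores = parse_row(row, header)

# Unweighted mean across the four benchmarks (illustrative only).
mean_score = sum(scores.values()) / len(scores)

# Benchmark on which the model scores highest.
best = max(scores, key=scores.get)

print(f"{model}: mean {mean_score:.2f}, best on {best}")
```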
