O

Oryx 1.5 7B

Developed by THUdyh
Oryx-1.5-7B is a 7B-parameter model developed based on the Qwen2.5 language model, supporting a 32K token context window and specializing in efficiently processing visual inputs of arbitrary spatial dimensions and durations.
Downloads 133
Release Time : 10/22/2024

Model Overview

Oryx-1.5-7B is a multimodal language model capable of handling video and image inputs, supporting both English and Chinese, and suitable for video content understanding and generation tasks.

Model Features

Efficient Visual Processing
Capable of efficiently processing visual inputs of arbitrary spatial dimensions and durations, including videos and images.
Long Context Support
Supports a 32K token context window, making it suitable for processing long video content.
Multilingual Support
Supports processing and generation in both English and Chinese.

Model Capabilities

Video Content Understanding
Video Content Description Generation
Multimodal Reasoning
Long Video Processing

Use Cases

Video Content Analysis
Video Content Description
Provides detailed descriptions of input video content
Generates accurate textual descriptions of video content
Education
Educational Video Understanding
Understands and summarizes educational video content
Helps students quickly grasp key points of videos
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase