E

Eagle2 9B

Developed by KnutJaegersberg
Eagle2 is a high-performance series of vision-language models focused on enhancing model performance through optimized data strategies and training methods. Eagle2-9B is the large model in this series, achieving a good balance between performance and inference speed.
Downloads 15
Release Time : 1/23/2025

Model Overview

Eagle2-9B is a vision-language model (VLM) capable of processing image and text inputs to generate text outputs. It is built upon the Qwen2.5-7B-Instruct language model and Siglip+ConvNext vision model, supporting multilingual and multimodal tasks.

Model Features

Multimodal Capability
Capable of processing both image and text inputs, understanding visual content, and generating relevant text.
Multilingual Support
Supports 13 languages, including Chinese, English, and several other major languages.
High Performance
Excels in multiple benchmarks, particularly in document understanding, chart question answering, and information extraction tasks.
Long Context Support
Supports context lengths up to 16K, suitable for handling complex tasks.

Model Capabilities

Image Understanding
Text Generation
Multimodal Reasoning
Document Analysis
Chart Understanding
Video Understanding
Multilingual Processing

Use Cases

Document Processing
Document Question Answering
Extract information from document images and answer questions
Achieved 92.6 points on the DocVQA test set
Visual Question Answering
Chart Understanding
Understand and interpret chart content
Achieved 86.4 points on the ChartQA test set
Image Question Answering
Answer questions about image content
Achieved 83.0 points on the TextVQA validation set
Multimodal Reasoning
Mathematical Visual Reasoning
Solve problems requiring visual and mathematical reasoning
Achieved 63.8 points on the MathVista test set
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase