P

Prometheus Vision 13b V1.0

Developed by prometheus-eval
The first open-source vision-language model specifically developed for evaluation tasks, demonstrating high correlation with both GPT-4V and human evaluators
Downloads 121
Release Time : 1/14/2024

Model Overview

Prometheus-Vision is a vision-language model specifically designed for evaluation tasks, capable of generating linguistic feedback and score judgments based on images, instructions, responses to be evaluated, scoring criteria, and reference answers.

Model Features

Multi-component Evaluation Capability
Capable of processing five input components: image, instruction, response to be evaluated, scoring criteria, and reference answer, to generate detailed feedback and scores
High Correlation with GPT-4V
Evaluation results show high correlation with GPT-4V and human evaluators, with potential to replace GPT-4V evaluations
Fine-grained Scoring
Provides fine-grained scoring from 1 to 5 points, accompanied by detailed evaluation feedback

Model Capabilities

Image Understanding
Text Generation
Visual Question Answering
Response Evaluation
Feedback Generation

Use Cases

Educational Assessment
Visual Question Answering System Evaluation
Evaluating the quality of responses from visual question answering systems
Provides scores and feedback highly consistent with human evaluations
Content Moderation
Image Content Compliance Evaluation
Assessing the compliance and appropriateness of image-related content
Generates detailed compliance evaluation reports
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase