Qwen2.5-vl-vqa-vibook Open-source Visual Question Answering Model - Free Deployment and Support for Vietnamese Image Question Answering

Qwen2.5 Vl Vqa Vibook

Developed by sunbv56

A visual question answering model based on the Qwen2.5 architecture, focusing on Vietnamese scenarios and supporting the answering of image-related questions.

Text-to-Image OtherOpen Source License:Apache-2.0 #Vietnamese visual question answering #Multimodal instruction fine-tuning #OCR enhanced understanding

Downloads 148

Release Time : 6/18/2025

Model Overview

This model is a visual question answering model that combines visual and language processing capabilities, can understand image content and answer related questions, and is specifically optimized for Vietnamese scenarios.

Model Features

Vietnamese support

Specifically optimized for Vietnamese scenarios and capable of handling Vietnamese visual question answering tasks.

Multimodal capabilities

Combines visual and language processing capabilities to understand image content and generate relevant answers.

Lightweight model

With a scale of 3B parameters, suitable for deployment in resource-constrained environments.

Model Capabilities

Image understanding

Vietnamese question answering

Multimodal reasoning

Use Cases

Education

Vietnamese learning assistance

Help students understand Vietnamese vocabulary and scenarios through images.

Customer service

Automated customer service

Answer customers' questions about products through images.

🚀 Model Card for Model ID

This model card provides details about a visual question - answering model. It includes information on the model's development, usage, training, evaluation, and more.

✨ Features

Pipeline Tag: Visual Question Answering
Base Model: Qwen/Qwen2.5 - VL - 3B - Instruct
Library Name: peft
License: apache - 2.0
Datasets: LR - AI - Labs/vi - OCR_VQA
Language: vi

📚 Documentation

Model Details

Model Description

Developed by: [More Information Needed]
Funded by [optional]: [More Information Needed]
Shared by [optional]: [More Information Needed]
Model type: [More Information Needed]
Language(s) (NLP): [More Information Needed]
License: apache - 2.0
Finetuned from model [optional]: [More Information Needed]

Model Sources [optional]

Repository: [More Information Needed]
Paper [optional]: [More Information Needed]
Demo [optional]: [More Information Needed]

Uses

Direct Use

[More Information Needed]

Downstream Use [optional]

[More Information Needed]

Out-of-Scope Use

[More Information Needed]

Bias, Risks, and Limitations

[More Information Needed]

Recommendations

⚠️ Important Note

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

How to Get Started with the Model

Use the code below to get started with the model. [More Information Needed]

Training Details

Training Data

[More Information Needed]

Training Procedure

Preprocessing [optional]

[More Information Needed]

Training Hyperparameters

Training regime: [More Information Needed]

Speeds, Sizes, Times [optional]

[More Information Needed]

Evaluation

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Factors

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Model Examination [optional]

[More Information Needed]

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

Hardware Type: [More Information Needed]
Hours used: [More Information Needed]
Cloud Provider: [More Information Needed]
Compute Region: [More Information Needed]
Carbon Emitted: [More Information Needed]

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

Hardware

[More Information Needed]

Software

[More Information Needed]

Citation [optional]

BibTeX: [More Information Needed]

APA: [More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]

Framework versions

Property	Details
PEFT Version	0.14.0

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご