3

360VL 8B

Developed by qihoo360
360VL is a multimodal model developed based on the LLama3 language model, featuring powerful image understanding and bilingual dialogue capabilities.
Downloads 22
Release Time : 5/16/2024

Model Overview

360VL is an open-source large multimodal model developed based on the LLama3 language model, designed with a globally aware multi-branch projector architecture, supporting bilingual dialogue (Chinese-English) and image understanding.

Model Features

Multiturn Image-Text Dialogue
Can simultaneously receive text and image inputs and output text content, supporting multiturn visual question answering for a single image.
Bilingual Text Support
Supports bilingual dialogue (Chinese-English), including text recognition in images.
Powerful Image Understanding
Excels at analyzing visual content, efficiently completing tasks such as image information extraction, organization, and summarization.
Fine Image Resolution
Supports higher-resolution image understanding at 672×672.

Model Capabilities

Multimodal Dialogue
Image Understanding
Visual Question Answering
Bilingual Text Processing

Use Cases

Intelligent Customer Service
Product Inquiry
User uploads a product image and asks for product information.
The model can accurately identify the product and provide relevant information.
Education
Image-Based Learning Assistance
Students upload images of study materials and ask related questions.
The model can understand the image content and provide answers.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase