G

Gme Qwen2 VL 7B Instruct

Developed by Alibaba-NLP
Qwen2-VL-7B-Instruct is a multimodal vision-language model based on the Qwen2 architecture, supporting both Chinese and English, suitable for various natural language processing tasks.
Downloads 3,844
Release Time : 12/21/2024

Model Overview

This model is a 7B-parameter vision-language model capable of processing text and image inputs, performing text generation, image understanding, and multimodal tasks.

Model Features

Multimodal Capability
Supports text and image inputs, capable of understanding and processing multimodal information.
Multilingual Support
Supports English and Chinese, suitable for cross-language application scenarios.
High Performance
Outstanding performance in multiple benchmarks, especially in text similarity and classification tasks.

Model Capabilities

Text similarity calculation
Text classification
Text clustering
Information retrieval
Reranking
Multimodal understanding

Use Cases

E-commerce
Product Review Classification
Sentiment analysis and classification of product reviews
Achieved 97.33% accuracy in Amazon review classification tasks
Finance
Bank Customer Service Classification
Automatic classification of bank customer inquiries
Achieved 84.76% accuracy on the Banking77 dataset
Academic Research
Paper Clustering
Topic clustering of academic papers
Achieved 54.96% V-measure in ArXiv paper clustering tasks
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase