G

Gme Qwen2 VL 2B Instruct

Developed by Alibaba-NLP
Qwen2-VL-2B-Instruct is a vision-language model based on the Qwen2 architecture, supporting both English and Chinese, suitable for various natural language processing tasks.
Downloads 31.18k
Release Time : 12/21/2024

Model Overview

This model is a multimodal vision-language model capable of handling text and image-related tasks, with special optimization for instruction-following capabilities.

Model Features

Multilingual Support
Supports English and Chinese, suitable for cross-language tasks.
Multi-Task Processing
Capable of performing various NLP tasks such as sentence similarity, classification, and retrieval.
Vision-Language Capabilities
Combines visual and language processing abilities, suitable for multimodal tasks.

Model Capabilities

Text Classification
Sentence Similarity Calculation
Information Retrieval
Clustering Analysis
Re-ranking
Multimodal Processing

Use Cases

Text Analysis
Sentiment Analysis
Classify sentiment polarity in Amazon reviews.
Accuracy up to 96.75%
Intent Recognition
Identify user intent in bank customer service conversations.
Accuracy 80.24%
Information Retrieval
Document Retrieval
Perform document retrieval on the ArguAna dataset.
Average precision@10 reaches 52.78
Multimodal Applications
Image-Text Matching
Perform image-text matching tasks by combining visual and language information.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase