M

Mmalaya2

Developed by DataCanvas
A multimodal model fine-tuned based on InternVL-Chat-V1-5, excelling in MMBench benchmark tests
Downloads 26
Release Time : 8/19/2024

Model Overview

MMAlaya2 enhances multimodal understanding and generation capabilities through fine-tuning with 20 LoRA modules and TIES merging method, achieving GPT-4o level performance in Chinese multimodal benchmarks

Model Features

Multi-LoRA Module Fusion
Significantly improves model performance through fine-tuning with 20 LoRA modules and TIES merging method
Chinese Multimodal Advantage
Achieves 82.1 points in MMBench Chinese tests, on par with GPT-4o
Domain-Specific Optimization
Conducts error analysis and data supplementation for specific categories like natural relationships and image emotions

Model Capabilities

Image Understanding
Multimodal Q&A
Scene Recognition
Sentiment Analysis
Style Recognition

Use Cases

Visual Q&A
Image Scene Understanding
Recognizes scenes and contexts in images
Excellent performance in MMBench image scene category
Sentiment Analysis
Analyzes emotions conveyed in images
Improved accuracy in image sentiment category
Multimodal Reasoning
Natural Relationship Understanding
Understands natural relationships between objects in images
Reduced error rate in natural relationship category
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase