T

Tvl Mini 0.1

Developed by 2Vasabi
This is a LORA fine-tuned version of the Qwen2-VL-2B model for Russian, supporting multimodal tasks.
Downloads 23
Release Time : 9/13/2024

Model Overview

This model is a Russian multimodal model based on LORA fine-tuning of Qwen2-VL-2B-Instruct, primarily used for text generation tasks while also supporting various multimodal tasks such as visual reasoning, image captioning, and visual question answering.

Model Features

Multilingual support
Specially optimized for Russian while maintaining English capabilities
Multimodal capabilities
Supports joint processing of images and text, enabling visual reasoning and question answering
Efficient fine-tuning
Uses LORA technology for efficient fine-tuning of the base model

Model Capabilities

Text generation
Visual reasoning
Image captioning
Visual question answering
Multimodal conversation

Use Cases

Content generation
Image caption generation
Generate detailed textual descriptions based on input images
Can accurately describe the main content and scenes in the image
Intelligent Q&A
Visual question answering
Answer various questions about image content
Can understand image content and provide relevant answers
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase