TVC 7B
TVC-7B is a 7 billion parameter model based on Qwen2-VL-7B-Instruct. It supports both Chinese and English, has an 8K token context window, and excels in long-chain reasoning and multimodal processing.
Downloads 1,658
Release Time : 3/6/2025
Model Overview
TVC-7B is a multimodal model capable of handling image-to-text conversion tasks, especially suitable for scenarios requiring long-chain reasoning.
Model Features
Long-chain reasoning ability
Supports an 8K token context window, suitable for handling complex tasks requiring multi-step reasoning.
Multimodal processing
Can handle both image and text inputs simultaneously to achieve image-to-text conversion.
Bilingual support
Supports both Chinese and English, suitable for cross-language application scenarios.
Model Capabilities
Image-text conversion
Long-chain reasoning
Multimodal processing
Chinese-English bilingual understanding
Use Cases
Visual question answering
Image content reasoning
Perform multi-step reasoning based on image content to answer complex questions.
Can accurately answer visual questions requiring multi-step reasoning.
Multimodal interaction
Image description generation
Generate detailed text descriptions based on images.
Generate accurate and detailed image descriptions.
Featured Recommended AI Models
Qwen2.5 VL 7B Abliterated Caption It I1 GGUF
Apache-2.0
Quantized version of Qwen2.5-VL-7B-Abliterated-Caption-it, supporting multilingual image description tasks.
Image-to-Text
Transformers Supports Multiple Languages

Q
mradermacher
167
1
Nunchaku Flux.1 Dev Colossus
Other
The Nunchaku quantized version of the Colossus Project Flux, designed to generate high-quality images based on text prompts. This model minimizes performance loss while optimizing inference efficiency.
Image Generation English
N
nunchaku-tech
235
3
Qwen2.5 VL 7B Abliterated Caption It GGUF
Apache-2.0
This is a static quantized version based on the Qwen2.5-VL-7B model, focusing on image captioning generation tasks and supporting multiple languages.
Image-to-Text
Transformers Supports Multiple Languages

Q
mradermacher
133
1
Olmocr 7B 0725 FP8
Apache-2.0
olmOCR-7B-0725-FP8 is a document OCR model based on the Qwen2.5-VL-7B-Instruct model. It is fine-tuned using the olmOCR-mix-0225 dataset and then quantized to the FP8 version.
Image-to-Text
Transformers English

O
allenai
881
3
Lucy 128k GGUF
Apache-2.0
Lucy-128k is a model developed based on Qwen3-1.7B, focusing on proxy-based web search and lightweight browsing, and can run efficiently on mobile devices.
Large Language Model
Transformers English

L
Mungert
263
2
Š 2025AIbase