# Conversational Interaction

Qwen2 VL OCR 2B Instruct GGUF
Apache-2.0
A multimodal model fine-tuned based on Qwen/Qwen2-VL-2B-Instruct, optimized for OCR, image-to-text conversion, LaTeX math solving, and handwriting recognition
Image-to-Text Supports Multiple Languages
Q
prithivMLmods
142
1
VARCO VISION 14B
VARCO-VISION-14B is a powerful English-Korean Vision-Language Model (VLM) that supports image and text input, generates text output, and possesses capabilities for grounding, referencing, and OCR.
Image-to-Text Transformers Supports Multiple Languages
V
NCSOFT
1,022
28
Yuna Ai V1
Yuna AI is a virtual companion model designed for emotional companionship and conversation, aiming to establish deep emotional connections with users.
Large Language Model Supports Multiple Languages
Y
yukiarimo
134
3
YOLO LLaMa 7B VisNav
Other
This project integrates the YOLO object detection model with the LLaMa 2 7B large language model, aiming to provide navigation assistance for visually impaired individuals in their daily travels.
Multimodal Fusion Transformers
Y
LearnItAnyway
19
1
Tapas Temporary Repo
Apache-2.0
TAPAS is a table-based question answering model that handles conversational QA tasks on tabular data through pre-training and fine-tuning.
Question Answering System Transformers English
T
lysandre
3,443
0
Tapas Tiny Finetuned Sqa
Apache-2.0
TAPAS is a QA model for tabular data. This tiny version is fine-tuned on the SQA dataset, suitable for table-based QA tasks in conversational scenarios.
Question Answering System Transformers English
T
google
2,391
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase