# Conversational Interaction

Qwen2 VL OCR 2B Instruct GGUF

A multimodal model fine-tuned from Qwen/Qwen2-VL-2B-Instruct and optimized for OCR, image-to-text conversion, LaTeX math solving, and handwriting recognition.

Image-to-Text, Supports Multiple Languages

VARCO VISION 14B

VARCO-VISION-14B is an English-Korean Vision-Language Model (VLM) that accepts image and text input, generates text output, and supports grounding, referencing, and OCR.

Transformers, Supports Multiple Languages

Yuna AI is a virtual companion model designed for emotional companionship and conversation, with the aim of building deep emotional connections with users.

Large Language Model, Supports Multiple Languages

YOLO LLaMa 7B VisNav

This project integrates the YOLO object detection model with the LLaMa 2 7B large language model to provide navigation assistance for visually impaired people during everyday travel.

Multimodal Fusion

Tapas Temporary Repo

TAPAS is a table question answering model that handles conversational QA over tabular data and is trained through pre-training followed by fine-tuning.

Question Answering System

Transformers, English

Tapas Tiny Finetuned Sqa

TAPAS is a QA model for tabular data. This tiny version is fine-tuned on the SQA dataset and is suited to table-based QA in conversational scenarios.

Question Answering System

Transformers, English

© 2025 AIbase