Q

Qwen2 VL OCR 2B Instruct GGUF

Developed by prithivMLmods
A multimodal model fine-tuned based on Qwen/Qwen2-VL-2B-Instruct, optimized for OCR, image-to-text conversion, LaTeX math solving, and handwriting recognition
Downloads 142
Release Time : 5/15/2025

Model Overview

A conversational model combining visual and textual understanding, supporting mixed tasks such as optical character recognition, handwritten text extraction, and mathematical formula parsing

Model Features

Multimodal OCR Capability
Capable of handling mixed recognition tasks for printed text, handwritten text, and mathematical formulas
Quantization Support
Provides multiple quantization versions from 1-bit to 8-bit to accommodate different hardware requirements
Conversational Interaction
Supports question-and-answer interactions based on visual input

Model Capabilities

Optical Character Recognition (OCR)
Handwritten Text Extraction
LaTeX Mathematical Formula Parsing
Image-to-Text Conversion
Visual Question Answering (VQA)

Use Cases

Document Digitization
Printed Document OCR
Convert printed text in scanned documents or photos into editable text
Supports complex layout recognition
Handwritten Note Transcription
Recognize messy handwritten content and convert it into digital text
Optimized for unconventional handwriting
Educational Assistance
Math Homework Parsing
Recognize handwritten or printed math problems and provide LaTeX-formatted parsing
Supports formula and symbol recognition
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase