T

Table Llava V1.5 7b

Developed by SpursgoZmy
Table LLaVA 7B is an open-source multimodal chatbot specifically designed for understanding various table images and performing diverse table-related tasks.
Downloads 165
Release Time : 6/17/2024

Model Overview

An open-source multimodal chatbot designed for table image understanding, supporting tasks such as Q&A, table cell description, and structural comprehension.

Model Features

Multimodal Table Understanding
Specifically designed for table images, capable of handling various table tasks such as Q&A, description, and structural understanding.
Two-Stage Training Process
Utilizes pre-training and instruction fine-tuning to ensure strong performance on both table and non-table tasks.
High Compatibility
Fully compatible with LLaVA v1.5 code, allowing direct inference using the original codebase.

Model Capabilities

Table Image Understanding
Table Question Answering
Table Cell Description
Table Structure Analysis
Multimodal Instruction Following

Use Cases

Document Processing
Financial Statement Analysis
Automatically parse financial statement images and answer related questions
Data Table Extraction
Extract table data from images and generate structured descriptions
Research Applications
Multimodal Large Model Research
Serves as a benchmark model for multimodal table understanding research
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase