đ TableLlama: Towards Open Large Generalist Models for Tables
TableLlama is an open-source large generalist model designed for a variety of table-based tasks. It is trained on a carefully curated instruction-tuning dataset for tables and can handle context lengths of up to 8K tokens.
đ Quick Start
You can use the TableLlama models through Hugging Face's Transformers library. For more advanced usage, check our GitHub repo: https://osu-nlp-group.github.io/TableLlama/
⨠Features
- Tailored for Tables: Specifically designed to handle various table-based tasks.
- Large Context Handling: Can handle up to 8K context.
- Trained on Comprehensive Data: Trained on the 🤗 TableInstruct dataset, which covers a variety of real-world tables and realistic tasks.
đĻ Installation
No dedicated installation steps are required: the models load through the Hugging Face Transformers library, so installing `transformers` and `torch` (e.g., via pip) is sufficient.
đģ Usage Examples
Basic Usage
You can use the models through Hugging Face's Transformers library.
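A minimal loading sketch (not from the original card; the checkpoint ID `osunlp/TableLlama` and the generation settings are assumptions, so verify the exact repository name on the model page):

```python
# Minimal sketch: load TableLlama with Hugging Face Transformers and generate a response.
# The checkpoint ID "osunlp/TableLlama" is an assumption -- verify it on the model card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "osunlp/TableLlama"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

prompt = "..."  # build this string with the prompt format shown below
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```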
Advanced Usage
Check our GitHub repo for more advanced usage: https://osu-nlp-group.github.io/TableLlama/
Prompt Format
Below is an instruction that describes a task, paired with an input that provides further context. Write a response that
appropriately completes the request.
### Instruction:
{instruction}
### Input:
{input}
### Question:
{question}
### Response:
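For convenience, the template above can be filled programmatically. A minimal sketch (the `build_prompt` helper is ours, not part of the released code):

```python
# Sketch of filling the TableLlama prompt template; the helper name is illustrative.
PROMPT_TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input that provides "
    "further context. Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Question:\n{question}\n\n"
    "### Response:\n"
)

def build_prompt(instruction: str, table_input: str, question: str) -> str:
    """Combine a task description, a serialized table, and a question into one prompt."""
    return PROMPT_TEMPLATE.format(
        instruction=instruction, input=table_input, question=question
    )
```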
đ Documentation
Project Page
https://osu-nlp-group.github.io/TableLlama/
Paper
https://arxiv.org/abs/2311.09206
Dataset
https://huggingface.co/datasets/osunlp/TableInstruct/
Model
TableLlama-7B
Data
The models are trained on the 🤗 TableInstruct dataset, a comprehensive table-based instruction-tuning dataset covering a variety of real-world tables and realistic tasks. It includes 14 datasets spanning 11 tasks in total. Check out the dataset card for more details.
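A hedged sketch for inspecting the data with the 🤗 `datasets` library (whether the repository loads directly like this, or requires pointing `load_dataset` at specific data files, depends on how it is packaged; see the dataset card):

```python
# Sketch: load TableInstruct with the Hugging Face datasets library.
# The exact loading call depends on how the repo is packaged (it may require
# specifying data_files), and the "train" split name is an assumption.
from datasets import load_dataset

table_instruct = load_dataset("osunlp/TableInstruct")
print(table_instruct)               # available splits and their sizes
print(table_instruct["train"][0])   # one instruction/input/question/response example
```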
Training Procedure
The models are fine-tuned on the TableInstruct dataset, using the fully fine-tuned LongLoRA (7B) model as the base model; LongLoRA replaces the vanilla attention mechanism of the original Llama-2 (7B) with shift short attention. Training takes 9 days on a cluster of 48 A100 (80 GB) GPUs. Check out our paper for more details.
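For intuition only, here is a toy sketch of the shift short attention (S²-Attn) idea from LongLoRA, not the authors' implementation: attention is computed within fixed-size groups of tokens, and half of the attention heads are shifted by half a group so information can flow across group boundaries. Causal masking and other training details are omitted.

```python
# Toy illustration of shift short attention (S2-Attn); not the released implementation.
import torch
import torch.nn.functional as F

def shift_short_attention(q, k, v, group_size):
    # q, k, v: (batch, heads, seq_len, head_dim); seq_len must be divisible by group_size.
    b, h, n, d = q.shape
    half = h // 2
    shift = group_size // 2

    # Shift the second half of the heads by half a group along the sequence dimension.
    q = torch.cat([q[:, :half], q[:, half:].roll(-shift, dims=2)], dim=1)
    k = torch.cat([k[:, :half], k[:, half:].roll(-shift, dims=2)], dim=1)
    v = torch.cat([v[:, :half], v[:, half:].roll(-shift, dims=2)], dim=1)

    # Reshape so attention is computed independently within each group of tokens.
    g = n // group_size
    q = q.reshape(b, h, g, group_size, d)
    k = k.reshape(b, h, g, group_size, d)
    v = v.reshape(b, h, g, group_size, d)
    out = F.scaled_dot_product_attention(q, k, v)  # (b, h, g, group_size, d)

    # Merge groups back and undo the shift for the second half of the heads.
    out = out.reshape(b, h, n, d)
    out = torch.cat([out[:, :half], out[:, half:].roll(shift, dims=2)], dim=1)
    return out

if __name__ == "__main__":
    q = k = v = torch.randn(1, 8, 64, 16)  # toy tensors: batch=1, heads=8, seq=64, dim=16
    print(shift_short_attention(q, k, v, group_size=16).shape)  # torch.Size([1, 8, 64, 16])
```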
Evaluation
The models are evaluated on 8 in-domain datasets covering 8 tasks and 6 out-of-domain datasets covering 4 tasks.
đ License
The project is released under the CC-BY-4.0 license.
Limitations
We have tried our best to build table generalist models. However, we acknowledge that performance may vary with the complexity and specifics of individual table tasks and datasets, and that not all table-based tasks can be covered comprehensively.
Citation
If you use the models, data, or code from this project, please cite the original paper:
@misc{zhang2023tablellama,
title={TableLlama: Towards Open Large Generalist Models for Tables},
author={Tianshu Zhang and Xiang Yue and Yifei Li and Huan Sun},
year={2023},
eprint={2311.09206},
archivePrefix={arXiv},
primaryClass={cs.CL}
}