D

Document Qa Model

Developed by lakshya-rawat
A document Q&A model fine-tuned based on LayoutLMv3-base, capable of understanding documents using OCR data and answering related questions.
Downloads 54
Release Time : 4/19/2025

Model Overview

This model is trained to utilize OCR data (via PaddleOCR) to understand documents and accurately answer questions related to structured information in document layouts.

Model Features

Multilingual Support
Supports document Q&A in English, Spanish, French, German, and Italian.
Layout Awareness
Capable of understanding document layouts and structures to improve Q&A accuracy.
OCR Integration
Enhances document comprehension by combining text and positional information extracted via PaddleOCR.

Model Capabilities

Document Image Q&A
Text Information Extraction
Structured Query Answering

Use Cases

Document Processing
Utility Bill Parsing
Extracts and answers questions about fees, dates, etc., from utility bill images.
High accuracy in extracting fee and date information.
Invoice Information Extraction
Extracts vendor, amount, and product information from invoice images.
Structured output of vendor and amount information.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase