I

Im2latex

Developed by DGurgurov
A baseline model based on VisionEncoderDecoderModel, fine-tuned on datasets for generating LaTeX formulas from images.
Downloads 288
Release Time : 7/15/2024

Model Overview

This model can convert images containing mathematical formulas into LaTeX code, suitable for scenarios such as academic document processing and mathematical formula recognition.

Model Features

Hybrid Architecture
Combines a visual encoder (Swin Transformer) and a text decoder (GPT-2) to effectively handle image-to-text conversion tasks.
High-Precision Formula Recognition
Achieves a BLEU score of 0.67 on test sets, capable of accurately recognizing complex mathematical formulas.
Distributed Training
Uses PyTorch's Distributed Data Parallel (DDP) for efficient training.

Model Capabilities

Image Recognition
Mathematical Formula Conversion
LaTeX Code Generation

Use Cases

Academic Research
Digitizing Paper Formulas
Convert mathematical formulas from scanned documents or images into editable LaTeX code.
Improves efficiency in academic document processing.
Educational Assistance Tool
Helps students and teachers quickly obtain LaTeX representations of formulas in images.
Facilitates sharing and teaching of mathematical content.
Document Processing
PDF Formula Extraction
Extract formula images from PDF documents and convert them into editable formats.
Simplifies document editing workflows.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase