
Nougat Latex Base

Developed by Norm
This LaTeX OCR model is fine-tuned from Nougat-base to generate LaTeX code from images, and is optimized in particular for recognizing images of mathematical formulas.
Downloads 8,523
Release Time: 10/8/2023

Model Overview

Built on Nougat, the model improves the quality of LaTeX code generated from images by adjusting the input resolution and applying adaptive padding, which makes it particularly well suited to recognizing images of mathematical formulas.
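The listing does not spell out how the adaptive padding (filling) step works. As a rough illustration only, the sketch below shows one common way to avoid scaling artifacts: resize the formula image with a single scale factor so it is not stretched, then pad the remainder of the canvas. The function name `plan_resize_and_pad`, the `PadPlan` type, and the 560x224 target size in the example are hypothetical and are not taken from the model.

```python
from dataclasses import dataclass


@dataclass
class PadPlan:
    new_width: int    # width after the aspect-preserving resize
    new_height: int   # height after the aspect-preserving resize
    pad_left: int
    pad_top: int
    pad_right: int
    pad_bottom: int


def plan_resize_and_pad(width, height, target_width, target_height):
    """Fit a (width x height) image inside the target canvas without distortion.

    Illustrative letterboxing only; the model's actual padding scheme may differ.
    """
    if min(width, height, target_width, target_height) <= 0:
        raise ValueError("all dimensions must be positive")
    # One scale factor for both axes, so the formula keeps its aspect ratio.
    scale = min(target_width / width, target_height / height)
    new_width = max(1, min(target_width, round(width * scale)))
    new_height = max(1, min(target_height, round(height * scale)))
    # Split the leftover space evenly; any odd pixel goes to the right/bottom.
    extra_w = target_width - new_width
    extra_h = target_height - new_height
    pad_left = extra_w // 2
    pad_top = extra_h // 2
    return PadPlan(new_width, new_height,
                   pad_left, pad_top,
                   extra_w - pad_left, extra_h - pad_top)


# Hypothetical example: a wide 400x100 formula crop on a 560x224 canvas.
plan = plan_resize_and_pad(400, 100, 560, 224)
print(plan)
```

The plan only computes sizes and offsets; an image library would then perform the resize and fill the padded border, typically with the page background color.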

Model Features

Optimized input resolution
Adjusts the input resolution and applies adaptive padding to reduce scaling artifacts and improve the quality of the generated LaTeX code.
High-performance LaTeX generation
Outperforms the comparable model pix2tex on both token accuracy and normalized edit distance.
Special optimization for mathematical formulas
Tuned specifically for image segments containing mathematical formulas, making it suitable for processing academic and technical documents.
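The two metrics named above can be illustrated with a small sketch. The listing does not give the exact definitions used in the evaluation, so the code below assumes common ones: token accuracy as position-wise matches over the longer token sequence, and normalized edit distance as Levenshtein distance divided by the longer string length. Whitespace tokenization and all function names here are assumptions for illustration.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (each edit costs 1)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,               # delete x
                            curr[j - 1] + 1,           # insert y
                            prev[j - 1] + (x != y)))   # substitute or match
        prev = curr
    return prev[-1]


def normalized_edit_distance(pred, ref):
    """Edit distance divided by the longer length; 0.0 means identical."""
    longest = max(len(pred), len(ref))
    return 0.0 if longest == 0 else edit_distance(pred, ref) / longest


def token_accuracy(pred_tokens, ref_tokens):
    """Position-wise token matches divided by the longer sequence length."""
    longest = max(len(pred_tokens), len(ref_tokens))
    if longest == 0:
        return 1.0
    hits = sum(p == r for p, r in zip(pred_tokens, ref_tokens))
    return hits / longest


# Hypothetical prediction that gets one symbol wrong.
ref = r"\frac { a } { b }"
pred = r"\frac { a } { c }"
print(token_accuracy(pred.split(), ref.split()))
print(normalized_edit_distance(pred, ref))
```

Higher token accuracy is better, while a lower normalized edit distance is better, since it measures how many edits separate the prediction from the reference.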

Model Capabilities

Image-to-LaTeX code conversion
Mathematical formula recognition
Academic document processing

Use Cases

Academic research
Thesis formula extraction
Extract LaTeX code for mathematical formulas from images of academic theses.
Reported results: token accuracy 62.38%, normalized edit distance 0.0618 (lower is better).
Education
Teaching material processing
Convert handwritten or printed mathematical formulas into editable LaTeX format.