S

Sumen Base

Developed by hoang-quoc-trung
Sumen is an end-to-end Transformer-based model specifically designed for converting mathematical formula images into LaTeX sequences, supporting both printed and handwritten formula recognition.
Downloads 311
Release Time : 4/5/2024

Model Overview

Trained on large-scale datasets, this model efficiently and accurately converts mathematical formula images into LaTeX code, suitable for academic publishing, educational technology, and other fields.

Model Features

End-to-End Transformer Architecture
Utilizes VisionEncoderDecoder structure to directly generate LaTeX sequences without intermediate steps
Multi-Type Formula Support
Capable of processing both printed and handwritten mathematical formulas
High-Performance Recognition
Achieves SOTA-level recognition accuracy on public test sets

Model Capabilities

Printed Mathematical Formula Recognition
Handwritten Mathematical Formula Recognition
Image-to-LaTeX Conversion
Complex Mathematical Expression Parsing

Use Cases

Academic Publishing
Paper Formula Digitization
Convert mathematical formulas in scanned papers into editable LaTeX code
Improves literature digitization efficiency and supports formula retrieval and reuse
Educational Technology
Automatic Homework Grading
Recognize mathematical formulas in students' handwritten assignments
Enables automated assessment of math homework
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase