Sumen Base
Sumen is an end-to-end Transformer-based model specifically designed for converting mathematical formula images into LaTeX sequences, supporting both printed and handwritten formula recognition.
Downloads 311
Release Time : 4/5/2024
Model Overview
Trained on large-scale datasets, this model efficiently and accurately converts mathematical formula images into LaTeX code, suitable for academic publishing, educational technology, and other fields.
Model Features
End-to-End Transformer Architecture
Utilizes VisionEncoderDecoder structure to directly generate LaTeX sequences without intermediate steps
Multi-Type Formula Support
Capable of processing both printed and handwritten mathematical formulas
High-Performance Recognition
Achieves SOTA-level recognition accuracy on public test sets
Model Capabilities
Printed Mathematical Formula Recognition
Handwritten Mathematical Formula Recognition
Image-to-LaTeX Conversion
Complex Mathematical Expression Parsing
Use Cases
Academic Publishing
Paper Formula Digitization
Convert mathematical formulas in scanned papers into editable LaTeX code
Improves literature digitization efficiency and supports formula retrieval and reuse
Educational Technology
Automatic Homework Grading
Recognize mathematical formulas in students' handwritten assignments
Enables automated assessment of math homework
Featured Recommended AI Models
Š 2025AIbase