Nougat Small
Nougat is a vision-language model based on the Donut architecture, specifically designed for converting scientific PDFs into Markdown format.
Downloads 10.28k
Release Time : 9/21/2023
Model Overview
This model employs Swin Transformer as the visual encoder and mBART as the text decoder, capable of predicting Markdown text directly from PDF image pixels in an autoregressive manner.
Model Features
PDF to Markdown Conversion
Specifically designed for scientific PDF documents, converting them into easy-to-use Markdown format.
End-to-End Processing
Directly predicts text from PDF image pixels without intermediate OCR steps.
Academic Document Optimization
Optimized for complex structures in academic documents such as mathematical formulas and tables.
Model Capabilities
PDF Document Parsing
Markdown Generation
Academic Document Processing
Mathematical Formula Recognition
Table Extraction
Use Cases
Academic Research
Paper Format Conversion
Convert academic paper PDFs into editable Markdown format.
Facilitates researchers in editing and reusing paper content.
Literature Digitization
Convert scanned scientific literature into structured digital documents.
Enhances searchability and accessibility of literature.
Publishing Industry
Document Format Conversion
Convert traditional PDF publications into modern Markdown format.
Facilitates multi-platform publishing and content management.
Featured Recommended AI Models
Š 2025AIbase