N

Nougat Base

Developed by facebook
Nougat is a model based on the Donut architecture, specifically trained for transcribing scientific PDFs into easy-to-use Markdown format
Downloads 8,151
Release Time : 9/21/2023

Model Overview

This model is trained for PDF-to-Markdown conversion, employing Swin Transformer as the visual encoder and mBART model as the text decoder, capable of autoregressively predicting Markdown content

Model Features

PDF to Markdown
Transcription capability specifically optimized for scientific PDF documents
Autoregressive prediction
Predicts Markdown content with only PDF image pixels as input
Hybrid architecture
Combines the strengths of Swin Transformer visual encoder and mBART text decoder

Model Capabilities

PDF document parsing
Markdown generation
Academic document processing

Use Cases

Academic document processing
Scientific paper transcription
Convert PDF-format scientific papers into structured Markdown format
Improves document readability and editability
Technical document conversion
Convert technical documents from PDF to more manageable Markdown format
Facilitates content management and version control
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase