Arabic Small Nougat

Developed by MohamedRashad
An end-to-end structured optical character recognition (OCR) system designed specifically for Arabic, fine-tuned from the facebook/nougat-small model
Downloads 1,149
Release Time: 2/17/2024

Model Overview

This model is an end-to-end structured OCR system for Arabic books. It converts images of Arabic book pages into structured text, primarily in Markdown format.
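
The image-to-Markdown flow described above can be sketched with the Hugging Face transformers library, which provides NougatProcessor and VisionEncoderDecoderModel for Nougat-style checkpoints. This is a minimal sketch, not the author's reference code: the repository id in MODEL_ID is an assumption inferred from the developer and model names on this page, and the ocr_page helper and its max_new_tokens default are hypothetical. It assumes torch, Pillow, and transformers are installed.

```python
# Assumption: the repository id is inferred from the developer and model names
# on this page; verify it on the Hugging Face Hub before use.
MODEL_ID = "MohamedRashad/arabic-small-nougat"


def ocr_page(image_path: str, max_new_tokens: int = 2048) -> str:
    """Convert one scanned book page image into Markdown text (hypothetical helper)."""
    # Heavy third-party imports stay local so importing this module is cheap.
    import torch
    from PIL import Image
    from transformers import NougatProcessor, VisionEncoderDecoderModel

    processor = NougatProcessor.from_pretrained(MODEL_ID)
    model = VisionEncoderDecoderModel.from_pretrained(MODEL_ID)
    device = "cuda" if torch.cuda.is_available() else "cpu"
    model.to(device)
    model.eval()

    # The processor resizes and normalizes the page into the tensor the encoder expects.
    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    # The decoder generates Markdown tokens directly from the image features.
    with torch.no_grad():
        output_ids = model.generate(
            pixel_values.to(device),
            max_new_tokens=max_new_tokens,
        )

    return processor.batch_decode(output_ids, skip_special_tokens=True)[0]
```

Calling ocr_page("page_001.png") with a path to a scanned page (a placeholder file name) would return the recognized page as a Markdown string. Loading the model inside the function keeps the sketch short; for many pages, loading it once and reusing it would be faster.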

Model Features

Arabic OCR Optimization
Specially optimized for Arabic text recognition, capable of handling complex layouts in Arabic books
Structured Output
Generates structured text in Markdown format, preserving the original document's formatting information
End-to-End Processing
Complete processing pipeline from image to text without intermediate steps

Model Capabilities

Arabic Text Recognition
English Text Recognition
Book Image Processing
Markdown Format Generation

Use Cases

Literature Digitization
Digitization of Ancient Arabic Texts
Convert images of ancient Arabic texts into editable digital text
Makes the content of ancient texts digital and searchable
Printed Material Processing
Arabic Book Scanning
Process scanned Arabic book pages to extract text content
Generates structured e-book content
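
As an illustration of the book-scanning use case, per-page Markdown output could be stitched into one document before conversion to an e-book format. The sketch below uses only the Python standard library; the build_book helper and the page-marker comment format are hypothetical choices for this example, not something the model provides.

```python
def build_book(pages: list[str], title: str) -> str:
    """Join per-page Markdown strings into a single Markdown document."""
    parts = [f"# {title}"]
    for number, page in enumerate(pages, start=1):
        body = page.strip()
        if not body:
            continue  # skip pages where no text was recognized
        # An HTML comment records the source page without affecting rendering.
        parts.append(f"<!-- page {number} -->\n{body}")
    return "\n\n".join(parts) + "\n"
```

For example, build_book(["first page", "", "third page"], "My Book") yields a document with a top-level title followed by pages 1 and 3, each preceded by its page marker. Keeping the original page numbers makes it easier to trace a passage back to its scan.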