# Document Parsing
Sapnous VR 6B
Apache-2.0
Sapnous-6B is an advanced vision-language model that enhances perception and understanding of the world through powerful multimodal capabilities.
Image-to-Text
Transformers English

Sapnous-AI
261
5
Embedder Collection
Multilingual embedding model for German and English, supporting a context length of 8192 tokens
Text Embedding
Supports Multiple Languages
kalle07
6,623
10
Pix2text Table Rec
MIT
A table structure recognition model based on Microsoft's Table Transformer, for detecting and recognizing tables in documents.
Text Recognition
Transformers

breezedeus
1,124
2
Nougat
This model is outdated; the official Nougat model is recommended instead. Nougat is a vision-language model focused on document understanding and analysis.
Image-to-Text
Transformers

nielsr
14
4