Model Selection

Document Parsing

# Document Parsing

Sapnous-6B is an advanced vision-language model that enhances perception and understanding of the world through powerful multimodal capabilities.

Transformers English

Embedder Collection

Multilingual embedding model for German and English, supporting a context length of 8192 tokens

Text Embedding Supports Multiple Languages

Pix2text Table Rec

A table structure recognition model developed based on Microsoft's Table Transformer for table detection and recognition tasks in documents

Text Recognition

This model is outdated. It is recommended to use the official Nougat model. Nougat is an advanced vision-language model focused on document understanding and analysis.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase