# Image-to-text
Sdxl Aam Xl Anime Mix
Other
Anime-style image generation model based on Stable Diffusion XL, supporting image-to-text conversion
Image Generation
S
zenless-lab
1,259
0
Paligemma 3b Mix 448 Ft TableDetection
A multimodal table detection model fine-tuned from google/paligemma-3b-mix-448, specialized in identifying table regions in images
Image-to-Text
Transformers

P
ucsahin
19
4
Blip Base Captioning Ft Hl Scenes
Apache-2.0
This model is an image captioning model based on the BLIP architecture, specifically fine-tuned for high-level scene descriptions.
Image-to-Text
Transformers English

B
michelecafagna26
13
0
Trocr Processor
TrOCR is a Transformer-based optical character recognition model specifically designed for handwritten text recognition, fine-tuned on the IAM handwritten database.
Image-to-Text
Transformers

T
anaghasavit
18
3
Movie Picture Captioning
Apache-2.0
This model can generate captions for any photo in a cinematic narrative style, trained on movie posters and plot summaries, primarily for entertainment purposes.
Image-to-Text
Transformers Other

M
dumperize
35
4
Veld Base
Apache-2.0
Pre-trained visual encoder-text decoder model supporting Korean and English
Image-to-Text
Transformers Supports Multiple Languages

V
KETI-AIR
40
0
Trocr Large Stage1
TrOCR is a Transformer-based pre-trained model for Optical Character Recognition (OCR) tasks.
Text Recognition
Transformers

T
microsoft
3,700
25
Featured Recommended AI Models