olmOCR is an optical character recognition model fine-tuned based on Qwen2-VL-7B-Instruct. It focuses on converting image content such as PDFs into text and improves the recognition accuracy in specific scenarios through fine-tuning.
Text Recognition
Safetensors Other