F

Florence 2 DocLayNet Fixed

Developed by yifeihu
Florence-2 model fine-tuned on the DocLayNet dataset, specialized for document layout analysis tasks, with improved performance through simplified category names
Downloads 95
Release Time : 10/29/2024

Model Overview

This model is a fine-tuned version of Florence-2-large-ft, optimized for document layout analysis tasks, specifically addressing the classification and localization of visual elements in documents.

Model Features

Optimized category names
Simplified original category names to single tokens, improving model performance by 7% and accelerating training and inference
Bounding box quality
Produces clearer bounding box edges, avoiding text truncation and multiple box issues
Scientific paper optimization
Excellent performance on scientific paper subsets, achieving 87% mAP50-95

Model Capabilities

Document layout analysis
Visual element detection
Text region recognition
Table detection
Formula recognition

Use Cases

Academic document processing
Paper figure and table recognition
Automatically identifies figures, tables, formulas, and other elements in academic papers
Achieves 87% mAP50-95 on scientific paper subsets
Document digitization
Document structure parsing
Analyzes document layout structure to identify headers, footers, titles, and other elements
Overall mAP50-95 reaches 70%
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase