# Cross-Level Feature Fusion
Ret OpenCLIP ViT H 14
Apache-2.0
ReT is an innovative method supporting multimodal query and document retrieval, achieving fine-grained retrieval by integrating multi-level representations from vision and text backbone networks.
Multimodal Fusion
Transformers

R
aimagelab
23
0
Ret CLIP ViT L 14
Apache-2.0
ReT is an innovative method supporting multimodal query and document retrieval, achieving fine-grained retrieval by fusing multi-level representations from vision and text backbone networks.
Multimodal Fusion
Transformers

R
aimagelab
523
0
Featured Recommended AI Models