# Medical Image-Text Alignment
Biomedclip Vit Bert Hf
MIT
A BiomedCLIP model implemented based on PyTorch and Huggingface frameworks, reproducing the original microsoft/BiomedCLIP-PubMedBERT_256-vit_base_patch16_224 model
Multimodal Fusion
Transformers English

B
chuhac
4,437
1
Monet
A vision-language foundation model based on CLIP ViT-L/14 architecture, specialized in dermatological image analysis, achieving transparent medical imaging AI through medical literature training
Image-to-Text
Transformers

M
suinleelab
655
2
Pmc Vit L 14 Hf
A vision-language model fine-tuned on the PMC-OA dataset based on CLIP-ViT-L/14
Text-to-Image
Transformers

P
ryanyip7777
260
1
Featured Recommended AI Models