PaliGemma 3B FT ScienceQA 448
PaliGemma is a lightweight 3B-parameter vision-language model developed by Google. It is built on the SigLIP vision model and the Gemma language model, and it takes image and text inputs and generates text output.
Image-to-Text
Transformers
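
The image-plus-text-in, text-out flow described above can be sketched with the Transformers library roughly as follows. This is a minimal, unofficial sketch: the repository id `google/paligemma-3b-ft-scienceqa-448` is an assumption inferred from the model name, and `drop_prompt` and `generate_answer` are hypothetical helper names introduced here for illustration.

```python
# Sketch only: the repo id below is an assumption, not stated in this card.
MODEL_ID = "google/paligemma-3b-ft-scienceqa-448"


def drop_prompt(sequence, prompt_len):
    """Return only the part of a token sequence that follows the prompt."""
    return sequence[prompt_len:]


def generate_answer(image, prompt, max_new_tokens=20):
    """Run one image + text prompt through the model and return decoded text."""
    # Imported lazily so the helper above works without the heavy dependencies.
    import torch
    from transformers import AutoProcessor, PaliGemmaForConditionalGeneration

    processor = AutoProcessor.from_pretrained(MODEL_ID)
    model = PaliGemmaForConditionalGeneration.from_pretrained(MODEL_ID).eval()

    # The processor turns the image and the text prompt into model inputs.
    inputs = processor(text=prompt, images=image, return_tensors="pt")
    prompt_len = inputs["input_ids"].shape[-1]

    # Greedy decoding; generate() returns the prompt ids followed by new ids.
    with torch.inference_mode():
        output_ids = model.generate(
            **inputs, max_new_tokens=max_new_tokens, do_sample=False
        )

    # Keep only the newly generated tokens before decoding.
    answer_ids = drop_prompt(output_ids[0], prompt_len)
    return processor.decode(answer_ids, skip_special_tokens=True)
```

A call might look like `generate_answer(Image.open("diagram.png").convert("RGB"), "Which object is a conductor?")`, where `Image` comes from Pillow and both the file name and the question are placeholders. The exact prompt format this fine-tune expects is not given here, and downloading the weights may require accepting the model license on the hosting hub first.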