# Zero-shot object detection

Llmdet Swin Tiny Hf
Apache-2.0
LLMDet is a powerful open-vocabulary object detector supervised by large language models, capable of zero-shot object detection.
Object Detection
L
fushh7
2,451
0
Owlv2 Large Patch14 Ensemble
Apache-2.0
OWLv2 is a zero-shot text-conditioned object detection model that can detect objects in images through text queries.
Text-to-Image Transformers
O
Thomasboosinger
1
0
Owlvit Base Patch32
OWL-ViT is a zero-shot object detection model based on Vision Transformer, capable of detecting objects of new categories without fine-tuning.
Object Detection Transformers
O
Xenova
86
1
Owlv2 Base Patch16
Apache-2.0
OWLv2 is a zero-shot text-conditioned object detection model that can detect and locate objects in images through text queries.
Text-to-Image Transformers
O
vvmnnnkv
26
0
Owlv2 Large Patch14 Finetuned
Apache-2.0
OWLv2 is a zero-shot text-conditioned object detection model that can detect objects in images through text queries without requiring category-specific training data.
Text-to-Image Transformers
O
google
1,434
4
Owlv2 Base Patch16 Finetuned
Apache-2.0
OWLv2 is a zero-shot text-conditioned object detection model that can retrieve objects in images through text queries.
Object Detection Transformers
O
google
2,698
3
Owlv2 Base Patch16 Ensemble
Apache-2.0
OWLv2 is a zero-shot text-conditioned object detection model that can localize objects in images through text queries.
Text-to-Image Transformers
O
google
932.80k
99
Owlv2 Base Patch16
Apache-2.0
OWLv2 is a zero-shot text-conditioned object detection model that can retrieve objects in images through text queries.
Text-to-Image Transformers
O
google
15.42k
26
Grounding Dino Tiny
Apache-2.0
Grounding DINO is an open-set object detection model that combines the DINO detector with grounding pre-training, enabling zero-shot object detection.
Object Detection Transformers
G
IDEA-Research
771.67k
74
Owlvit Base Patch32
Apache-2.0
OWL-ViT is a zero-shot text-conditioned object detection model that can search for objects in images via text queries without requiring category-specific training data.
Text-to-Image Transformers
O
google
764.95k
129
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase