O

Owlv2 Base Patch16 Ensemble

Developed by google
OWLv2 is a zero-shot text-conditioned object detection model that can localize objects in images through text queries.
Downloads 932.80k
Release Time : 10/13/2023

Model Overview

OWLv2 is a CLIP-based open-world localization model supporting zero-shot object detection via text queries, capable of recognizing objects in images without requiring category-specific training data.

Model Features

Zero-shot detection
Detects objects in images using only text descriptions, without requiring category-specific training data
Open vocabulary
Supports arbitrary text queries, not limited to predefined category sets
Multi-query support
Can simultaneously detect objects in images using multiple text queries

Model Capabilities

Image object detection
Text-conditioned query
Open-vocabulary recognition

Use Cases

Computer vision research
Zero-shot object detection research
Used to study the model's detection capabilities on unseen categories
Interdisciplinary applications
Special object recognition
Applied in domains requiring recognition of uncommon objects not present in training data
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase