O

Owlv2 Large Patch14

Developed by google
OWLv2 is a zero-shot text-conditioned object detection model that can detect objects in images through text queries without requiring category-specific training data.
Downloads 3,679
Release Time : 10/13/2023

Model Overview

OWLv2 is a CLIP-based open-vocabulary object detection model using ViT-L/14 as the visual encoder, capable of detecting objects in images through natural language descriptions.

Model Features

Zero-shot Detection Capability
Detects novel category objects through text descriptions without requiring category-specific training data.
Open-vocabulary Understanding
Capable of understanding and detecting object categories not present in training data.
Multi-query Detection
Supports simultaneous object detection using multiple text queries.

Model Capabilities

Object detection in images
Text-conditioned object localization
Open-vocabulary recognition
Simultaneous multi-category detection

Use Cases

Computer Vision Research
Zero-shot Object Detection Research
Investigating the model's detection capability on unseen categories
Industrial Applications
Inventory Management
Detecting items in warehouses through natural language descriptions
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase