O

Opensearch Neural Sparse Encoding Doc V2 Mini

Developed by opensearch-project
OpenSearch's learned sparse retrieval model v2 mini version, encoding documents into sparse vectors to optimize search relevance and efficiency
Downloads 113
Release Time : 7/18/2024

Model Overview

This is a learned sparse retrieval model specifically designed for OpenSearch. It encodes documents into 30522-dimensional sparse vectors and calculates similarity scores through the inner product of query and document sparse vectors. Compared to the v1 series, the v2 series shows improvements in search relevance, efficiency, and inference speed.

Model Features

Efficient Sparse Encoding
Encodes documents into 30522-dimensional sparse vectors, optimizing storage and retrieval efficiency
Inference-Free Retrieval
No model inference required during retrieval; directly uses pre-computed sparse vectors
Performance Optimization
Compared to the v1 series, the v2 series shows improvements in search relevance and inference speed
OpenSearch Integration
Designed specifically for OpenSearch, supporting retrieval based on Lucene inverted indexes

Model Capabilities

Document Sparse Encoding
Efficient Similarity Calculation
Large-scale Document Retrieval
Zero-shot Retrieval

Use Cases

Information Retrieval
Document Search
Quickly retrieves relevant content from large-scale document collections
Achieves an average NDCG@10 of 0.497 on BEIR benchmark subsets
Question Answering Systems
Serves as the retrieval component for QA systems to quickly find relevant passages
NDCG@10 of 0.510 on the NQ (Natural Questions) dataset
Enterprise Search
Internal Document Retrieval
Helps enterprises quickly search internal knowledge bases and documents
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase