V

Vectorizer.vanilla

Developed by sinequa
A vectorizer developed by Sinequa that generates embedding vectors from input paragraphs or queries for sentence similarity computation and retrieval tasks.
Downloads 634
Release Time : 7/11/2023

Model Overview

This model is a feature extraction model specifically designed to generate text embeddings. Paragraph vectors can be stored in vector indexes, while query vectors are used to retrieve relevant paragraphs.

Model Features

Efficient inference
Achieves millisecond-level inference on various GPUs, with peak batch processing speeds of up to 5ms (FP16 quantization)
Multi-GPU support
Supports various NVIDIA GPUs including A10/T4/L4, with FP16/FP32 quantization options
Robust training
Trained using query-paragraph-negative triplets and in-batch negative sampling strategy to enhance model discrimination

Model Capabilities

Text vectorization
Sentence similarity computation
Semantic retrieval

Use Cases

Information retrieval
Document retrieval system
Vectorize document paragraphs for storage and quickly retrieve relevant content via query vectors
Achieved average Recall@100 of 0.639 across 14 datasets in BEIR benchmark
Q&A systems
FAQ matching
Match user questions with knowledge base questions via vector similarity
Achieved Recall@100 of 0.995 on Quora dataset
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase