V

Vectorizer.raspberry

Developed by sinequa
A vectorizer developed by Sinequa that generates embedding vectors based on input paragraphs or queries, used for sentence similarity calculation and retrieval tasks.
Downloads 408
Release Time : 7/11/2023

Model Overview

This model is a feature extraction and sentence similarity calculation model, primarily used to generate embedding vectors for paragraphs and queries, supporting multilingual text processing.

Model Features

Multilingual Support
Supports 9 major languages and is compatible with 91 other languages used in the base model's pre-training.
Efficient Inference
On an NVIDIA A10 GPU, inference time with FP16 quantization and batch size 1 takes only 1 millisecond.
Case and Accent Insensitivity
Insensitive to text case and accents, improving model robustness.
Dimensionality Reduction
Reduces output dimension to 256 through an additional dense layer, optimizing storage and computational efficiency.

Model Capabilities

Multilingual Text Embedding
Sentence Similarity Calculation
Paragraph Vectorization
Query Vectorization
Cross-language Retrieval

Use Cases

Information Retrieval
Document Retrieval
Using query vectors to find relevant document paragraphs
Achieved an average Recall@100 of 0.613 on the BEIR benchmark
Multilingual Applications
Cross-language Search
Supports text similarity calculation and retrieval in multiple languages
On the MIRACL benchmark, achieved a Chinese Recall@100 of 0.680
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase