V

Vectorizer V1 S Multilingual

Developed by sinequa
A multilingual vectorizer developed by Sinequa that generates embedding vectors for input paragraphs or queries, used for similarity calculation and information retrieval.
Downloads 322
Release Time : 7/10/2023

Model Overview

This model is a multilingual feature extractor based on the BERT-Small architecture, primarily used for sentence similarity calculation and information retrieval tasks, supporting four languages: English, French, German, and Spanish.

Model Features

Multilingual support
Supports text processing in four languages: English, French, German, and Spanish
Efficient inference
Demonstrates efficient inference speed across different GPUs, requiring only 5 milliseconds for batch processing of 32 samples under FP16 quantization
Case insensitivity
Insensitive to text case and accents, improving retrieval robustness
In-batch negative training
Uses query-passage pairs and in-batch negative samples for training to optimize vector representation

Model Capabilities

Text vectorization
Multilingual text processing
Semantic similarity calculation
Information retrieval

Use Cases

Information retrieval
Document retrieval system
Build a document retrieval system based on semantic similarity
Achieved an average Recall@100 of 0.448 on the BEIR benchmark
Multilingual Q&A system
Backend for a multilingual Q&A system
Achieved a French Recall@100 of 0.583 on the MIRACL benchmark
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase