# Cross-language retrieval

Multilingual E5 Large Pooled Q8 0 GGUF
MIT
Multilingual E5 large pooled model supporting sentence similarity calculation and feature extraction tasks in multiple languages.
Text Embedding Supports Multiple Languages
M
falan42
56
1
GIST Embedding V0
MIT
GIST-Embedding-v0 is a sentence embedding model based on sentence-transformers, mainly used for sentence similarity calculation and feature extraction tasks.
Text Embedding English
G
avsolatorio
252.21k
26
Colqwen2 2b V1.0
A visual retrieval model based on Qwen2-VL-2B-Instruct and ColBERT strategy, capable of generating multi-vector text and image representations
Text-to-Image Supports Multiple Languages
C
tsystems
700
1
Gte Qwen2 1.5B Instruct GGUF
Apache-2.0
A 7B-parameter sentence embedding model based on the Qwen2 architecture, specializing in sentence similarity tasks with outstanding performance on the MTEB benchmark.
Large Language Model
G
mav23
169
2
Vectorizer.guava
A vectorization tool developed by Sinequa that generates embedding vectors from input paragraphs or queries for sentence similarity calculation and retrieval tasks.
Text Embedding Supports Multiple Languages
V
sinequa
204
1
Bge Reranker V2 M3 En Ru
MIT
This is a streamlined version of BAAI/bge-reranker-v2-m3, retaining only the English and Russian vocabulary, making the model 1.5 times smaller while still generating identical embedding vectors.
Text Embedding Transformers Supports Multiple Languages
B
qilowoq
677
6
Gte Multilingual Mlm Base
Apache-2.0
mGTE series multilingual text encoder, supporting 75 languages, with a maximum context length of 8192, based on BERT+RoPE+GLU architecture, excelling in GLUE and XTREME-R benchmarks
Large Language Model Safetensors
G
Alibaba-NLP
342
12
Bloomz 560m Retriever V2
Openrail
A dual encoder based on the Bloomz-560m-dpo-chat model, designed to map articles and queries into the same vector space, supporting cross-language retrieval in French and English.
Text Embedding Transformers Supports Multiple Languages
B
cmarkea
17
2
All Indo E5 Small V4
This is an Indonesian text embedding model based on sentence-transformers, capable of mapping sentences and paragraphs into a 384-dimensional dense vector space, suitable for tasks such as clustering and semantic search.
Text Embedding Transformers
A
LazarusNLP
3,039
7
Sentence Transformers Multilingual E5 Large
This is a multilingual sentence embedding model based on sentence-transformers, capable of mapping text to a 1024-dimensional vector space, suitable for semantic search and clustering tasks.
Text Embedding
S
embaas
53.70k
2
LEALLA Base
Apache-2.0
LEALLA is a collection of lightweight language-agnostic sentence embedding models supporting 109 languages, distilled from LaBSE. Suitable for obtaining multilingual sentence embeddings and bilingual text retrieval.
Text Embedding Supports Multiple Languages
L
setu4993
772
5
Paraphrase Spanish Distilroberta
A Spanish-English bilingual model based on sentence-transformers that maps text to a 768-dimensional vector space, suitable for semantic search and clustering tasks
Text Embedding Transformers Spanish
P
somosnlp-hackathon-2022
17.25k
15
Cross En It Roberta Sentence Transformer
MIT
A sentence embedding model supporting English and Italian, used for generating vector representations of sentences.
Text Embedding Transformers Supports Multiple Languages
C
T-Systems-onsite
16
0
Msmarco MiniLM L12 En De V1
Apache-2.0
An English-German cross-lingual cross-encoder model trained on the MS Marco passage ranking task, suitable for passage re-ranking in information retrieval scenarios.
Text Embedding Transformers Supports Multiple Languages
M
cross-encoder
19.62k
5
Sentence Transformers Multilingual Snli V2 500k
This is a multilingual sentence embedding model based on sentence-transformers, capable of mapping sentences and paragraphs into a 768-dimensional vector space, suitable for tasks such as clustering and semantic search.
Text Embedding Transformers
S
Pyjay
21
1
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase