GTE Multilingual Base

Developed by Alibaba-NLP
GTE Multilingual Base is a multilingual sentence embedding model supporting over 50 languages, suitable for tasks like sentence similarity calculation.
Downloads 1.2M
Release Time: 7/20/2024

Model Overview

This model is a Transformer-based multilingual sentence embedding model that maps sentences in different languages into a unified vector space, facilitating cross-lingual sentence similarity calculation and information retrieval.
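As a rough illustration of what a unified vector space makes possible, the sketch below compares sentence embeddings with cosine similarity. The vectors are made-up 4-dimensional toy values, not real model output, and `cosine_similarity` is a helper name introduced here for illustration. In practice the vectors would come from the model itself, for instance through the sentence-transformers library with the Hugging Face model ID Alibaba-NLP/gte-multilingual-base (an assumption, since this page does not show loading code).

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Toy vectors standing in for sentence embeddings (hypothetical values).
emb_en = [0.9, 0.1, 0.3, 0.0]       # e.g. "The cat sleeps."
emb_fr = [0.8, 0.2, 0.4, 0.1]       # e.g. "Le chat dort."
emb_other = [0.0, 0.9, -0.2, 0.7]   # an unrelated sentence

sim_translation = cosine_similarity(emb_en, emb_fr)
sim_unrelated = cosine_similarity(emb_en, emb_other)
```

With a well-trained multilingual model, a sentence and its translation would be expected to score higher than an unrelated pair, which is the pattern the toy values mimic.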

Model Features

Multilingual Support
Supports sentence embeddings for over 50 languages, enabling cross-lingual semantic understanding
Multi-task Adaptability
Suitable for various NLP tasks including sentence similarity, clustering, classification, and retrieval
High Performance
Reports strong results on retrieval, classification, and sentence similarity benchmarks (scores are listed under Use Cases)

Model Capabilities

Sentence similarity calculation
Text clustering
Text classification
Information retrieval
Text reranking
Bilingual text mining

Use Cases

Information Retrieval
Cross-lingual Document Retrieval
Retrieve relevant documents from collections in different languages
Achieved NDCG@10 of 53.638 on the AlloprofRetrieval test set
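For readers unfamiliar with the metric, here is a minimal sketch of how NDCG@10 can be computed for a single query. It uses the linear-gain form of DCG; the benchmark's own implementation may differ in details, and the reported 53.638 is presumably this ratio averaged over queries and scaled to 0 to 100. The function names and the relevance labels are illustrative assumptions.

```python
import math


def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the first k relevance grades."""
    return sum(rel / math.log2(rank + 2)
               for rank, rel in enumerate(relevances[:k]))


def ndcg_at_k(ranked_relevances, all_relevances, k=10):
    """DCG of the system ranking divided by the DCG of an ideal ranking.

    ranked_relevances: grades of the retrieved documents, in ranked order.
    all_relevances: grades of every judged document for the query, any order.
    """
    ideal = sorted(all_relevances, reverse=True)
    ideal_dcg = dcg_at_k(ideal, k)
    if ideal_dcg == 0:
        return 0.0  # no relevant documents exist for this query
    return dcg_at_k(ranked_relevances, k) / ideal_dcg


# Hypothetical query: two relevant documents, retrieved at ranks 1 and 3.
ranked = [1, 0, 1, 0, 0]
judged = [1, 1, 0, 0, 0]
score = ndcg_at_k(ranked, judged, k=10)
```

Because the second relevant document appears at rank 3 instead of rank 2, the score falls slightly below 1.0.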
Text Classification
Product Review Classification
Sentiment classification for multilingual product reviews
Achieved 80.72% accuracy on AmazonPolarityClassification
Sentence Similarity
Cross-lingual Sentence Matching
Calculate semantic similarity between sentences in different languages
Achieved Spearman correlation of 81.21 on the BIOSSES test set
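Sentence similarity benchmarks of this kind typically compare the model's similarity scores with human ratings using Spearman rank correlation; the 81.21 above is presumably the coefficient scaled by 100. The sketch below shows one way to compute it in plain Python, using average ranks for ties. The helper names and the sample scores are hypothetical.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for idx in order[i:j + 1]:
            ranks[idx] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length sequences of at least 2 items")
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when an input is constant")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical human ratings and model cosine similarities for five pairs.
human = [4.0, 1.0, 3.5, 0.5, 2.0]
model = [0.91, 0.35, 0.80, 0.40, 0.55]
rho = spearman(human, model)
```

Only the ordering of the scores matters here, so a model can reach a high Spearman value even if its raw similarities are on a different scale from the human ratings.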