Gte Multilingual Reranker Base Onnx Op14 Opt Gpu Int8
This is the quantized ONNX version of Alibaba-NLP/gte-multilingual-reranker-base, utilizing INT8 quantization, optimized for GPU, and suitable for text classification tasks.
Downloads 91
Release Time : 3/27/2025
Model Overview
This model is a quantized ONNX version based on Alibaba-NLP/gte-multilingual-reranker-base, using ONNX opset 14, optimized for GPU devices, primarily for text classification and sentence similarity tasks.
Model Features
INT8 Quantization
Utilizes INT8 quantization technology to significantly improve inference speed.
GPU Optimization
Specially optimized for GPU devices to enhance computational efficiency.
Multilingual Support
Supports text processing tasks in multiple languages.
ONNX Runtime
Uses the ONNX runtime framework to provide efficient model inference capabilities.
Model Capabilities
Text Classification
Sentence Similarity Calculation
Multilingual Text Processing
Use Cases
Information Retrieval
Document Re-ranking
Re-rank search results in information retrieval systems to improve relevance.
Enhances the accuracy and relevance of retrieval results
Text Analysis
Text Classification
Classify text for tasks such as sentiment analysis and topic classification.
Efficient and accurate text classification
Featured Recommended AI Models
Š 2025AIbase