Semantic Xlmr Bn

Developed by afschowdhury
A multilingual sentence embedding model optimized for Bengali that maps text to a 768-dimensional vector space.
Downloads 225
Release Time: 2/1/2023

Model Overview

A sentence transformer model based on the XLM-RoBERTa architecture and fine-tuned specifically for Bengali. It is suited to tasks such as semantic similarity calculation, text clustering, and semantic search.
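A minimal sketch of how a model like this is typically loaded and queried. It assumes the `sentence-transformers` package is installed and that the model is published on the Hugging Face Hub as `afschowdhury/semantic_xlmr_bn`; that ID is inferred from the developer and model name above and is not stated on this page. The helper names `load_model` and `embed` are illustrative only.

```python
import numpy as np

# Assumed Hub ID, inferred from the developer and model name; verify before use.
MODEL_ID = "afschowdhury/semantic_xlmr_bn"
EMBEDDING_DIM = 768  # dimensionality stated in the model description


def load_model(model_id=MODEL_ID):
    """Load the model. The import is lazy so this module loads without the package."""
    from sentence_transformers import SentenceTransformer

    return SentenceTransformer(model_id)


def embed(sentences, model):
    """Encode sentences with `model.encode` and check there is one row per sentence."""
    sentences = list(sentences)
    vectors = np.asarray(model.encode(sentences))
    if vectors.ndim != 2 or vectors.shape[0] != len(sentences):
        raise ValueError(f"unexpected embedding shape {vectors.shape}")
    return vectors


# Example call (downloads the weights on first use):
#   model = load_model()
#   vectors = embed(["আমি বাংলায় গান গাই।", "I sing in Bengali."], model)
```

If the description above holds, `embed` should return an array with one row per input sentence and `EMBEDDING_DIM` columns.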

Model Features

Multilingual knowledge distillation: Trained using paraphrase-distilroberta-base-v2 as the teacher model for knowledge distillation.
Bengali optimization: Fine-tuned specifically for Bengali text to enhance semantic understanding.
Efficient vector representation: Converts sentences into 768-dimensional dense vectors for use in downstream tasks.
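This page does not describe the training objective in detail. The sketch below shows the commonly used multilingual distillation recipe, under the assumption that it applies here: the student is trained so that its embeddings of both a source sentence and its Bengali translation match the teacher's embedding of the source sentence, using mean squared error. The function names and the toy arrays are hypothetical.

```python
import numpy as np


def mse(a, b):
    """Mean squared error between two arrays of the same shape."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(np.mean((a - b) ** 2))


def distillation_loss(teacher_src, student_src, student_tgt):
    """Assumed distillation objective (not confirmed by the model page).

    teacher_src: teacher embeddings of the source sentences, shape (batch, dim)
    student_src: student embeddings of the same source sentences
    student_tgt: student embeddings of the Bengali translations
    """
    return mse(teacher_src, student_src) + mse(teacher_src, student_tgt)


# Toy batch of two sentences with 4-dimensional embeddings.
teacher = np.ones((2, 4))
loss = distillation_loss(teacher, np.ones((2, 4)), np.zeros((2, 4)))
```

In this toy batch the student already matches the teacher on the source sentences, so the whole loss comes from the translated sentences.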

Model Capabilities

Calculate sentence similarity
Generate text embedding vectors
Support multilingual processing
Perform semantic search
Cluster text by semantic similarity
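Sentence similarity over embeddings is usually measured with cosine similarity. The sketch below computes a pairwise similarity matrix with NumPy. It works on any 2-D arrays of embeddings, and the small 2-dimensional vectors stand in for the model's 768-dimensional output. The function name `cosine_similarity_matrix` is illustrative.

```python
import numpy as np


def cosine_similarity_matrix(a, b):
    """Return the matrix of cosine similarities between rows of `a` and rows of `b`."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    # Normalize each row to unit length; the clip guards against zero vectors.
    a_unit = a / np.clip(np.linalg.norm(a, axis=1, keepdims=True), 1e-12, None)
    b_unit = b / np.clip(np.linalg.norm(b, axis=1, keepdims=True), 1e-12, None)
    return a_unit @ b_unit.T


# Toy embeddings: entry [i, j] compares row i of the first array with row j of the second.
sims = cosine_similarity_matrix([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [1.0, 1.0]])
```

Values close to 1 indicate sentences whose embeddings point in nearly the same direction, and values near 0 indicate unrelated embeddings.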

Use Cases

Information retrieval
Document retrieval system: Search and rank documents by semantic similarity to a query.
Dialogue systems
FAQ matching: Match user questions to semantically similar questions in a knowledge base.
Content recommendation
Similar content recommendation: Recommend related articles or products based on the semantic similarity of their content.
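The retrieval, FAQ matching, and recommendation use cases share one pattern: embed the query and the candidates, then rank candidates by cosine similarity. A minimal sketch of that ranking step follows, assuming the embeddings have already been computed. The function name `top_k_matches` and the toy vectors are hypothetical.

```python
import numpy as np


def top_k_matches(query_vec, corpus_vecs, k=3):
    """Return up to k (index, score) pairs for the corpus rows most similar to the query."""
    q = np.asarray(query_vec, dtype=float)
    c = np.asarray(corpus_vecs, dtype=float)
    # Unit-normalize so the dot product equals cosine similarity.
    q = q / max(np.linalg.norm(q), 1e-12)
    c = c / np.clip(np.linalg.norm(c, axis=1, keepdims=True), 1e-12, None)
    scores = c @ q
    order = np.argsort(-scores)[:k]  # highest score first
    return [(int(i), float(scores[i])) for i in order]


# Toy corpus of three "documents" and one query, in 2 dimensions.
corpus = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
matches = top_k_matches([1.0, 0.1], corpus, k=2)
```

For large corpora, an approximate nearest-neighbor index is normally used instead of this exhaustive scan, but the scoring idea is the same.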