
Indic Sentence Similarity SBERT

Developed by l3cube-pune
This is an IndicSBERT model trained on STS datasets of ten major Indian languages. It works with English and multiple Indian languages and supports cross-lingual comparison.
Downloads 1,642
Release Time: 3/4/2023

Model Overview

This is a sentence similarity model trained on STS datasets of ten major Indian languages: Hindi, Marathi, Kannada, Tamil, Telugu, Gujarati, Odia, Punjabi, Malayalam, and Bengali. It also supports English. It was released as part of the MahaNLP project.

Model Features

Multilingual Support
Supports sentence similarity calculation for English and ten major Indian languages
Cross-Lingual Capability
Enables sentence similarity comparison across different Indian languages
Trained on STS Datasets
Trained specifically on Semantic Textual Similarity (STS) datasets

Model Capabilities

Sentence Feature Extraction
Sentence Similarity Calculation
Cross-Lingual Sentence Comparison
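
The capabilities above can be sketched roughly as follows: encode each sentence into a vector, then compare vectors with cosine similarity. This is a minimal sketch, not the developer's documented usage. It assumes the model is published on Hugging Face under the identifier `l3cube-pune/indic-sentence-similarity-sbert` (inferred from the developer and model name on this page) and that it loads with the sentence-transformers library. The helper names `cosine_similarity`, `embed`, and `demo` are hypothetical.

```python
import math

# Assumption: Hugging Face identifier inferred from the developer and model name.
MODEL_ID = "l3cube-pune/indic-sentence-similarity-sbert"


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (lists of floats)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as having no similarity
    return dot / (norm_a * norm_b)


def embed(sentences, model_id=MODEL_ID):
    """Sentence feature extraction: one embedding vector per sentence."""
    # Imported lazily so cosine_similarity works without the library installed.
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_id)
    return [vector.tolist() for vector in model.encode(sentences)]


def demo():
    """Cross-lingual comparison of an English and a Hindi sentence."""
    english = "The weather is pleasant today."
    hindi = "आज मौसम सुहावना है।"
    vec_en, vec_hi = embed([english, hindi])
    print(cosine_similarity(vec_en, vec_hi))
```

Calling `demo()` downloads the model on first use and prints a single score. Scores closer to 1 indicate that the model considers the two sentences closer in meaning.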

Use Cases

Natural Language Processing
Multilingual Text Matching
Compare similar sentences expressed in different Indian languages
Cross-Lingual Information Retrieval
Find similar content in documents written in different languages
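
The retrieval use case can be illustrated with a small ranking routine: given one query embedding and a list of document embeddings, sort the documents by cosine similarity to the query. This is a sketch under stated assumptions. The function `rank_documents` is hypothetical, and the three-dimensional vectors are toy stand-ins for real sentence embeddings, which would come from the model and have many more dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (lists of floats)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs, top_k=3):
    """Return (document index, score) pairs for the top_k most similar documents."""
    scored = [
        (index, cosine_similarity(query_vec, vec))
        for index, vec in enumerate(doc_vecs)
    ]
    # Highest similarity first.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy vectors standing in for embeddings of a query and three documents.
query = [1.0, 0.0, 0.0]
documents = [
    [0.0, 1.0, 0.0],  # unrelated to the query
    [1.0, 0.1, 0.0],  # close to the query
    [0.5, 0.5, 0.0],  # partially related
]

for index, score in rank_documents(query, documents):
    print(index, round(score, 3))
```

Because the ranking depends only on the vectors, the query and the documents can be in different languages, provided they were all embedded by the same model.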