B

Bhasha Embed V0

Developed by AkshitaS
This is a multilingual model supporting embeddings for Hindi (Devanagari), English, and Romanized Hindi text, with special support for Romanized Hindi and cross-lingual alignment.
Downloads 203
Release Time : 6/24/2024

Model Overview

The model can embed Hindi (Devanagari), English, and Romanized Hindi text, supporting cross-lingual alignment and Romanized Hindi, suitable for multilingual text similarity computation and feature extraction.

Model Features

Romanized Hindi Support
The first embedding model supporting Romanized Hindi (transliterated Hindi/hin_Latn).
Cross-lingual Alignment
Outputs language-agnostic embeddings, supporting queries across multilingual candidate pools containing Hindi, English, and Romanized Hindi text.

Model Capabilities

Text Embedding
Sentence Similarity Computation
Multilingual Feature Extraction

Use Cases

Text Similarity
Multilingual Sentence Similarity
Compute similarity between sentences in different languages (Hindi, English, Romanized Hindi).
Supports cross-lingual sentence alignment and similarity scoring.
Information Retrieval
Multilingual Search
Retrieve texts similar to the query sentence from a multilingual candidate pool.
Supports mixed retrieval in Hindi, English, and Romanized Hindi.
Featured Recommended AI Models
┬й 2025AIbase