P

Ptbr Similarity E5 Small

Developed by jmbrito
This is a Portuguese-English sentence similarity model fine-tuned based on multilingual-e5-small, capable of mapping sentences to a 384-dimensional vector space.
Downloads 518
Release Time : 8/25/2023

Model Overview

This model is a fine-tuned version of intfloat/multilingual-e5-small using the ASSIN2 dataset for similarity scoring, specifically designed for sentence similarity calculation tasks.

Model Features

Bilingual support
Supports sentence similarity calculation in both Portuguese and English
High-dimensional vector space
Can map sentences and paragraphs to a 384-dimensional dense vector space
Fine-tuning optimization
Optimized using the ASSIN2 dataset to enhance similarity scoring performance

Model Capabilities

Sentence vectorization
Semantic similarity calculation
Text clustering
Semantic search

Use Cases

Information retrieval
Document similarity search
Find semantically similar documents in a document library
Improves retrieval relevance and accuracy
Text analysis
Text clustering
Group semantically similar texts together
Enables unsupervised text classification
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase