M

Msmarco Distilbert Base Tas B Mmarco Pt 100k

Developed by mpjan
This is a Portuguese sentence transformer model based on DistilBERT, specifically designed for sentence similarity and semantic search tasks.
Downloads 44
Release Time : 11/3/2022

Model Overview

The model maps Portuguese sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as clustering or semantic search. It was fine-tuned on the first 100,000 triplets of the Portuguese MMARCO dataset.

Model Features

Portuguese language support
Optimized specifically for Portuguese text, suitable for handling semantic tasks in Portuguese.
Efficient vector representation
Converts text into 768-dimensional dense vectors, preserving semantic information while maintaining computational efficiency.
Fine-tuning optimization
Specially fine-tuned on the Portuguese MMARCO dataset, improving performance on similarity tasks.

Model Capabilities

Sentence embedding
Semantic similarity calculation
Text clustering
Information retrieval

Use Cases

Information retrieval
Portuguese document search
Building a semantic search engine for Portuguese documents
Improving the semantic relevance of search results
Text analysis
Portuguese text clustering
Performing thematic clustering analysis on Portuguese texts
Automatically discovering thematic patterns in texts
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase