S

Simcse Indobert Base

Developed by LazarusNLP
SimCSE model based on IndoBERT for generating semantic embedding vectors of Indonesian sentences
Downloads 26
Release Time : 5/27/2023

Model Overview

This is a model based on sentence-transformers that can map Indonesian sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as clustering or semantic search.

Model Features

Indonesian-specific
Sentence embedding model specifically optimized for Indonesian
High-dimensional semantic space
Maps sentences into a 768-dimensional dense vector space
SimCSE training
Trained using contrastive learning (SimCSE) method to enhance sentence representation quality

Model Capabilities

Sentence embedding generation
Semantic similarity calculation
Text clustering
Semantic search

Use Cases

Information retrieval
Similar document retrieval
Find semantically similar documents in Indonesian document collections
Text analysis
Topic clustering
Perform semantic-based topic clustering analysis on Indonesian texts
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase