S

Simcse Model Roberta Base Thai

Developed by mrp
This is a sentence-transformers model based on XLM-R, specifically optimized for Thai language, capable of mapping sentences and paragraphs into a 768-dimensional dense vector space.
Downloads 69
Release Time : 3/2/2022

Model Overview

The model is trained using the SimCSE method, primarily for sentence similarity computation and feature extraction tasks, especially suitable for Thai text processing.

Model Features

Thai language optimization
Specifically trained and optimized for Thai text
SimCSE training method
Trained using the contrastive learning framework SimCSE to improve sentence representation quality
768-dimensional vector space
Can map text into a 768-dimensional dense vector space

Model Capabilities

Sentence similarity computation
Text feature extraction
Semantic search
Text clustering

Use Cases

Information retrieval
Semantic search
Building a Thai semantic search engine
Understands the semantic intent of queries rather than just keyword matching
Text analysis
Document clustering
Automatic classification and clustering of Thai documents
Groups based on semantic similarity rather than surface features
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase