Simcse Model Roberta Base Thai
S
Simcse Model Roberta Base Thai
Developed by mrp
This is a sentence-transformers model based on XLM-R, specifically optimized for Thai language, capable of mapping sentences and paragraphs into a 768-dimensional dense vector space.
Downloads 69
Release Time : 3/2/2022
Model Overview
The model is trained using the SimCSE method, primarily for sentence similarity computation and feature extraction tasks, especially suitable for Thai text processing.
Model Features
Thai language optimization
Specifically trained and optimized for Thai text
SimCSE training method
Trained using the contrastive learning framework SimCSE to improve sentence representation quality
768-dimensional vector space
Can map text into a 768-dimensional dense vector space
Model Capabilities
Sentence similarity computation
Text feature extraction
Semantic search
Text clustering
Use Cases
Information retrieval
Semantic search
Building a Thai semantic search engine
Understands the semantic intent of queries rather than just keyword matching
Text analysis
Document clustering
Automatic classification and clustering of Thai documents
Groups based on semantic similarity rather than surface features
Featured Recommended AI Models