Simcse Model M Bert Thai Cased
S
Simcse Model M Bert Thai Cased
Developed by mrp
A Thai sentence embedding model based on mBERT, trained using the SimCSE method on Thai Wikipedia data, capable of mapping text to 768-dimensional vectors
Downloads 1,637
Release Time : 3/2/2022
Model Overview
This model fine-tunes mBERT using the contrastive learning framework (SimCSE), specifically designed to generate semantic embedding vectors for Thai text, suitable for tasks like sentence similarity calculation and semantic search
Model Features
Thai language optimization
Sentence embedding model specifically optimized for Thai language characteristics
SimCSE framework
Uses a contrastive learning framework to improve sentence representation quality
High-dimensional semantic space
Generates 768-dimensional dense vector representations
Model Capabilities
Sentence vectorization
Semantic similarity calculation
Text clustering
Semantic search
Use Cases
Information retrieval
Thai similar question matching
Matching semantically similar Thai questions in Q&A systems
Improves Q&A system accuracy
Content recommendation
Thai news recommendation
News article recommendations based on content similarity
Enhances user reading experience
Featured Recommended AI Models