S

Simcse Model M Bert Thai Cased

Developed by mrp
A Thai sentence embedding model based on mBERT, trained using the SimCSE method on Thai Wikipedia data, capable of mapping text to 768-dimensional vectors
Downloads 1,637
Release Time : 3/2/2022

Model Overview

This model fine-tunes mBERT using the contrastive learning framework (SimCSE), specifically designed to generate semantic embedding vectors for Thai text, suitable for tasks like sentence similarity calculation and semantic search

Model Features

Thai language optimization
Sentence embedding model specifically optimized for Thai language characteristics
SimCSE framework
Uses a contrastive learning framework to improve sentence representation quality
High-dimensional semantic space
Generates 768-dimensional dense vector representations

Model Capabilities

Sentence vectorization
Semantic similarity calculation
Text clustering
Semantic search

Use Cases

Information retrieval
Thai similar question matching
Matching semantically similar Thai questions in Q&A systems
Improves Q&A system accuracy
Content recommendation
Thai news recommendation
News article recommendations based on content similarity
Enhances user reading experience
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase