K

Kf Deberta Multitask

Developed by upskyy
This is a Korean sentence embedding model based on sentence-transformers, capable of mapping sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as clustering or semantic search.
Downloads 1,866
Release Time : 1/14/2024

Model Overview

The model uses the DebertaV2 architecture, trained on the KorSTS and KorNLI datasets through multi-task learning, specifically designed for generating semantic embeddings of Korean sentences.

Model Features

Multi-task learning
Trained simultaneously on the KorSTS and KorNLI datasets, enhancing the model's generalization capability
High performance
Achieved a cosine Pearson score of 85.75 on the KorSTS evaluation dataset, outperforming similar Korean models
DebertaV2 architecture
Utilizes the advanced DebertaV2 model as its foundation, offering stronger semantic understanding capabilities

Model Capabilities

Sentence embedding generation
Semantic similarity calculation
Text clustering
Semantic search

Use Cases

Information retrieval
Korean semantic search
Used to build Korean search engines that return results based on semantics rather than keyword matching
Accurately identifies query intent and returns relevant documents
Text analysis
Document clustering
Automatically classifies and clusters Korean documents
Groups related documents based on semantic similarity
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase