M

Multilingual E5 Small Ko

Developed by dragonkue
This is a sentence-transformers model fine-tuned from intfloat/multilingual-e5-small, specifically optimized for Korean retrieval tasks, mapping text to a 384-dimensional vector space.
Downloads 263
Release Time : 5/11/2025

Model Overview

This model is a lightweight Korean retriever designed for ease of use and strong performance in practical retrieval tasks. Suitable for running demos or lightweight applications, it provides a good balance between speed and accuracy.

Model Features

Korean Optimization
Fine-tuned specifically for Korean retrieval tasks, improving the embedding quality of Korean text.
Lightweight Design
Small model size (118M), suitable for lightweight applications and rapid deployment.
Multilingual Support
Supports English text processing in addition to Korean.
Boundary-Optimized Training
Uses GISTEmbedLoss training with boundaries, significantly improving retrieval performance.

Model Capabilities

Semantic Text Similarity Calculation
Semantic Search
Paraphrase Mining
Text Classification
Text Clustering

Use Cases

Information Retrieval
Korean Document Retrieval
Retrieve relevant documents from a Korean document library.
Performs excellently on multiple Korean retrieval benchmarks.
Question Answering System
Build the retrieval component for a Korean question-answering system.
Performs well on datasets like Ko-StrategyQA.
Text Analysis
Text Similarity Calculation
Calculate semantic similarity between Korean texts.
Text Clustering
Perform semantic clustering on Korean texts.
Featured Recommended AI Models
ยฉ 2025AIbase