E5 Base Korean

Developed by upskyy
This is a Korean-optimized sentence embedding model based on the multilingual-e5-base model, supporting multilingual text similarity computation and feature extraction.
Downloads 53
Release Date: 8/9/2024

Model Overview

The model maps sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as semantic text similarity, semantic search, paraphrase mining, text classification, and clustering.
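As a minimal sketch of how such embeddings are compared, the snippet below computes cosine similarity between dense vectors. The short vectors are made-up stand-ins for the model's 768-dimensional output, and `cosine_similarity` is an illustrative helper written for this example, not part of the model or any library.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Toy 4-dimensional vectors (assumed values) standing in for real embeddings.
query = [0.2, 0.7, 0.1, 0.0]
close = [0.25, 0.65, 0.05, 0.05]
far = [0.9, 0.0, 0.1, 0.4]

# A semantically close sentence should score higher than an unrelated one.
print(cosine_similarity(query, close) > cosine_similarity(query, far))  # prints True
```

In practice the vectors would come from encoding real sentences with the model; the comparison step stays the same.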

Model Features

Multilingual Support
Supports text embedding for 100+ languages, with special optimization for Korean
High-Quality Semantic Representation
Performs strongly on Korean semantic similarity tasks, reaching a Pearson correlation of 0.859 when sentence pairs are scored by cosine similarity
Long Text Processing
Supports a maximum sequence length of 512 tokens, suitable for paragraph-level text processing
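The Pearson figure quoted for semantic similarity can be read as the correlation between the model's cosine-similarity scores and reference similarity ratings for the same sentence pairs. The sketch below shows that calculation under this assumption. The `pearson` helper and both lists of numbers are invented for illustration and are not the data behind the reported 0.859.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length, at least 2 items")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when a sequence is constant")
    return cov / math.sqrt(var_x * var_y)


# Made-up values: cosine scores from a model and human ratings for five pairs.
model_cosine = [0.91, 0.35, 0.62, 0.10, 0.78]
human_scores = [4.8, 1.5, 3.0, 0.4, 4.1]

score = pearson(model_cosine, human_scores)
print(round(score, 3))
```

A value near 1 means the model ranks and spaces sentence pairs much as the reference ratings do.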

Model Capabilities

Semantic Text Similarity Computation
Semantic Search
Text Classification
Text Clustering
Paraphrase Mining

Use Cases

Information Retrieval
Cross-Language Document Retrieval
Search for semantically similar documents in a multilingual document repository
Content Recommendation
Similar News Recommendation
Recommend semantically similar news articles based on user reading content
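Both use cases above reduce to ranking stored document embeddings by similarity to a query embedding. The following is a minimal sketch of that ranking step, assuming the embeddings have already been computed. The document IDs, the 3-dimensional vectors, and the `top_k` helper are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(query_vec, doc_vecs, k=2):
    """Return (doc_id, score) pairs for the k documents most similar to the query."""
    scored = [(doc_id, cosine_similarity(query_vec, vec))
              for doc_id, vec in doc_vecs.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical repository with toy vectors in place of 768-dimensional embeddings.
doc_vecs = {
    "news-ko-economy": [0.9, 0.1, 0.0],
    "news-en-economy": [0.8, 0.2, 0.1],
    "news-ko-sports": [0.0, 0.2, 0.9],
}
# Stand-in for the embedding of a search query or of an article the user just read.
query_vec = [1.0, 0.1, 0.0]

results = top_k(query_vec, doc_vecs, k=2)
for doc_id, score in results:
    print(doc_id, round(score, 3))
```

Because documents in different languages share one vector space, the same ranking covers cross-language retrieval. For large repositories, an approximate nearest-neighbor index would typically replace the linear scan shown here.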