KR SBERT Medium Extended Patent2024 Hn

Developed by snunlp
This is a sentence-transformers model fine-tuned from snunlp/KR-Medium-extended, specifically designed for Korean patent text similarity tasks.
Downloads 773
Release date: 8/27/2024

Model Overview

The model maps sentences and paragraphs into a 768-dimensional dense vector space, suitable for semantic text similarity, semantic search, paraphrase mining, text classification, clustering, and other tasks.
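As a rough illustration of the mapping described above, the sketch below shows one way embeddings might be obtained and compared. It assumes the sentence-transformers library is installed, and the Hugging Face identifier in `MODEL_ID` is inferred from the model title rather than stated on this page, so verify it before use. The helper names `embed_texts` and `cosine_similarity` are illustrative, not part of the model.

```python
import math
from typing import List, Sequence

# Assumed identifier, inferred from the model title; verify it on the model page.
MODEL_ID = "snunlp/KR-SBERT-Medium-extended-patent2024-hn"


def embed_texts(texts: List[str], model_id: str = MODEL_ID) -> List[List[float]]:
    """Encode texts into dense vectors (768 dimensions according to this page)."""
    # Imported lazily so the rest of the sketch runs without the library installed.
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_id)
    return [vector.tolist() for vector in model.encode(texts)]


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)
```

With the model available, two patent sentences could be compared by calling `embed_texts([...])` and passing the two resulting vectors to `cosine_similarity`; values closer to 1 indicate more similar text.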

Model Features

Patent Text Optimization
Optimized for Korean patent texts to better handle technically complex patent content.
High-Dimensional Vector Representation
Maps text into a 768-dimensional dense vector space, capturing rich semantic information.
Large-Scale Training
Trained on the korpat-triplet dataset containing 1,795,000 training samples.

Model Capabilities

Sentence Similarity Calculation
Semantic Search
Paraphrase Mining
Text Classification
Text Clustering

Use Cases

Patent Analysis
Patent Similarity Search
Find other patents similar to a given patent description
Improves patent search efficiency and accuracy
Patent Classification
Automatically classify patents into different technical fields based on content
Simplifies patent management processes
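The two patent analysis use cases above can be sketched on top of precomputed embeddings. This is a minimal illustration, not the model's own method: the toy 3-dimensional vectors stand in for real 768-dimensional embeddings, the field labels are invented, and nearest-centroid classification is just one simple option assumed here. The function names `top_k_similar` and `classify_by_centroid` are hypothetical.

```python
import math
from typing import Dict, List, Sequence, Tuple


def normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit length; zero vectors are returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0.0 else list(vec)


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def top_k_similar(
    query: Sequence[float], corpus: List[Sequence[float]], k: int = 3
) -> List[Tuple[int, float]]:
    """Return (index, cosine similarity) for the k corpus vectors closest to query."""
    q = normalize(query)
    scored = [(i, dot(q, normalize(doc))) for i, doc in enumerate(corpus)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


def classify_by_centroid(
    query: Sequence[float], labeled: Dict[str, List[Sequence[float]]]
) -> str:
    """Pick the label whose mean unit embedding is most similar to query."""
    if not labeled:
        raise ValueError("at least one labeled group is required")
    q = normalize(query)
    best_label, best_score = "", -math.inf
    for label, vectors in labeled.items():
        unit = [normalize(v) for v in vectors]
        centroid = [sum(col) / len(unit) for col in zip(*unit)]
        score = dot(q, normalize(centroid))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy vectors standing in for real patent embeddings (illustrative only).
corpus = [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0], [0.0, 1.0, 0.0]]
hits = top_k_similar([1.0, 0.0, 0.0], corpus, k=2)

# Hypothetical technical fields with example embeddings for each.
fields = {
    "battery": [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0]],
    "display": [[0.0, 1.0, 0.0]],
}
label = classify_by_centroid([0.8, 0.2, 0.0], fields)
```

For a large patent corpus, a brute-force scan like this would likely be replaced by a vector index, but the ranking logic stays the same.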
Technical Document Processing
Technical Document Deduplication
Identify similar technical documents
Reduces storage of duplicate documents
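Deduplication can be approximated by dropping any document whose embedding is too close to one already kept. The sketch below is a simple greedy pass under stated assumptions: the 0.95 threshold is an arbitrary illustrative value, the 2-dimensional vectors are toy stand-ins for real embeddings, and `deduplicate` is a hypothetical helper name.

```python
import math
from typing import List, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; 0.0 if either vector has zero norm."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


def deduplicate(vectors: List[Sequence[float]], threshold: float = 0.95) -> List[int]:
    """Return indices of documents to keep, in their original order.

    A document is kept only if its similarity to every previously kept
    document is below the threshold.
    """
    kept: List[int] = []
    for i, vec in enumerate(vectors):
        if all(cosine(vec, vectors[j]) < threshold for j in kept):
            kept.append(i)
    return kept


# Toy embeddings: the second document is nearly identical to the first.
docs = [[1.0, 0.0], [0.999, 0.01], [0.0, 1.0]]
kept = deduplicate(docs, threshold=0.95)
```

This pass compares each document against all kept ones, so its cost grows quadratically in the worst case; the right threshold depends on the corpus and should be tuned on known duplicates.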