Minilm L6 Danish Encoder

Developed by KennethTM
A lightweight Danish sentence embedding model, adapted from the English MiniLM model and suited to Danish text processing tasks.
Downloads 5,802
Release date: 1/9/2024

Model Overview

This model maps Danish sentences and paragraphs to a 384-dimensional vector space, which supports tasks such as clustering and semantic search. It is adapted from the English MiniLM model, uses a Danish tokenizer, and is trained on machine-translated Danish data.

Model Features

Lightweight design
Approximately 22 million parameters, so compute and memory requirements are low
Danish optimization
Uses a Danish tokenizer, making it well suited to Danish text processing
Long text support
Supports a maximum sequence length of 512 tokens
Transfer learning
Adapted from the English MiniLM model rather than trained from scratch

Model Capabilities

Text embedding
Sentence similarity calculation
Semantic search
Text clustering

Use Cases

Information retrieval
Danish semantic search
Build a Danish search engine that matches on meaning rather than on keywords
Interprets the intent of a query and returns relevant results
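
The semantic search use case above can be sketched as ranking documents by cosine similarity between a query vector and document vectors. This is a minimal illustration, not the model's own API: the 3-dimensional vectors below are made-up stand-ins for the 384-dimensional embeddings the model would produce, and the function names `cosine_similarity` and `semantic_search` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a zero vector has no direction; treat it as unrelated
    return dot / (norm_a * norm_b)


def semantic_search(query_vec, doc_vecs, top_k=3):
    """Return up to top_k (index, score) pairs, most similar first."""
    scored = [(i, cosine_similarity(query_vec, vec)) for i, vec in enumerate(doc_vecs)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Assumption: toy vectors standing in for real 384-dimensional embeddings.
documents = [
    "Katten sover på sofaen",
    "Aktiemarkedet faldt i dag",
    "Hunden leger i haven",
]
doc_vecs = [
    [0.9, 0.1, 0.0],
    [0.0, 0.1, 0.9],
    [0.8, 0.3, 0.1],
]
query_vec = [1.0, 0.2, 0.0]  # hypothetical embedding of a query about pets

results = semantic_search(query_vec, doc_vecs, top_k=2)
for index, score in results:
    print(f"{score:.3f}  {documents[index]}")
```

In a real setup the vectors would come from encoding the query and the documents with the model; a linear scan like this is adequate for small collections, while larger ones usually call for a vector index.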
Text analysis
Danish text clustering
Automatically group Danish documents or user comments
Discover similar content or themes
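
Grouping documents or comments, as described above, can be sketched as a simple greedy clustering over embeddings: each item joins the first group whose representative is similar enough, otherwise it starts a new group. This is one possible approach under stated assumptions, not a method prescribed by the model: the vectors are made-up 3-dimensional stand-ins for 384-dimensional embeddings, the 0.8 threshold is arbitrary, and `cluster_by_threshold` is a hypothetical helper.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def cluster_by_threshold(vectors, threshold=0.8):
    """Greedy clustering: the first member of each cluster is its representative."""
    clusters = []  # each cluster is a list of indices into `vectors`
    for i, vec in enumerate(vectors):
        for cluster in clusters:
            representative = vectors[cluster[0]]
            if cosine_similarity(vec, representative) >= threshold:
                cluster.append(i)
                break
        else:
            # No existing cluster was similar enough, so start a new one.
            clusters.append([i])
    return clusters


# Assumption: toy vectors standing in for real embeddings of user comments.
comments = [
    "Leveringen var hurtig",
    "Pakken kom til tiden",
    "Appen går ned ved login",
    "Jeg kan ikke logge ind",
]
vectors = [
    [0.9, 0.1, 0.0],
    [0.8, 0.2, 0.1],
    [0.0, 0.2, 0.9],
    [0.1, 0.1, 0.8],
]

groups = cluster_by_threshold(vectors, threshold=0.8)
for group in groups:
    print([comments[i] for i in group])
```

The result depends on input order and on the threshold, so for production use an established clustering algorithm over the real embeddings is likely a better fit.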