D

Dist Mpnet Czeng Cs En

Developed by Seznam
This is a Czech-English bilingual BERT-small model distilled from the all-mpnet-base-v2 model, developed by Seznam.cz, specializing in semantic embedding tasks.
Downloads 1,232
Release Time : 11/2/2023

Model Overview

This model is a small semantic embedding model compressed from a large MPNet model using knowledge distillation technology, supporting both Czech and English, suitable for various natural language processing tasks.

Model Features

Bilingual Support
Supports semantic embedding calculations for both Czech and English.
Efficient Distillation
Retains the performance of the large model while significantly reducing its size through knowledge distillation technology.
High-Quality Embeddings
Performs excellently in various semantic tasks, including similarity search and text classification.

Model Capabilities

Calculate sentence similarity
Generate semantic embedding vectors
Support cross-language semantic matching

Use Cases

Information Retrieval
Cross-Language Document Retrieval
Using this model, you can build a cross-language document retrieval system supporting Czech and English.
Effectively matches documents in different languages that are semantically similar.
Text Analysis
Text Clustering
Utilizes the model's generated embedding vectors to perform clustering analysis on Czech and English texts.
Can discover similar thematic content across languages.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase