M

Minicpm Embedding

Developed by openbmb
MiniCPM-Embedding is an embedding model developed based on the MiniCPM-2B-sft-bf16 foundation model, specializing in retrieval tasks and supporting both Chinese and English.
Downloads 315
Release Time : 9/4/2024

Model Overview

This model is primarily used for text retrieval tasks, capable of generating high-quality text embeddings suitable for various information retrieval scenarios.

Model Features

Bilingual Support
Supports text retrieval tasks in both Chinese and English.
Efficient Retrieval
Performs excellently in multiple retrieval tasks, especially in Chinese retrieval tasks.
Lightweight
Based on MiniCPM-2B-sft-bf16, with a relatively small parameter size, making it suitable for resource-limited environments.

Model Capabilities

Text Embedding Generation
Information Retrieval
Bilingual Retrieval

Use Cases

Information Retrieval
Academic Literature Retrieval
Used for retrieving academic literature, such as scientific documents in the SCIDOCS dataset.
NDCG@10 of 22.38
Medical Q&A Retrieval
Used for retrieving medical-related Q&A data, such as the CmedqaRetrieval dataset.
NDCG@10 of 46.05
E-commerce Product Retrieval
Used for retrieving product information on e-commerce platforms, such as the EcomRetrieval dataset.
NDCG@10 of 70.21
Q&A Systems
Fact-based Q&A
Used for answering factual questions, such as tasks in the FEVER dataset.
NDCG@10 of 90.76
Open-domain Q&A
Used for open-domain Q&A tasks, such as the NQ dataset.
NDCG@10 of 69.29
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase