B

Bce Reranker Base V1

Developed by maidalun1020
A bilingual and cross-language reranking model optimized for RAG, supporting Chinese, English, Japanese, and Korean, providing explainable absolute scores
Downloads 68.29k
Release Time : 12/29/2023

Model Overview

BCEmbedding is a bilingual and cross-language embedding model developed by NetEase Youdao, including the EmbeddingModel for generating semantic vectors and the RerankerModel for optimizing search results. As a core component of Youdao's RAG technology stack, it has been applied to open-source projects like QAnything and Youdao products.

Model Features

Cross-language capability
Supports Chinese, English, Japanese, and Korean, as well as cross-language retrieval, optimized based on Youdao's translation engine
RAG-specific optimization
Optimized for real-world scenarios in education, law, finance, and other domains, suitable for tasks like Q&A and summarization
Long text processing
Breaks the 512-character limit, supporting long text reranking
Explainable scoring
Provides absolute quality scores (suggested threshold for filtering low-quality paragraphs is 0.35-0.4)
Two-stage retrieval
Best practice combining bce-embedding-base_v1 for recall and bce-reranker-base_v1 for fine ranking

Model Capabilities

Cross-language semantic matching
Text relevance scoring
Low-quality content filtering
Long text processing
Multi-domain adaptation

Use Cases

Retrieval-Augmented Generation (RAG)
Education Q&A
Textbook content retrieval and answer generation
Performed excellently in LlamaIndex RAG evaluations
Multilingual document processing
Semantic retrieval of mixed Chinese, English, Japanese, and Korean documents
Leading in cross-language scenario evaluations
Information filtering
Low-quality content identification
Filter irrelevant text fragments using absolute score thresholds
Suggested threshold is 0.35-0.4
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase