all-MiniLM-L2-v2 open-source model: nearly 2x faster inference with high accuracy on CPU and GPU

All MiniLM L2 V2

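Developed by tabularisai

This model is distilled from all-MiniLM-L12-v2. Inference is nearly 2x faster than all-MiniLM-L6-v2, while accuracy stays high on both CPU and GPU.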

Text embedding

Safetensors

Supports multiple languages | License: Apache-2.0 | #fast-text-embedding #retrieval-augmented-generation #lightweight-model

Downloads: 5,063

Released: 5/5/2025

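Model overview

An efficient text embedding model suited to tasks such as sentence similarity and retrieval-augmented generation (RAG).

Model features

Fast inference

Nearly 2x faster inference than all-MiniLM-L6-v2

High accuracy

Accuracy stays close to that of the original model while inference remains fast

Lightweight

Small model size, suitable for resource-constrained environments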

Model capabilities

Text embedding

Sentence similarity

Semantic retrieval

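Use cases

Information retrieval

Retrieval-augmented generation (RAG)

Serves as the retriever in a RAG pipeline to find relevant documents quickly

Improves retrieval speed and system response time

Semantic analysis

Sentence similarity

Computes the semantic similarity between two sentences

Useful for question answering systems, duplicate detection, and similar scenarios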
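The duplicate-detection use case can be sketched with cosine similarity, a common way to compare embeddings. This is a minimal illustration in plain Python so it runs without the model: the toy 3-dimensional vectors stand in for real embeddings, and both `is_duplicate` and its `threshold` of 0.9 are hypothetical choices, not values taken from the model card.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def is_duplicate(a, b, threshold=0.9):
    """Flag two vectors as near-duplicates (threshold is an assumed value)."""
    return cosine_similarity(a, b) >= threshold


# Toy vectors standing in for sentence embeddings (not real model output).
v1 = [0.9, 0.1, 0.0]
v2 = [0.8, 0.2, 0.0]  # points in almost the same direction as v1
v3 = [0.0, 0.1, 0.9]  # points in a very different direction

print(is_duplicate(v1, v2))
print(is_duplicate(v1, v3))
```

In practice the threshold would need tuning on labeled pairs from the target domain.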

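🚀 Fastest text embedding model: tabularisai/all-MiniLM-L2-v2

This model is distilled from sentence-transformers/all-MiniLM-L12-v2. Compared with all-MiniLM-L6-v2, the smallest model in the all-MiniLM series, it runs inference almost 2x faster while keeping high accuracy on both CPU and GPU.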

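🚀 Quick start

The model can be used for text embedding and retrieval-augmented generation (RAG). The sections below show how.

💻 Usage examples

Basic usage

Retrieval-augmented generation (RAG) example

The model can serve as the retriever in a RAG pipeline: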

import faiss
from sentence_transformers import SentenceTransformer

# Load embedding model
model = SentenceTransformer("tabularisai/all-MiniLM-L2-v2")

# Your 5 simple documents
documents = [
    "Renewable energy comes from natural sources.",
    "Solar panels convert sunlight into electricity.",
    "Wind turbines harness wind power.",
    "Fossil fuels are non-renewable sources of energy.",
    "Hydropower uses water to generate electricity."
]

# Embed documents
doc_embeddings = model.encode(documents, convert_to_numpy=True)

# Create FAISS index
dim = doc_embeddings.shape[1]
index = faiss.IndexFlatL2(dim)
index.add(doc_embeddings)

# Query
query = "What are the benefits of renewable energy?"
query_embedding = model.encode([query], convert_to_numpy=True)

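# Search for the 3 nearest documents.
# IndexFlatL2 returns squared L2 distances, so smaller values mean closer matches.
distances, indices = index.search(query_embedding, k=3)

# Print results
print("Query:", query)
print("\nTop 3 similar documents:")
for rank, idx in enumerate(indices[0]):
    print(f"{rank+1}. {documents[idx]} (squared L2 distance: {distances[0][rank]:.4f})")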
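The values returned by `IndexFlatL2` are squared L2 distances, not similarities. If the embeddings are normalized to unit length (an assumption here; `encode` accepts `normalize_embeddings=True` for this), a squared distance d² converts to a cosine similarity through the identity d² = 2 − 2·cos. The sketch below checks that identity on toy vectors in plain Python; the helper names are illustrative only.

```python
import math


def normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def squared_l2(a, b):
    """Squared Euclidean distance, the quantity IndexFlatL2 reports."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def cosine_from_squared_l2(d2):
    """Valid only for unit-length vectors: ||a - b||^2 = 2 - 2*cos(a, b)."""
    return 1.0 - d2 / 2.0


# Toy vectors standing in for normalized embeddings.
a = normalize([3.0, 4.0, 0.0])
b = normalize([4.0, 3.0, 0.0])

d2 = squared_l2(a, b)
cos_direct = sum(x * y for x, y in zip(a, b))

print("squared L2 distance:", round(d2, 4))
print("cosine from distance:", round(cosine_from_squared_l2(d2), 4))
print("cosine computed directly:", round(cos_direct, 4))
```

For unit-length vectors, ranking by smallest squared L2 distance and ranking by largest cosine similarity give the same order.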

Advanced usage

Sentence embedding example

First install the library:

pip install -U sentence-transformers

Then load the model and encode the sentences:

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("tabularisai/all-MiniLM-L2-v2")

sentences = [
    "The weather is lovely today.",
    "It's so sunny outside!",
    "He drove to the stadium.",
]

embeddings = model.encode(sentences)
print(embeddings.shape)  # (3, 384)

similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)  # torch.Size([3, 3])
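One way to read a pairwise similarity matrix like the one above is to look up, for each sentence, the most similar other sentence. The sketch below does this in plain Python on a nested list. The numbers in `toy_similarities` are made up for illustration and are not actual model output, and `most_similar_pairs` is a hypothetical helper.

```python
def most_similar_pairs(sim_matrix):
    """For each row i, return (i, j, score) for the best-scoring j != i."""
    pairs = []
    for i, row in enumerate(sim_matrix):
        # Skip the diagonal: every sentence is trivially most similar to itself.
        candidates = [(score, j) for j, score in enumerate(row) if j != i]
        best_score, best_j = max(candidates)
        pairs.append((i, best_j, best_score))
    return pairs


# Hypothetical similarity values for three sentences (not real model output).
toy_similarities = [
    [1.00, 0.67, 0.10],
    [0.67, 1.00, 0.14],
    [0.10, 0.14, 1.00],
]

pairs = most_similar_pairs(toy_similarities)
for i, j, score in pairs:
    print(f"sentence {i} is closest to sentence {j} (similarity {score:.2f})")
```

With real model output, the tensor returned by `model.similarity` could be converted first with `similarities.tolist()`.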