BGE Base En v1.5 GGUF
This project provides the BGE embedding model in GGUF format for use with llama.cpp, which can deliver better inference performance than the original Transformers-based implementation.
Downloads 1,108
Release Time: 2/17/2024
Model Overview
A GGUF-format version of the BGE embedding model, focused on text embedding tasks and suited to scenarios that require efficient generation of embedding vectors.
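The GGUF files can be used for embedding extraction through llama-cpp-python, the Python bindings for llama.cpp. A minimal sketch follows; the local file name is an assumption and should point at whichever quantized GGUF file you downloaded from this repository.

```python
from llama_cpp import Llama

llm = Llama(
    model_path="bge-base-en-v1.5-q4_k_m.gguf",  # assumed local path to a downloaded file
    embedding=True,   # run llama.cpp in embedding mode
    verbose=False,
)

# bge-base-en-v1.5 is a BERT-base sized model, so each embedding has 768 dimensions.
vector = llm.embed("GGUF makes this embedding model easy to run with llama.cpp")
print(len(vector))
```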
Model Features
GGUF Format Optimization
Stored in GGUF format, the model loads directly into llama.cpp, which can bring significant performance improvements
Multiple Quantization Options
Four versions are provided (F32, F16, Q8_0, and Q4_K_M) to meet different precision and performance requirements
CPU Acceleration
The quantized versions can run up to 30% faster on CPU while keeping precision loss minimal; see the sketch after this list
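As a rough illustration of the precision/speed trade-off, the sketch below loads the smallest variant (Q4_K_M) and pins the CPU thread count. The file name, thread count, and batch size are assumptions chosen for illustration, not measured settings.

```python
import time
from llama_cpp import Llama

llm = Llama(
    model_path="bge-base-en-v1.5-q4_k_m.gguf",  # assumed: smallest of the four variants
    embedding=True,
    n_threads=8,      # assumed: set this to your physical core count
    verbose=False,
)

texts = [f"sentence {i} for a quick throughput check" for i in range(64)]

start = time.perf_counter()
vectors = llm.embed(texts)  # a list of strings is embedded as a batch
elapsed = time.perf_counter() - start
print(f"embedded {len(vectors)} texts in {elapsed:.2f}s")
```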
Model Capabilities
Text Embedding
Batch Processing
Efficient Inference
Use Cases
Information Retrieval
Document Similarity Calculation
Calculate the semantic similarity between documents
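A minimal similarity sketch, assuming llama-cpp-python and a locally downloaded Q8_0 file (the file name is illustrative):

```python
import numpy as np
from llama_cpp import Llama

llm = Llama(
    model_path="bge-base-en-v1.5-q8_0.gguf",  # assumed local file name
    embedding=True,
    verbose=False,
)

def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    a, b = np.asarray(a), np.asarray(b)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

doc_a = "GGUF is a file format used by llama.cpp."
doc_b = "llama.cpp loads models stored in the GGUF format."
print(cosine(llm.embed(doc_a), llm.embed(doc_b)))  # closer to 1.0 means more similar
```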
Natural Language Processing
Semantic Search
Build a search system based on semantics rather than keywords
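A toy semantic search sketch over an in-memory list of passages. The query instruction prefix follows the BGE authors' general recommendation for retrieval queries and should be verified against the upstream bge-base-en-v1.5 model card; the F16 file name is illustrative.

```python
import numpy as np
from llama_cpp import Llama

llm = Llama(
    model_path="bge-base-en-v1.5-f16.gguf",  # assumed local file name
    embedding=True,
    verbose=False,
)

passages = [
    "The GGUF format stores model weights for llama.cpp.",
    "Quantization reduces model size at a small cost in precision.",
    "Paris is the capital of France.",
]

# Embed and L2-normalize the passages so dot products equal cosine similarity.
passage_vecs = np.asarray(llm.embed(passages), dtype=np.float32)
passage_vecs /= np.linalg.norm(passage_vecs, axis=1, keepdims=True)

# BGE retrieval queries are usually prefixed with an instruction (assumption:
# check the upstream model card for the exact recommended string).
query = ("Represent this sentence for searching relevant passages: "
         "what file format does llama.cpp use?")
query_vec = np.asarray(llm.embed(query), dtype=np.float32)
query_vec /= np.linalg.norm(query_vec)

# Rank passages by similarity to the query, best match first.
scores = passage_vecs @ query_vec
for idx in np.argsort(-scores):
    print(f"{scores[idx]:.3f}  {passages[idx]}")
```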