BGE Small EN v1.5 GGUF
Provides quantized and non-quantized embedding models in GGUF format, designed for llama.cpp, with significant CPU speed improvements over the original transformers implementation
Downloads 710
Release Time: 2/17/2024
Model Overview
A GGUF-format version of the BGE small English embedding model, suitable for text embedding tasks and available at multiple quantization levels
Model Features
GGUF Format Optimization
Designed specifically for llama.cpp, offering significant performance improvements over the original transformers implementation
Multiple Quantization Options
Provides quantization levels from F32 to Q4_K_M, balancing speed and accuracy
CPU Efficient Operation
Achieves up to 30% speed improvement on CPUs with minimal accuracy loss after quantization
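To illustrate why quantization costs so little accuracy, here is a toy round-trip through symmetric 8-bit block quantization. This is a simplified sketch for intuition only, not llama.cpp's actual Q4_K_M scheme; the sample vector is made up.

```python
def quantize_8bit(vec):
    """Symmetric 8-bit quantization: int8 values plus one shared scale.

    Simplified illustration, not llama.cpp's actual K-quant layout.
    """
    scale = max(abs(x) for x in vec) / 127 or 1.0
    q = [round(x / scale) for x in vec]
    return q, scale

def dequantize(q, scale):
    return [x * scale for x in q]

# Toy stand-in for one block of embedding weights
emb = [0.12, -0.58, 0.33, 0.91, -0.07, 0.44]
q, s = quantize_8bit(emb)
restored = dequantize(q, s)

# Worst-case reconstruction error is bounded by half the scale step
err = max(abs(x - y) for x, y in zip(emb, restored))
print(err)
```

Because the per-block scale adapts to the largest weight, the rounding error stays a tiny fraction of each value, which is why quantized embeddings remain close to their F32 counterparts.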
Model Capabilities
Text Embedding
Semantic Similarity Calculation
Information Retrieval
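Embedding vectors produced by the model are typically compared with cosine similarity. A minimal sketch, using short toy vectors in place of the model's real 384-dimensional BGE embeddings:

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: 1.0 means identical direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy 4-dim vectors standing in for real embeddings of a query and a document
query = [0.1, 0.3, 0.5, 0.2]
doc = [0.1, 0.25, 0.55, 0.2]
print(cosine_similarity(query, doc))
```

The same function drives all three capabilities above: embeddings feed it directly, and retrieval is just ranking candidates by this score.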
Use Cases
Search & Retrieval
Document Similarity Search
Calculate semantic similarity between documents
Efficiently find relevant content
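A document similarity search over precomputed embeddings reduces to ranking by cosine score. A minimal sketch, assuming embeddings have already been computed (the 3-dimensional vectors and document titles below are hypothetical stand-ins for real model output):

```python
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))

# Toy corpus: in practice each vector would be a 384-dim embedding from the model
corpus = {
    "refund policy": [0.9, 0.1, 0.0],
    "shipping times": [0.1, 0.8, 0.2],
    "return an item": [0.8, 0.2, 0.1],
}

# Toy embedding of a query like "how do I get my money back?"
query_vec = [0.85, 0.15, 0.05]

# Rank documents by similarity to the query, most relevant first
ranked = sorted(corpus, key=lambda d: cosine(query_vec, corpus[d]), reverse=True)
print(ranked[0])
```

For larger corpora the same ranking is usually delegated to a vector index rather than a linear scan, but the scoring function is unchanged.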
Natural Language Processing
Semantic Analysis
Extract semantic representations of text
For downstream NLP tasks