bge-large-en-v1.5 GGUF
Provides quantized and non-quantized embedding models in GGUF format for use with llama.cpp. The format significantly improves inference speed on CPUs and offers moderate acceleration for large models on GPUs.
Downloads: 878
Release Time: 2/17/2024
Model Overview
This is a GGUF-format embedding model converted from BAAI/bge-large-en-v1.5, suitable for the llama.cpp framework, offering multiple quantization versions to optimize performance and resource usage.
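Loading one of these GGUF files and producing embeddings can be sketched with the llama-cpp-python bindings. This is a minimal sketch, not the card's official usage: the local filename below is a hypothetical example, and the model file must be downloaded before the embedding call will run.

```python
from pathlib import Path

# Hypothetical local filename for one of the quantized variants.
MODEL_PATH = "bge-large-en-v1.5-q4_k_m.gguf"

def embed_texts(texts, model_path=MODEL_PATH):
    """Return one embedding vector per input text."""
    from llama_cpp import Llama  # pip install llama-cpp-python
    llm = Llama(model_path=model_path, embedding=True, verbose=False)
    return [llm.embed(t) for t in texts]

# Only attempt inference if the model file is actually present.
if Path(MODEL_PATH).exists():
    vectors = embed_texts(["What is a GGUF file?"])
    print(len(vectors[0]))  # bge-large embeddings are 1024-dimensional
```

The same file also works with llama.cpp's own CLI tools, since GGUF is llama.cpp's native format.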
Model Features
GGUF Format Optimization
Format specifically designed for llama.cpp, significantly improving speed on CPUs
Multiple Quantization Options
Offers various quantization levels from F32 to Q4_K_M, balancing precision and performance
CPU Efficiency
Achieves up to 30% speed improvement on CPUs with minimal precision loss
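The trade-off between quantization levels is easiest to see as approximate file sizes. The bits-per-weight figures below are rough estimates (actual sizes vary with metadata and per-layer choices), and the ~335M parameter count for bge-large-en-v1.5 is approximate:

```python
# Approximate bits per weight for common GGUF quantization levels
# (rough figures; exact file sizes vary with metadata and layer mix).
PARAMS = 335_000_000  # approximate parameter count of bge-large-en-v1.5
BITS_PER_WEIGHT = {"F32": 32.0, "F16": 16.0, "Q8_0": 8.5, "Q4_K_M": 4.85}

def approx_size_mb(bits_per_weight):
    """Estimated on-disk size in MB for the weight tensors alone."""
    return PARAMS * bits_per_weight / 8 / 1024**2

for name, bits in BITS_PER_WEIGHT.items():
    print(f"{name}: ~{approx_size_mb(bits):.0f} MB")
```

The roughly 6-7x size reduction from F32 to Q4_K_M is what makes the smaller variants attractive for CPU-bound deployments.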
Model Capabilities
Text Embedding
Semantic Similarity Calculation
Information Retrieval
Use Cases
Information Retrieval
Document Search
Convert queries and documents into embedding vectors for similarity matching
Improves search relevance and efficiency
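The matching step described above reduces to ranking documents by cosine similarity against the query vector. A minimal sketch with toy 4-dimensional vectors standing in for real 1024-dimensional embeddings:

```python
import numpy as np

def rank(query_vec, doc_vecs):
    """Return document indices sorted by cosine similarity, best first."""
    q = np.asarray(query_vec, dtype=float)
    D = np.asarray(doc_vecs, dtype=float)
    sims = D @ q / (np.linalg.norm(D, axis=1) * np.linalg.norm(q))
    return list(np.argsort(-sims))

# Toy vectors standing in for real embeddings of a query and three docs.
query = [1.0, 0.0, 0.5, 0.0]
docs = [[0.0, 1.0, 0.0, 1.0],   # unrelated to the query
        [0.9, 0.1, 0.6, 0.0],   # close to the query
        [0.2, 0.8, 0.1, 0.4]]
print(rank(query, docs))  # → [1, 2, 0]
```

In a real pipeline the toy vectors are replaced by embeddings from the model, and the ranking is typically delegated to a vector index rather than a dense matrix product.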
Semantic Analysis
Text Clustering
Group similar texts based on embedding vectors
Reveals latent patterns and themes in text data
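Grouping texts by embedding can be sketched with a simple greedy clusterer: each vector joins the first cluster whose centroid it is cosine-similar to, or starts a new one. This is an illustrative stand-in for a real clustering algorithm (k-means, HDBSCAN, etc.), using toy 2-dimensional vectors in place of model embeddings:

```python
import numpy as np

def cluster_by_cosine(vecs, threshold=0.8):
    """Greedy grouping: each vector joins the first cluster whose
    centroid is within the cosine-similarity threshold."""
    clusters, centroids = [], []
    for i, v in enumerate(np.asarray(vecs, dtype=float)):
        v = v / np.linalg.norm(v)
        for c, mu in enumerate(centroids):
            if v @ mu / np.linalg.norm(mu) >= threshold:
                clusters[c].append(i)
                centroids[c] = mu + v  # running (unnormalized) centroid
                break
        else:
            clusters.append([i])
            centroids.append(v)
    return clusters

# Toy 2-D vectors standing in for sentence embeddings of two topics.
vecs = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.0, 0.9]]
print(cluster_by_cosine(vecs))  # → [[0, 1], [2, 3]]
```

With real embeddings the threshold would need tuning, and a proper clustering library is usually the better choice; the sketch only shows how cosine similarity drives the grouping.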