# cm4ker/USER-bge-m3-Q4_K_M-GGUF
This model targets sentence-similarity and feature-extraction tasks. It was converted to the GGUF format from the base model, enabling efficient inference with llama.cpp-based applications.
## Installation

### Install llama.cpp

You can install llama.cpp through brew (works on Mac and Linux):

```bash
brew install llama.cpp
```
## Usage Examples

### Use with llama.cpp

#### CLI

```bash
llama-cli --hf-repo cm4ker/USER-bge-m3-Q4_K_M-GGUF --hf-file user-bge-m3-q4_k_m.gguf -p "The meaning to life and the universe is"
```

#### Server

```bash
llama-server --hf-repo cm4ker/USER-bge-m3-Q4_K_M-GGUF --hf-file user-bge-m3-q4_k_m.gguf -c 2048
```
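Since this is an embedding model, the running server can also be queried programmatically. Below is a minimal Python sketch using only the standard library; it assumes the server was started with llama.cpp's `--embedding` flag so the `/embedding` endpoint is enabled, and that it listens on the default host and port (`127.0.0.1:8080`).

```python
import json
import urllib.request

# Assumes llama-server is running locally with the --embedding flag;
# host and port below are llama-server's defaults.
SERVER_URL = "http://127.0.0.1:8080"

def build_embedding_request(text: str) -> urllib.request.Request:
    """Build a POST request for the server's /embedding endpoint."""
    payload = json.dumps({"content": text}).encode("utf-8")
    return urllib.request.Request(
        f"{SERVER_URL}/embedding",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_embedding_request("Example sentence to embed")
# With a running server, send the request like this:
#   with urllib.request.urlopen(req) as resp:
#       embedding = json.loads(resp.read())
```

The request/response shape follows the llama.cpp server conventions; check the server's own documentation for the exact JSON schema of the embedding response.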
### Alternative Usage Steps

You can also use this checkpoint directly through the usage steps listed in the llama.cpp repository:

**Step 1: Clone llama.cpp from GitHub**

```bash
git clone https://github.com/ggerganov/llama.cpp
```
**Step 2: Build llama.cpp**

Move into the llama.cpp folder and build it with the `LLAMA_CURL=1` flag, along with any hardware-specific flags (for example, `LLAMA_CUDA=1` for Nvidia GPUs on Linux):

```bash
cd llama.cpp && LLAMA_CURL=1 make
```
**Step 3: Run Inference**

```bash
./llama-cli --hf-repo cm4ker/USER-bge-m3-Q4_K_M-GGUF --hf-file user-bge-m3-q4_k_m.gguf -p "The meaning to life and the universe is"
```

or

```bash
./llama-server --hf-repo cm4ker/USER-bge-m3-Q4_K_M-GGUF --hf-file user-bge-m3-q4_k_m.gguf -c 2048
```
## Documentation

This model was converted to GGUF format from deepvk/USER-bge-m3 using llama.cpp via ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.
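Because USER-bge-m3 is an embedding model, the typical downstream step is comparing the vectors it produces with cosine similarity. The sketch below is dependency-free; the embeddings are toy placeholders, not real model output.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy vectors standing in for embeddings returned by the model.
emb_query = [0.1, 0.3, 0.5]
emb_doc = [0.2, 0.2, 0.6]

score = cosine_similarity(emb_query, emb_doc)
print(f"similarity: {score:.4f}")
```

A score close to 1.0 indicates semantically similar sentences; in practice you would rank candidate documents by this score against a query embedding.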
## License

This model is licensed under the Apache 2.0 license.
## Model Information

| Property | Details |
|----------|---------|
| Base Model | deepvk/USER-bge-m3 |
| Datasets | deepvk/ru-HNP, deepvk/ru-WANLI, Shitao/bge-m3-data, RussianNLP/russian_super_glue, reciTAL/mlsum, Milana/russian_keywords, IlyaGusev/gazeta, d0rj/gsm8k-ru, bragovo/dsum_ru, CarlBrendt/Summ_Dialog_News |
| Language | ru |
| Library Name | sentence-transformers |
| Pipeline Tag | sentence-similarity |
| Tags | sentence-transformers, sentence-similarity, feature-extraction, llama-cpp, gguf-my-repo |