B

Bge M3 Financial Matryoshka

Developed by haophancs
A financial domain sentence embedding model fine-tuned based on BAAI/bge-m3, supporting 1024-dimensional vector representation, suitable for semantic similarity and information retrieval tasks
Downloads 34
Release Time : 6/22/2024

Model Overview

This is a financial domain-specific model based on the sentence-transformers framework, capable of mapping text to high-dimensional vector spaces, primarily used for semantic text similarity calculation, information retrieval, and other tasks. The model is optimized for financial texts and is particularly suitable for processing professional content such as financial reports and business documents.

Model Features

Financial Domain Optimization
Specifically fine-tuned for financial texts, performing better when processing professional content such as financial reports and business documents
High-dimensional Vector Representation
Supports 1024-dimensional dense vector representation, capable of capturing richer semantic information
Multi-dimensional Evaluation
The model has been evaluated across multiple dimensions (1024/768/512/384) to ensure performance in different scenarios
Long Text Support
Maximum sequence length of 8192 tokens, suitable for processing longer financial documents

Model Capabilities

Semantic text similarity calculation
Semantic search
Paraphrase mining
Text classification
Text clustering
Financial document analysis

Use Cases

Financial Document Processing
Financial Report Retrieval
Quickly find the most relevant paragraphs from a large number of financial reports based on query questions
Achieved 71.7% accuracy@1 on the test set
Regulatory Document Analysis
Analyze relevant content of specific clauses in regulatory documents
Achieved 83.1% accuracy@3 on the test set
Business Intelligence
Business Report Similarity Analysis
Compare similar content across different business reports
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase