
Longformer Base 4096 Sentence Transformers All Nli Stsb Quora Nq

Developed by Leo1212
This is a sentence-transformers model fine-tuned from allenai/longformer-base-4096, designed to generate 768-dimensional dense vector representations for sentences and paragraphs, suitable for semantic text similarity, semantic search, and related tasks.
Downloads 45
Release Time: 4/25/2025

Model Overview

The model maps sentences and paragraphs into a 768-dimensional dense vector space. The resulting embeddings can be used for semantic text similarity, semantic search, paraphrase mining, text classification, clustering, and other tasks.
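A minimal usage sketch follows, assuming the `sentence-transformers` package is installed and that the model is published on the Hugging Face Hub under the ID below. That ID is inferred from the model title and developer name and is not confirmed by this page, so verify it before use. The helper names `embed` and `cosine_similarity` are illustrative only.

```python
import math

# Assumed Hub ID, inferred from the model title and developer name; verify before use.
MODEL_ID = "Leo1212/longformer-base-4096-sentence-transformers-all-nli-stsb-quora-nq"


def embed(texts, model_id=MODEL_ID):
    """Encode a list of texts into dense vectors (768 dimensions for this model)."""
    # Imported lazily so the similarity helper below works without the library.
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_id)
    return model.encode(texts).tolist()  # one vector per input text


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)
```

Calling `embed(["How do I reset my password?", "I forgot my password."])` downloads the model on first use. Passing the two resulting vectors to `cosine_similarity` yields a score between -1 and 1, where higher means more similar.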

Model Features

Long Text Processing Capability
Built on the Longformer architecture, the model supports sequences up to 4098 tokens in length, making it suitable for long documents and paragraphs.
Multi-task Training
Jointly trained on multiple datasets (all-nli, stsb, quora, natural-questions), enhancing the model's generalization ability.
Multi-loss Function Optimization
Optimized using MultipleNegativesRankingLoss, SoftmaxLoss, and CoSENTLoss, improving performance across different tasks.
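To illustrate the idea behind one of the losses named above, here is a pure-Python sketch of an in-batch ranking loss in the style of MultipleNegativesRankingLoss. For each anchor, its paired positive is treated as the correct answer and every other positive in the batch as a negative, with cross-entropy applied to scaled cosine similarities. This is not the training code used for this model; the function names are hypothetical, and the scale of 20 is an assumption chosen to mirror the library's usual default.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))


def in_batch_ranking_loss(anchors, positives, scale=20.0):
    """Mean cross-entropy where positives[i] is the correct match for anchors[i]
    and all other positives in the batch act as negatives."""
    total = 0.0
    for i, anchor in enumerate(anchors):
        logits = [scale * cosine(anchor, p) for p in positives]
        # Numerically stable log-sum-exp over the batch.
        m = max(logits)
        log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
        total += log_sum_exp - logits[i]  # negative log-probability of the true pair
    return total / len(anchors)
```

The loss is near zero when each anchor is closest to its own positive and grows when the pairs are misaligned, which is what pushes matching texts together in the embedding space.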

Model Capabilities

Semantic text similarity calculation
Semantic search
Paraphrase mining
Text classification
Text clustering
Feature extraction
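Several of these capabilities reduce to ranking stored embeddings by similarity to a query embedding. The sketch below shows a brute-force top-k semantic search over precomputed vectors, assuming the embeddings have already been produced by the model. `top_k` is a hypothetical helper, and a real deployment with a large corpus would more likely use a vector index.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))


def top_k(query_vec, corpus_vecs, k=3):
    """Return (index, score) pairs for the k corpus vectors most similar to the query."""
    scored = [(i, cosine(query_vec, v)) for i, v in enumerate(corpus_vecs)]
    scored.sort(key=lambda pair: pair[1], reverse=True)  # best match first
    return scored[:k]
```

The returned indices point back into the original corpus, so the caller can look up the matching sentences or documents.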

Use Cases

Information Retrieval
Similar Question Matching
Finding semantically similar questions to user queries in Q&A systems
Accurately matches duplicate questions on platforms like Quora
Content Recommendation
Related Content Recommendation
Recommending articles or products based on content similarity
Can improve user engagement and conversion rates
Text Analysis
Text Clustering
Grouping large volumes of documents by semantic similarity
Useful for topic modeling and document organization
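The text clustering use case can be sketched with a simple greedy scheme over precomputed embeddings. Each vector joins the first cluster whose seed (first member) is at least `threshold` similar to it, and otherwise starts a new cluster. The function name and the 0.8 threshold are assumptions for illustration; a production pipeline would more likely use an established clustering algorithm.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))


def cluster_by_threshold(vectors, threshold=0.8):
    """Greedy clustering: returns a list of clusters, each a list of input indices."""
    clusters = []
    for i, vec in enumerate(vectors):
        for members in clusters:
            # Compare against the cluster's seed, i.e. its first member.
            if cosine(vectors[members[0]], vec) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])  # no cluster was close enough; start a new one
    return clusters
```

The result depends on input order and on the chosen threshold, so the threshold should be tuned on a sample of the documents being grouped.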