Longformer Base 4096 Sentence Transformers All Nli Stsb Quora Nq
This is a sentence-transformers model fine-tuned from allenai/longformer-base-4096, designed to generate 768-dimensional dense vector representations for sentences and paragraphs, suitable for semantic text similarity, semantic search, and related tasks.
Downloads 45
Release Time : 4/25/2025
Model Overview
The model maps sentences and paragraphs into a 768-dimensional dense vector space, applicable for semantic text similarity, semantic search, paraphrase mining, text classification, clustering, and other tasks.
Model Features
Long Text Processing Capability
Based on the Longformer architecture, supports sequences up to 4098 tokens in length, suitable for processing long documents and paragraphs.
Multi-task Training
Jointly trained on multiple datasets (all-nli, stsb, quora, natural-questions), enhancing the model's generalization ability.
Multi-loss Function Optimization
Optimized using MultipleNegativesRankingLoss, SoftmaxLoss, and CoSENTLoss, improving performance across different tasks.
Model Capabilities
Semantic text similarity calculation
Semantic search
Paraphrase mining
Text classification
Text clustering
Feature extraction
Use Cases
Information Retrieval
Similar Question Matching
Finding semantically similar questions to user queries in Q&A systems
Accurately matches duplicate questions on platforms like Quora
Content Recommendation
Related Content Recommendation
Recommending articles or products based on content similarity
Can improve user engagement and conversion rates
Text Analysis
Text Clustering
Grouping large volumes of documents by semantic similarity
Useful for topic modeling and document organization
Featured Recommended AI Models
Š 2025AIbase