S

Sbert Roberta Large Anli Mnli Snli

Developed by usc-isi
A sentence transformation model based on RoBERTa-large, specifically designed for sentence similarity tasks, trained on ANLI, MNLI, and SNLI datasets
Downloads 38
Release Time : 3/2/2022

Model Overview

This model can map sentences and paragraphs into a 768-dimensional vector space, suitable for natural language processing tasks such as semantic search and clustering

Model Features

High-Quality Sentence Embeddings
Generates high-quality sentence embeddings based on the RoBERTa-large architecture
Multi-dataset Training
Jointly trained on three authoritative natural language inference datasets: ANLI, MNLI, and SNLI
Efficient Pooling Strategy
Utilizes mean pooling to effectively aggregate word embedding information

Model Capabilities

Sentence Vectorization
Semantic Similarity Calculation
Text Clustering
Semantic Search

Use Cases

Information Retrieval
Semantic Search System
Build a search system based on semantics rather than keywords
Improves the relevance of search results
Text Analysis
Document Clustering
Automatically group semantically similar documents
Enables unsupervised document organization
Natural Language Understanding
Sentence Similarity Calculation
Calculate the semantic similarity between two sentences
Can be used in applications like question-answering systems and paraphrase detection
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase