Sbert Uncased Finnish Paraphrase

Developed by TurkuNLP
Finnish Sentence-BERT model trained from FinBERT, used for sentence similarity calculation and feature extraction
Downloads 895
Release Time: 3/2/2022

Model Overview

This is a sentence transformer model trained from FinBERT and designed for Finnish sentence similarity calculation and feature extraction. It produces sentence embeddings by mean pooling the token embeddings and is suited to tasks such as paraphrase identification.

Model Features

Case-insensitive
The model is uncased, so it treats upper- and lowercase forms of Finnish text identically
Finnish paraphrase training data
Trained on Finnish paraphrase corpora and automatically collected paraphrase candidate sentences (500,000 positive examples and 5 million negative examples)
Mean-pooled sentence embeddings
Generates sentence-level embeddings by mean pooling the token embeddings
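
To make the mean pooling step concrete, here is a minimal sketch in plain Python. It is not the model's own code: the function name mean_pool, the toy 2-dimensional vectors, and the use of an attention mask to skip padding tokens are assumptions chosen for illustration (the real model works on 768-dimensional token vectors).

```python
from typing import List


def mean_pool(token_embeddings: List[List[float]], attention_mask: List[int]) -> List[float]:
    """Average the token vectors whose mask entry is 1, ignoring padding positions."""
    if len(token_embeddings) != len(attention_mask):
        raise ValueError("one mask entry is required per token embedding")
    kept = [vec for vec, keep in zip(token_embeddings, attention_mask) if keep]
    if not kept:
        raise ValueError("attention mask selects no tokens")
    dim = len(kept[0])
    # Sum each dimension over the kept tokens, then divide by the token count.
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


# Toy input (hypothetical values): three real tokens and one padding token.
tokens = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [9.0, 9.0]]
mask = [1, 1, 1, 0]
sentence_embedding = mean_pool(tokens, mask)
print(sentence_embedding)  # [3.0, 4.0]
```

The padding vector [9.0, 9.0] does not affect the result because its mask entry is 0.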

Model Capabilities

Sentence feature extraction
Sentence similarity calculation
Semantic similarity comparison
Finnish text processing

Use Cases

Text similarity
Paraphrase identification
Identify whether two Finnish sentences are paraphrases
Performs well on Finnish paraphrase corpora
Semantic search
Retrieve semantically similar sentences from large-scale text
Can be used to build a semantic retrieval system over a collection of 4 million sentences
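
Both text similarity use cases reduce to comparing embedding vectors. The sketch below assumes cosine similarity as the comparison measure, a hypothetical decision threshold of 0.8 for paraphrase identification, and a brute-force scan for retrieval; none of these choices are stated by the model's authors, and the 3-dimensional vectors stand in for real 768-dimensional embeddings.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def is_paraphrase(a: Sequence[float], b: Sequence[float], threshold: float = 0.8) -> bool:
    """Hypothetical rule: call two sentences paraphrases above a tuned threshold."""
    return cosine_similarity(a, b) >= threshold


def top_k(query: Sequence[float], corpus: List[Sequence[float]], k: int = 2) -> List[Tuple[int, float]]:
    """Brute-force search: score every corpus vector and keep the k best."""
    scored = [(i, cosine_similarity(query, vec)) for i, vec in enumerate(corpus)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy embeddings (hypothetical values) standing in for encoded sentences.
corpus = [[1.0, 0.0, 0.0], [0.6, 0.8, 0.0], [0.0, 1.0, 0.0]]
query = [1.0, 0.05, 0.0]

hits = top_k(query, corpus, k=2)
print([index for index, _ in hits])      # [0, 1]
print(is_paraphrase(query, corpus[0]))   # True
print(is_paraphrase(query, corpus[2]))   # False
```

A linear scan like this is easy to reason about but becomes slow for collections in the millions, where an approximate nearest-neighbor index is the usual alternative. The threshold should be tuned on labeled paraphrase pairs rather than taken from this sketch.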
Feature extraction
Sentence embedding generation
Generate sentence-level feature representations for downstream tasks
Produces 768-dimensional sentence embedding vectors
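
The embedding size translates directly into storage cost for a retrieval index. The following back-of-the-envelope sketch assumes each of the 768 values is stored as a 4-byte float (float32) with no compression or index overhead, and uses the 4 million sentence figure mentioned under Semantic search; the function name is hypothetical.

```python
def embedding_store_bytes(num_sentences: int, dim: int = 768, bytes_per_value: int = 4) -> int:
    """Raw size of a dense embedding matrix, excluding any index overhead."""
    return num_sentences * dim * bytes_per_value


per_sentence = embedding_store_bytes(1)
total = embedding_store_bytes(4_000_000)
print(per_sentence)  # 3072
print(total)         # 12288000000
```

Under these assumptions the raw vectors for 4 million sentences take roughly 12.3 GB, which is worth knowing before deciding whether the index fits in memory.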