S

Sts Distilcamembert Base

Developed by h4c5
This is a French sentence embedding model based on DistilCamemBERT, capable of encoding sentences or paragraphs into 768-dimensional vectors for tasks such as sentence similarity computation.
Downloads 48
Release Time : 2/26/2024

Model Overview

This model is obtained by fine-tuning the DistilCamemBERT base model using the sentence-transformers library, specifically designed for French sentence similarity computation and feature extraction tasks.

Model Features

Efficient distilled model
Based on DistilCamemBERT, the number of parameters is halved, inference time is shorter, while maintaining good performance.
French sentence embeddings
Optimized specifically for French text, capable of generating high-quality sentence embeddings.
High similarity computation accuracy
Achieves a Pearson correlation coefficient of 0.8165 on the STSb French dataset, demonstrating excellent performance.

Model Capabilities

French sentence embeddings
Sentence similarity computation
Text feature extraction

Use Cases

Text similarity
Semantic search
Can be used to build a French semantic search engine, returning results based on the semantic similarity between queries and documents.
Duplicate content detection
Identify text content with different expressions but similar semantics for content deduplication.
Information retrieval
Document clustering
Perform clustering analysis on French documents based on sentence embeddings.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase