Sentence Transformers Experimental Hubert Hungarian
S
Sentence Transformers Experimental Hubert Hungarian
Developed by NYTK
Hungarian sentence embedding model fine-tuned based on the huBERT pre-trained model, specifically designed for sentence similarity tasks
Downloads 379
Release Time : 7/11/2023
Model Overview
This model obtains sentence embedding representations from huBERT output through mean pooling, suitable for Hungarian sentence similarity calculation tasks
Model Features
Hungarian optimization
Specially fine-tuned for Hungarian, trained on the Hunglish 2.0 parallel corpus
Efficient training
Training can be completed in just 15 hours using a single GTX 1080Ti GPU
Lightweight deployment
Maximum sequence length limited to 128 characters, suitable for resource-constrained environments
Model Capabilities
Sentence embedding generation
Sentence similarity calculation
Hungarian text processing
Use Cases
Text similarity
Semantic search
Used for semantic similarity search in Hungarian documents
Q&A system
Matching user questions with similar questions in the knowledge base
Natural language processing
Text clustering
Grouping semantically similar Hungarian sentences
Featured Recommended AI Models