V

Vietnamese Embedding

Developed by AITeamVN
Vietnamese embedding model fine-tuned on BGE-M3, enhancing Vietnamese retrieval capabilities
Downloads 14.26k
Release Time : 3/17/2025

Model Overview

Vietnamese_Embedding is an embedding model fine-tuned on the BGE-M3 model, specifically optimized for Vietnamese retrieval tasks, trained on approximately 300,000 sets of Vietnamese query, positive document, and negative document triplets.

Model Features

Vietnamese optimization
Fine-tuned specifically for Vietnamese retrieval tasks, improving the embedding quality of Vietnamese text
Long text support
Supports sequences up to 2048 tokens, suitable for processing long documents
High performance
Outperforms the base model BGE-M3 and other Vietnamese embedding models in legal text retrieval tasks

Model Capabilities

Vietnamese text embedding
Sentence similarity calculation
Document retrieval

Use Cases

Information retrieval
Legal document retrieval
Achieves high-accuracy document retrieval on legal text datasets
Accuracy@1 reaches 0.7274 on the Legal Zalo 2021 dataset
General document retrieval
Applicable to various Vietnamese document retrieval tasks
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase