Wavlm VLSP Vi
A Vietnamese automatic speech recognition model fine-tuned on the PHONGDTD/VINDATAVLSP - NA dataset based on microsoft/wavlm-base-plus
Downloads 21
Release Time : 3/2/2022
Model Overview
This model is optimized for Vietnamese automatic speech recognition (ASR) tasks, fine-tuned based on the WavLM architecture
Model Features
Vietnamese optimization
Specifically fine-tuned for Vietnamese speech recognition tasks
Based on WavLM architecture
Uses Microsoft's WavLM-base-plus as the base model, with powerful speech representation capabilities
Multi-GPU training
Utilizes distributed multi-GPU training to improve training efficiency
Model Capabilities
Vietnamese speech-to-text
Continuous speech recognition
Use Cases
Speech transcription
Vietnamese meeting minutes
Convert Vietnamese meeting recordings into text transcripts
Voice assistant
Provide speech recognition capabilities for Vietnamese voice assistants
Featured Recommended AI Models