Wavlm VLSP Vi

Developed by phongdtd
A Vietnamese automatic speech recognition (ASR) model, fine-tuned from microsoft/wavlm-base-plus on the PHONGDTD/VINDATAVLSP - NA dataset.
Downloads 21
Release Time: 3/2/2022

Model Overview

This model is optimized for Vietnamese automatic speech recognition (ASR) and is fine-tuned from a WavLM base model.
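As a rough illustration of how a checkpoint like this is typically used, the sketch below wraps the Hugging Face `transformers` speech-recognition pipeline. The repository id in `MODEL_ID` is an assumption inferred from the title and developer name on this page and should be verified before use. The helper names `load_asr` and `transcribe` are hypothetical. WavLM models expect 16 kHz audio; when the pipeline is given a file path, it decodes and resamples the audio (this requires ffmpeg).

```python
from typing import Callable, Optional

# Assumption: this repository id is inferred from the card's title and
# developer name; it is not stated on this page. Verify it before use.
MODEL_ID = "phongdtd/wavLM-VLSP-vi"


def load_asr(model_id: str = MODEL_ID) -> Callable:
    """Build a speech-recognition pipeline (downloads weights on first use)."""
    # Imported lazily because transformers is a heavy dependency.
    from transformers import pipeline

    return pipeline("automatic-speech-recognition", model=model_id)


def transcribe(audio_path: str, asr: Optional[Callable] = None) -> str:
    """Return the transcript for one audio file.

    `asr` can be any callable that maps a file path to a dict with a
    "text" key; by default the pipeline from `load_asr` is used.
    """
    asr = asr or load_asr()
    result = asr(audio_path)
    return result["text"].strip()
```

Calling `transcribe("recording.wav")` would download the model on first use. Passing a preloaded pipeline through `asr` avoids reloading it for every file.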

Model Features

Vietnamese optimization
Fine-tuned specifically for Vietnamese speech recognition.
Based on WavLM architecture
Uses Microsoft's WavLM-base-plus as the base model, which supplies pretrained speech representations.
Multi-GPU training
Uses distributed multi-GPU training to improve training efficiency.
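To make the multi-GPU point concrete, here is a minimal sketch of the arithmetic behind data-parallel training: each GPU processes its own slice of the data, so the effective batch size grows with the number of devices. The function names and all numbers are hypothetical. This page does not state the actual batch size, GPU count, or training framework.

```python
from typing import List


def effective_batch_size(per_device: int, num_gpus: int, grad_accum_steps: int = 1) -> int:
    """Number of examples contributing to one optimizer step."""
    return per_device * num_gpus * grad_accum_steps


def shard_indices(num_examples: int, num_gpus: int, rank: int) -> List[int]:
    """Strided split of example indices for the GPU with the given rank."""
    return list(range(rank, num_examples, num_gpus))


# Hypothetical values, for illustration only.
print(effective_batch_size(per_device=8, num_gpus=2, grad_accum_steps=4))
print(shard_indices(num_examples=10, num_gpus=2, rank=1))
```

The strided split is similar in spirit to what distributed samplers do: every rank sees a disjoint subset, and gradients are averaged across ranks before each update.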

Model Capabilities

Vietnamese speech-to-text
Continuous speech recognition
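WavLM models fine-tuned for ASR commonly use a CTC output head, although this page does not confirm that for this checkpoint. Under that assumption, the sketch below shows how per-frame predictions become text in greedy CTC decoding: repeated ids are collapsed, then blank tokens are dropped. The toy vocabulary, the blank id of 0, and the `|` word delimiter are illustrative assumptions, not values read from this model.

```python
from typing import List, Sequence


def ctc_greedy_decode(frame_ids: Sequence[int], vocab: List[str], blank_id: int = 0) -> str:
    """Collapse repeated frame predictions and drop blanks (greedy CTC)."""
    chars = []
    prev = None
    for idx in frame_ids:
        # Emit a symbol only when it differs from the previous frame
        # and is not the blank token.
        if idx != prev and idx != blank_id:
            chars.append(vocab[idx])
        prev = idx
    # Assumption: "|" marks word boundaries, as in many CTC tokenizers.
    return "".join(chars).replace("|", " ").strip()


# Toy vocabulary: index 0 is the blank token, index 1 the word delimiter.
toy_vocab = ["<pad>", "|", "x", "i", "n"]
print(ctc_greedy_decode([2, 2, 0, 3, 3, 4, 0, 1, 1, 4, 0, 4], toy_vocab))
```

The blank between the last two `n` frames is what lets CTC emit a doubled letter; without it the two would be merged into one.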

Use Cases

Speech transcription
Vietnamese meeting minutes
Converts Vietnamese meeting recordings into text transcripts.
Voice assistant
Provides speech recognition for Vietnamese voice assistants.
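Meeting recordings are usually much longer than the short utterances ASR models are trained on, so long audio is commonly split into overlapping windows that are transcribed one at a time. The sketch below computes such windows. The function name and the 30-second and 5-second defaults are hypothetical choices, not settings taken from this model. The `transformers` ASR pipeline offers a comparable built-in mechanism for CTC models through its `chunk_length_s` and `stride_length_s` arguments.

```python
from typing import List, Tuple


def chunk_windows(total_s: float, chunk_s: float = 30.0, overlap_s: float = 5.0) -> List[Tuple[float, float]]:
    """Return (start, end) times in seconds covering a long recording."""
    if chunk_s <= 0 or not 0 <= overlap_s < chunk_s:
        raise ValueError("need chunk_s > 0 and 0 <= overlap_s < chunk_s")
    step = chunk_s - overlap_s  # how far each new window advances
    windows = []
    start = 0.0
    while start < total_s:
        end = min(start + chunk_s, total_s)
        windows.append((start, end))
        if end >= total_s:
            break  # the recording is fully covered
        start += step
    return windows


# A 70-second recording with 30-second windows overlapping by 5 seconds.
print(chunk_windows(70, chunk_s=30, overlap_s=5))
```

The overlap gives the model some context on both sides of each cut, which reduces errors on words that straddle a window boundary.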