Fb Vindata Vi Large
A Vietnamese automatic speech recognition model, fine-tuned from facebook/wav2vec2-large-xlsr-53 on the PHONGDTD/VINDATAVLSP - NA dataset.
Downloads 29
Release Time: 3/2/2022
Model Overview
An automatic speech recognition model optimized for Vietnamese, fine-tuned from the wav2vec2-large-xlsr-53 base model.
Model Features
Vietnamese optimization
Specially fine-tuned for Vietnamese speech recognition tasks
Based on wav2vec2 architecture
Uses Facebook's wav2vec2-large-xlsr-53 as the base model
Multi-GPU training
Distributed training using 2 GPUs
Model Capabilities
Vietnamese speech recognition
Speech-to-text
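For readers wondering how a wav2vec2 model turns audio into text: fine-tuned wav2vec2 checkpoints usually emit one token prediction per audio frame and are decoded with CTC, which merges consecutive repeats and then drops the blank token. The sketch below illustrates that greedy decoding step only. It assumes this checkpoint follows the usual CTC recipe (the page does not say so explicitly), and the vocabulary, token IDs, and function name are toy values invented for illustration, not the model's real vocabulary.

```python
from itertools import groupby


def ctc_greedy_decode(frame_ids, id_to_token, blank_id=0, word_delimiter="|"):
    """Greedy CTC decoding: merge repeated frame predictions, then drop blanks.

    frame_ids: one predicted token ID per audio frame (argmax of the logits).
    id_to_token: mapping from token ID to character (toy vocabulary here).
    """
    # Step 1: collapse runs of identical consecutive IDs into a single ID.
    collapsed = [token_id for token_id, _ in groupby(frame_ids)]
    # Step 2: remove the blank token, which only separates repeated characters.
    tokens = [id_to_token[i] for i in collapsed if i != blank_id]
    # Step 3: join characters and turn the word delimiter into spaces.
    return "".join(tokens).replace(word_delimiter, " ").strip()


# Toy vocabulary (hypothetical): ID 0 acts as the CTC blank, "|" marks word boundaries.
vocab = {0: "<pad>", 1: "|", 2: "x", 3: "i", 4: "n", 5: "c", 6: "h", 7: "à", 8: "o"}

# Hypothetical per-frame predictions for a short utterance.
frames = [2, 2, 0, 3, 4, 4, 0, 1, 1, 5, 6, 0, 7, 7, 8, 0]

print(ctc_greedy_decode(frames, vocab))
```

A blank between two identical IDs keeps them as separate characters, which is how CTC can represent doubled letters.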
Use Cases
Speech transcription
Vietnamese speech transcription
Convert Vietnamese speech content into text
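A minimal transcription sketch with the Hugging Face transformers library might look like the following. The repository ID is a hypothetical placeholder, because the page does not give the exact Hub path, and the helper name `transcribe` is likewise made up for illustration. It assumes transformers and a PyTorch backend are installed and that ffmpeg is available to decode audio files.

```python
# Hypothetical placeholder: replace with the model's actual Hub repository ID.
MODEL_ID = "<namespace>/fb-vindata-vi-large"


def transcribe(audio_path, model_id=MODEL_ID):
    """Transcribe a Vietnamese audio file to text (sketch, not the author's script)."""
    # Imported lazily so this module loads even where transformers is not installed.
    from transformers import pipeline

    # The ASR pipeline loads the model and its processor from the given repository.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    # For a file path input, the pipeline decodes and resamples the audio itself.
    result = asr(audio_path)
    return result["text"]
```

Calling `transcribe("sample.wav")` with a real repository ID would download the checkpoint on first use. wav2vec2 models expect 16 kHz mono audio, so raw arrays passed in place of a file path should already be at that sampling rate.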