Fb Vindata Vi Large

Developed by phongdtd
This model is a Vietnamese automatic speech recognition model, fine-tuned from facebook/wav2vec2-large-xlsr-53 on the PHONGDTD/VINDATAVLSP - NA dataset.
Downloads 29
Release Time: 3/2/2022

Model Overview

An automatic speech recognition model optimized for Vietnamese, fine-tuned from the wav2vec2-large-xlsr-53 base model.

Model Features

Vietnamese optimization
Fine-tuned specifically for Vietnamese speech recognition tasks.
Based on the wav2vec2 architecture
Uses Facebook's wav2vec2-large-xlsr-53 as the base model.
Multi-GPU training
Trained with distributed training across 2 GPUs.

Model Capabilities

Vietnamese speech recognition
Speech-to-text
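
To illustrate how a speech-to-text model of this kind typically turns per-frame predictions into text, here is a minimal sketch of greedy CTC decoding. It assumes the model follows the usual CTC fine-tuning recipe for wav2vec2 checkpoints, which this page does not confirm. The toy vocabulary, the blank id of 0, and the "|" word delimiter are illustrative assumptions, not values taken from this model.

```python
from itertools import groupby


def ctc_greedy_decode(frame_ids, id_to_token, blank_id=0, word_delimiter="|"):
    """Collapse repeated frame predictions, drop blanks, and join tokens."""
    # Step 1: merge consecutive duplicates (one character spans many frames).
    collapsed = [token_id for token_id, _ in groupby(frame_ids)]
    # Step 2: remove the CTC blank symbol.
    tokens = [id_to_token[i] for i in collapsed if i != blank_id]
    # Step 3: join characters and turn the word delimiter into spaces.
    return "".join(tokens).replace(word_delimiter, " ").strip()


# Hypothetical toy vocabulary (assumption; the real one ships with the model).
vocab = {0: "<pad>", 1: "|", 2: "x", 3: "i", 4: "n", 5: "c", 6: "h", 7: "à", 8: "o"}

# Hypothetical per-frame argmax ids, as if taken from the model's output logits.
frames = [2, 2, 3, 0, 4, 4, 1, 1, 5, 6, 0, 7, 7, 8]

print(ctc_greedy_decode(frames, vocab))  # xin chào
```

The blank symbol is what lets a decoder keep genuinely doubled letters: two identical ids separated by a blank survive the collapse step, while two adjacent identical ids are merged into one.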

Use Cases

Speech transcription
Vietnamese speech transcription: converts Vietnamese speech content into text.
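
A transcription call might look like the sketch below, which uses the Hugging Face transformers automatic-speech-recognition pipeline. The Hub identifier phongdtd/fb-vindata-vi-large is an assumption inferred from the developer and model names on this page, and the helper name transcribe is hypothetical. Running it requires transformers, a PyTorch backend, and ffmpeg for decoding audio files.

```python
# Assumed Hub id, inferred from the page title and developer name (not confirmed).
MODEL_ID = "phongdtd/fb-vindata-vi-large"


def transcribe(audio_path, model_id=MODEL_ID):
    """Return the transcript of one audio file (hypothetical helper)."""
    # Imported lazily so that defining this function does not require
    # transformers to be installed.
    from transformers import pipeline

    # The pipeline loads the model and its processor, decodes the file,
    # and resamples it to the sampling rate the model expects.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    return asr(audio_path)["text"]


if __name__ == "__main__":
    import sys

    # Usage: python transcribe.py path/to/vietnamese_audio.wav
    if len(sys.argv) > 1:
        print(transcribe(sys.argv[1]))
```

Building the pipeline once and reusing it is preferable when transcribing many files, since loading a large checkpoint on every call is slow.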