W

Wav2vec2 Large Xlsr 53 Vietnamese

Developed by anuragshas
A Vietnamese automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained using the Common Voice dataset.
Downloads 279
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model for Vietnamese, fine-tuned based on the Wav2Vec2-Large-XLSR-53 architecture, supporting 16kHz sampled audio input.

Model Features

Vietnamese-specific
Speech recognition model specifically optimized for Vietnamese
Based on XLSR pre-trained model
Built upon the powerful wav2vec2-large-xlsr-53 pre-trained model
16kHz sampling rate support
Supports processing of 16kHz sampled audio input

Model Capabilities

Vietnamese speech recognition
Speech-to-text
Automatic speech transcription

Use Cases

Speech transcription
Vietnamese speech transcription
Convert Vietnamese speech to text
Word Error Rate (WER) 66.78%
Voice assistants
Vietnamese voice command recognition
For Vietnamese voice assistants or smart home devices
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase