W

Wav2vec2 Xlsr Khmer

Developed by gagan3012
A Khmer speech recognition model fine-tuned on the facebook/wav2vec2-large-xlsr-53 model, achieving a WER of 24.96% on the OpenSLR Khmer dataset.
Downloads 172
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) system for Khmer, fine-tuned from Facebook's wav2vec2-large-xlsr-53 model, supporting voice input with a 16kHz sampling rate.

Model Features

High-accuracy Khmer recognition
Achieves a WER of 24.96% on the OpenSLR Khmer test set, demonstrating excellent performance.
Based on XLSR large model
Fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, with strong cross-language speech representation capabilities.
No language model required
Can be used directly without additional language model support.

Model Capabilities

Khmer speech recognition
16kHz audio processing
End-to-end speech-to-text

Use Cases

Speech transcription
Khmer speech-to-text
Convert Khmer speech content into text
WER 24.96%
Voice assistant
Khmer voice command recognition
Used for command recognition in Khmer voice assistant systems
Featured Recommended AI Models
┬й 2025AIbase