W

Wav2vec2 Xls R 300m Khmer

Developed by vitouphy
This is a fine-tuned facebook/wav2vec2-xls-r-300m model based on the OpenSLR dataset, specifically designed for automatic speech recognition tasks in Khmer (km).
Downloads 2,321
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition system for Khmer, trained on a limited dataset (approximately 4 hours) and demonstrating decent recognition capabilities.

Model Features

Efficient training with small data
Achieved decent recognition results using only about 4 hours of training data (actual training duration 3.2 hours)
Language model support
Supports decoding with a language model (kenlm), significantly improving recognition accuracy
Lightweight deployment
Based on a 300M parameter model, relatively lightweight and suitable for practical deployment

Model Capabilities

Khmer speech recognition
Audio to text conversion
Speech content analysis

Use Cases

Speech transcription
Khmer speech to text
Convert Khmer speech content into text transcripts
WER 25.7%, CER 7.03%
Speech analysis
Khmer speech content analysis
Analyze keywords and content in Khmer speech
Featured Recommended AI Models
ยฉ 2025AIbase