Wav2vec2 Large Xlsr Thai Demo

Developed by sakares
A Thai speech recognition model obtained by fine-tuning facebook/wav2vec2-large-xlsr-53 on the Thai Common Voice dataset
Downloads 609
Release Time: 3/2/2022

Model Overview

This model is designed for Thai speech recognition. It is fine-tuned from the XLSR-53 pretrained model and expects audio input sampled at 16 kHz.

Model Features

Thai Language Optimization
Fine-tuned specifically on Thai speech to improve Thai recognition accuracy
Based on XLSR-53
Builds on the XLSR-53 cross-lingual speech representation model as its pretrained foundation
16 kHz Support
Expects audio input sampled at 16 kHz; audio recorded at other rates should be resampled first
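
Because the model expects 16 kHz input, audio recorded at another rate (for example 48 kHz) has to be resampled before recognition. The following is a minimal sketch of that step using plain linear interpolation in standard-library Python. The function name resample_linear is hypothetical and not part of the model's tooling, and the sketch applies no anti-aliasing filter, so a dedicated audio library resampler is preferable for real use.

```python
def resample_linear(samples, src_rate, dst_rate=16000):
    """Resample a mono signal to dst_rate by linear interpolation.

    Illustrative only: no low-pass filter is applied, so downsampling
    may introduce aliasing.
    """
    if src_rate <= 0 or dst_rate <= 0:
        raise ValueError("sample rates must be positive")
    if not samples or src_rate == dst_rate:
        return list(samples)

    n_out = max(1, round(len(samples) * dst_rate / src_rate))
    step = src_rate / dst_rate  # input samples advanced per output sample
    last = len(samples) - 1
    out = []
    for i in range(n_out):
        pos = i * step
        lo = min(int(pos), last)   # left neighbour, clamped to the signal
        hi = min(lo + 1, last)     # right neighbour, clamped to the signal
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


# One second of a 48 kHz ramp becomes one second at 16 kHz.
one_second = [float(i) for i in range(48000)]
resampled = resample_linear(one_second, 48000)
```

The output length scales with the ratio of the two rates, so one second of audio stays one second long after conversion.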

Model Capabilities

Thai Speech Recognition
Audio to Text Conversion

Use Cases

Speech Transcription
Thai Speech to Text
Converts Thai speech content into text
Achieved a word error rate (WER) of 44.46% on the Common Voice Thai test set
Voice Assistants
Thai Voice Command Recognition
Recognizes spoken Thai commands for voice assistant applications
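
The WER reported for the Common Voice Thai test set is the word-level edit distance between the model's transcript and the reference transcript, divided by the number of reference words. The sketch below shows that calculation in standard-library Python. The function name word_error_rate is hypothetical, and the code assumes both transcripts are already split into words by spaces. Thai text is normally written without spaces between words, so the word segmentation used for the reported figure is not known from this page and would affect the result.

```python
def word_error_rate(reference, hypothesis):
    """Return word-level edit distance divided by reference length.

    Assumes both strings are already segmented into space-separated words.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


# One substitution and one deletion against four reference words.
example = word_error_rate("a b c d", "a x c")
```

A WER can exceed 1.0 when the hypothesis contains many inserted words, because the denominator is the reference length only.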