Wav2vec2 Large Xlsr 53 Th Cv8 Deepcut
W
Wav2vec2 Large Xlsr 53 Th Cv8 Deepcut
Developed by wannaphong
This model is a Thai automatic speech recognition model trained on the CommonVoice V8 dataset, incorporating the DeepCut tokenizer and language model to improve recognition accuracy.
Downloads 504
Release Time : 6/7/2022
Model Overview
This model fine-tunes wav2vec2-large-xlsr-53 using the Thai CommonVoice V8 dataset, specifically designed for Thai speech recognition tasks. It supports the DeepCut tokenizer and integrates a language model to enhance performance.
Model Features
Integrated Language Model
Incorporating a language model significantly improves recognition accuracy, reducing WER by approximately 3% on the test set.
Support for Multiple Tokenizers
Supports both DeepCut and Newmm Thai tokenizers, allowing selection of the optimal tokenization method based on requirements.
Multi-Dataset Training
Trained on both CommonVoice V7 and V8 datasets, enhancing the model's generalization capability.
Model Capabilities
Thai speech recognition
Support for multiple tokenization methods
High-accuracy speech-to-text
Use Cases
Speech Transcription
Thai Speech Transcription
Convert Thai speech content into text
Achieves 9.61% WER on the CommonVoice V8 test set
Voice Assistants
Thai Voice Command Recognition
Used for Thai voice assistant command recognition systems
Featured Recommended AI Models
Š 2025AIbase