W

Wav2vec2 Large Xlsr 53 Th Cv8 Deepcut

Developed by wannaphong
This model is a Thai automatic speech recognition model trained on the CommonVoice V8 dataset, incorporating the DeepCut tokenizer and language model to improve recognition accuracy.
Downloads 504
Release Time : 6/7/2022

Model Overview

This model fine-tunes wav2vec2-large-xlsr-53 using the Thai CommonVoice V8 dataset, specifically designed for Thai speech recognition tasks. It supports the DeepCut tokenizer and integrates a language model to enhance performance.

Model Features

Integrated Language Model
Incorporating a language model significantly improves recognition accuracy, reducing WER by approximately 3% on the test set.
Support for Multiple Tokenizers
Supports both DeepCut and Newmm Thai tokenizers, allowing selection of the optimal tokenization method based on requirements.
Multi-Dataset Training
Trained on both CommonVoice V7 and V8 datasets, enhancing the model's generalization capability.

Model Capabilities

Thai speech recognition
Support for multiple tokenization methods
High-accuracy speech-to-text

Use Cases

Speech Transcription
Thai Speech Transcription
Convert Thai speech content into text
Achieves 9.61% WER on the CommonVoice V8 test set
Voice Assistants
Thai Voice Command Recognition
Used for Thai voice assistant command recognition systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase