Wav2vec2 Xls R 300m Japanese
An automatic speech recognition (ASR) model for Japanese speech-to-text, fine-tuned from facebook/wav2vec2-xls-r-300m on the Japanese portion of the Common Voice 8.0 dataset.
Release Date: 3/2/2022
Model Overview
This model is tuned for Japanese speech recognition and transcribes Japanese speech into Hiragana and Katakana text. Because written Japanese does not separate words with spaces, word boundaries are ambiguous, so the model is evaluated primarily by Character Error Rate (CER) rather than Word Error Rate (WER).
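As an illustration of the metric, CER is commonly computed as the character-level edit distance between the reference and the recognized text, divided by the reference length. The sketch below is a minimal pure-Python version; the function names `edit_distance` and `cer` are chosen here for illustration and are not taken from this model's evaluation script.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference character
                curr[j - 1] + 1,     # insert a hypothesis character
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character Error Rate: edit distance divided by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# Two of five characters differ, so the CER is 2 / 5 = 0.4.
print(cer("こんにちは", "こんばんは"))
```

For a whole test set, the usual convention is to sum the edit distances over all utterances and divide by the total number of reference characters, rather than averaging per-utterance scores.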
Model Features
Japanese-specific Optimization
Trained specifically on Japanese speech, with output in Hiragana and Katakana
Kanji-to-Kana Conversion
Uses the pykakasi library to convert Kanji to Hiragana, which simplifies the recognition task: the model only has to predict Kana rather than thousands of distinct Kanji characters
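A minimal sketch of this kind of conversion, assuming pykakasi 2.x and its `kakasi().convert()` interface. The helper name `to_hiragana` is hypothetical, and whether this matches the exact preprocessing used for this model is an assumption.

```python
def to_hiragana(text: str) -> str:
    """Return `text` with its Kanji rendered as Hiragana via pykakasi."""
    # Third-party dependency (pip install pykakasi); imported lazily so this
    # snippet can be loaded even where the package is not installed.
    import pykakasi

    kks = pykakasi.kakasi()
    # convert() returns one dict per text segment; the "hira" entry holds the
    # Hiragana reading and "orig" holds the original segment.
    return "".join(segment["hira"] for segment in kks.convert(text))
```

Calling `to_hiragana` on a transcript before computing CER keeps the reference text in the same Kana form that the model produces.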
Large-scale Pretraining Foundation
Fine-tuned from Facebook's wav2vec2-xls-r-300m, a multilingual pretrained model that provides strong speech feature extraction
Model Capabilities
Japanese Speech Recognition
Speech-to-Text
Continuous Speech Processing
Use Cases
Speech Transcription
Japanese Speech Transcription
Convert Japanese speech content into text format
Achieves 23.64% CER on the Common Voice 8.0 test set
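Transcription with a model of this kind can be sketched with the Hugging Face `transformers` ASR pipeline. The repository ID below is a hypothetical placeholder, since this page does not state the model's hub path, and the helper names `load_asr` and `transcribe` are illustrative.

```python
from typing import Any, Callable

# Hypothetical placeholder: replace with the actual hub ID of the model.
MODEL_ID = "your-namespace/wav2vec2-xls-r-300m-japanese"


def load_asr(model_id: str = MODEL_ID) -> Callable[[str], Any]:
    """Build a speech recognition pipeline for the given model ID."""
    # Third-party dependency (pip install transformers torch); imported
    # lazily so that defining these helpers does not require it.
    from transformers import pipeline

    return pipeline("automatic-speech-recognition", model=model_id)


def transcribe(asr: Callable[[str], Any], audio_path: str) -> str:
    """Run the pipeline on one audio file and return the recognized text."""
    # The ASR pipeline returns a dict whose "text" entry is the transcript.
    return asr(audio_path)["text"]
```

A typical call would be `transcribe(load_asr(), "sample.wav")`, where `sample.wav` is any local recording. When given a file path, the pipeline decodes the audio with ffmpeg, so ffmpeg needs to be installed. Building the pipeline once and reusing it avoids reloading the model for every file.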
Voice Assistants
Japanese Voice Command Recognition
Recognize and understand Japanese voice commands