W

Wav2vec2 Xls R 300m Japanese

Developed by AndrewMcDowell
This is an automatic speech recognition (ASR) model fine-tuned on the Japanese Common Voice 8.0 dataset based on facebook/wav2vec2-xls-r-300m, supporting Japanese speech-to-text functionality.
Downloads 24
Release Time : 3/2/2022

Model Overview

This model is specifically optimized for Japanese speech recognition tasks, capable of converting Japanese speech into Hiragana and Katakana text. Due to the characteristics of Japanese writing, the model primarily uses Character Error Rate (CER) rather than Word Error Rate (WER) for evaluation.

Model Features

Japanese-specific Optimization
Specially trained and optimized for Japanese speech characteristics, supporting Hiragana and Katakana output
Kanji-to-Kana Conversion
Uses the pykakasi library to convert Kanji to Hiragana, simplifying the recognition task
Large-scale Pretraining Foundation
Fine-tuned based on facebook's wav2vec2-xls-r-300m model, featuring powerful speech feature extraction capabilities

Model Capabilities

Japanese Speech Recognition
Speech-to-Text
Continuous Speech Processing

Use Cases

Speech Transcription
Japanese Speech Transcription
Convert Japanese speech content into text format
Achieves 23.64% CER on the Common Voice 8.0 test set
Voice Assistants
Japanese Voice Command Recognition
Recognize and understand Japanese voice commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase