Wav2vec2 Xls R 300m Japanese

Developed by vitouphy
This is a Japanese automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m and designed to transcribe Japanese audio into Hiragana text.
Downloads 29
Release Time: 3/2/2022

Model Overview

This model is a Japanese speech recognition model fine-tuned on the mozilla-foundation/common_voice_8_0 dataset, with special optimization for converting Japanese speech into Hiragana.

Model Features

Hiragana Transcription Optimization
All text is converted into Hiragana with pykakasi, so the model transcribes Japanese speech as Hiragana.
Multi-Dataset Validation
Validated on multiple datasets including Common Voice 8 and Robust Speech Events.
Language Model Support
Supports the use of Language Models (LM) to improve recognition accuracy.
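The Hiragana normalization mentioned above relies on pykakasi, which also resolves kanji readings. As a much smaller illustration of the same idea, the sketch below folds only Katakana into Hiragana using the fixed Unicode offset between the two blocks. It is not the author's preprocessing and does not handle kanji; the function name kata_to_hira is hypothetical.

```python
# Katakana letters U+30A1..U+30F6 sit exactly 0x60 code points above
# their Hiragana counterparts U+3041..U+3096.
KATA_START = 0x30A1
KATA_END = 0x30F6
OFFSET = 0x60


def kata_to_hira(text: str) -> str:
    """Fold Katakana letters to Hiragana; leave every other character as-is.

    Hypothetical helper for illustration only. Kanji are NOT converted;
    that step needs a reading dictionary such as the one pykakasi ships.
    """
    return "".join(
        chr(ord(ch) - OFFSET) if KATA_START <= ord(ch) <= KATA_END else ch
        for ch in text
    )


if __name__ == "__main__":
    print(kata_to_hira("カタカナ"))
```

The prolonged sound mark (ー) lies outside the mapped range, so it passes through unchanged.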

Model Capabilities

Japanese Speech Recognition
Audio to Text
Hiragana Transcription
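A minimal sketch of running the model with the Hugging Face transformers pipeline. The Hub identifier below is an assumption inferred from the developer name and model title on this page, and transcribe is a hypothetical helper; decoding an audio file by path also assumes ffmpeg is available on the system.

```python
# Assumed Hub identifier (developer name + model title); verify before use.
MODEL_ID = "vitouphy/wav2vec2-xls-r-300m-japanese"


def transcribe(audio_path: str) -> str:
    """Return the transcription of one audio file (hypothetical helper)."""
    # Imported lazily so this module loads even without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=MODEL_ID)
    # The ASR pipeline returns a dict whose "text" field holds the transcript.
    return asr(audio_path)["text"]


if __name__ == "__main__":
    import sys

    if len(sys.argv) > 1:
        print(transcribe(sys.argv[1]))
```

The pipeline is rebuilt on every call here for simplicity; a longer-running script would create it once and reuse it.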

Use Cases

Speech Transcription
Japanese Speech to Text
Convert Japanese speech content into Hiragana text.
CER 0.2754 (Common Voice 8 test set)
Speech Content Analysis
Analyze Japanese speech content and convert it into a processable text format.
CER 0.2487 (Robust Speech Events dev set)
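The CER figures above can be read with the usual definition of character error rate: the character-level edit distance between reference and hypothesis, divided by the reference length. The sketch below assumes that standard definition; it is not the evaluation script used to produce the reported numbers, and the function names are hypothetical.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # deleting the first i reference characters
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(
                min(
                    prev[j] + 1,         # deletion
                    curr[j - 1] + 1,     # insertion
                    prev[j - 1] + cost,  # substitution or match
                )
            )
        prev = curr
    return prev[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: edit distance divided by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    # One substituted character out of five reference characters.
    print(cer("こんにちは", "こんにちわ"))
```

Under this definition, a CER of 0.2754 corresponds to roughly 27.5 character edits per 100 reference characters.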