Wav2vec2 Large Xlsr Japanese

Developed by vumichien
A fine-tuned model based on facebook/wav2vec2-large-xlsr-53 for Japanese speech recognition tasks.
Downloads 214
Release Time: 3/2/2022

Model Overview

This is a Japanese speech recognition model built on the pretrained XLSR-53 model and fine-tuned on the Common Voice and JSUT datasets. It is suited to Japanese speech-to-text tasks.

Model Features

Japanese Speech Recognition
A speech recognition model specifically optimized for Japanese, supporting Japanese speech-to-text conversion.
Fine-tuned on XLSR-53
Fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, inheriting its pretrained speech feature extraction capabilities.
Multi-dataset Training
Trained using the Common Voice and JSUT Japanese speech corpora, enhancing the model's generalization ability.

Model Capabilities

Japanese Speech Recognition
Speech-to-Text
16kHz Sampling Rate Speech Processing
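
The capabilities above can be sketched as a short transcription routine. This is a minimal sketch, not the author's published usage: it assumes the checkpoint is hosted on the Hugging Face Hub under the id `vumichien/wav2vec2-large-xlsr-japanese` (the id is not stated on this page), that it loads with the standard `Wav2Vec2Processor` and `Wav2Vec2ForCTC` classes from `transformers`, and that `torch` and `torchaudio` are installed. The function name `transcribe` is hypothetical. Audio is downmixed to mono and resampled to 16 kHz before it is passed to the model.

```python
TARGET_SAMPLE_RATE = 16_000  # the model processes 16 kHz speech

# Assumption: Hub id inferred from the model name and developer; verify before use.
MODEL_ID = "vumichien/wav2vec2-large-xlsr-japanese"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file using greedy (argmax) decoding."""
    # Heavy dependencies are imported lazily so this module can be
    # imported even where they are not installed.
    import torch
    import torchaudio
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)
    model.eval()

    # torchaudio.load returns a (channels, samples) tensor and the file's sample rate.
    waveform, sample_rate = torchaudio.load(audio_path)
    waveform = waveform.mean(dim=0)  # downmix to mono
    if sample_rate != TARGET_SAMPLE_RATE:
        waveform = torchaudio.functional.resample(
            waveform, sample_rate, TARGET_SAMPLE_RATE
        )

    inputs = processor(
        waveform.numpy(), sampling_rate=TARGET_SAMPLE_RATE, return_tensors="pt"
    )
    with torch.no_grad():
        logits = model(**inputs).logits

    # Pick the most likely token per frame; batch_decode collapses
    # repeated tokens and removes blanks.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

Calling `transcribe("sample.wav")` with a hypothetical local file would download the checkpoint on first use and return the recognized Japanese text as a string.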

Use Cases

Speech Transcription
Japanese Speech Transcription
Convert Japanese speech content into text format
WER: 30.84%, CER: 17.85%
Voice Assistants
Japanese Voice Command Recognition
Recognize and understand Japanese voice commands
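
The WER and CER figures listed under Speech Transcription are edit-distance ratios: the number of substitutions, insertions, and deletions needed to turn the model output into the reference, divided by the reference length in words (WER) or characters (CER). The sketch below shows one common way to compute them in plain Python. It is an illustration, not the evaluation script behind the reported numbers. Two assumptions are made: Japanese text has already been segmented into space-separated words (for example with a morphological analyzer) before WER is computed, and spaces are ignored for CER. All function names are hypothetical.

```python
from typing import Sequence


def edit_distance(ref: Sequence, hyp: Sequence) -> int:
    """Levenshtein distance between two sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(
                min(
                    prev[j] + 1,         # deletion
                    curr[j - 1] + 1,     # insertion
                    prev[j - 1] + cost,  # substitution or match
                )
            )
        prev = curr
    return prev[-1]


def error_rate(refs: Sequence[Sequence], hyps: Sequence[Sequence]) -> float:
    """Total edit distance divided by total reference length over a corpus."""
    total_len = sum(len(r) for r in refs)
    if total_len == 0:
        raise ValueError("references must contain at least one token")
    total_edits = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    return total_edits / total_len


def cer(refs: Sequence[str], hyps: Sequence[str]) -> float:
    """Character error rate. Assumption: spaces are ignored."""
    return error_rate(
        [r.replace(" ", "") for r in refs],
        [h.replace(" ", "") for h in hyps],
    )


def wer(refs: Sequence[str], hyps: Sequence[str]) -> float:
    """Word error rate. Assumption: inputs are already space-segmented."""
    return error_rate([r.split() for r in refs], [h.split() for h in hyps])
```

For example, `cer(["こんにちは"], ["こんばんは"])` returns `0.4` (two substituted characters out of five), and `wer(["今日 は 晴れ です"], ["今日 は 雨 です"])` returns `0.25` (one substituted word out of four).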