W

W2v Hf Jsut Xlsr53

Developed by qqpann
A Japanese automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 using the Common Voice and JSUT datasets.
Downloads 16
Release Time : 3/2/2022

Model Overview

This is an optimized automatic speech recognition model for Japanese, capable of converting Japanese speech into text.

Model Features

Japanese Optimization
Specifically fine-tuned for Japanese speech, improving the accuracy of Japanese speech recognition.
Multi-dataset Training
Trained using both Common Voice and JSUT Japanese datasets, enhancing the model's generalization capability.
16kHz Sampling Rate Support
Supports 16kHz sampling rate audio input, suitable for most speech recognition scenarios.

Model Capabilities

Japanese Speech Recognition
Speech-to-Text

Use Cases

Speech Transcription
Japanese Speech Transcription
Convert Japanese speech content into text
Test WER 51.72%, Test CER 24.89%
Voice Assistants
Japanese Voice Command Recognition
Recognize Japanese voice commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase