W2v Hf Commonvoice From Xlsr53 Pretrain 0329UTC1500

Developed by qqpann
A speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Common Voice Japanese dataset.
Downloads 15
Release Time: 3/2/2022

Model Overview

This is a Japanese automatic speech recognition (ASR) model, fine-tuned from the XLSR-53 pre-trained checkpoint (facebook/wav2vec2-large-xlsr-53). It expects audio input sampled at 16 kHz.

Model Features

Japanese speech recognition: Speech recognition optimized specifically for Japanese.
Based on XLSR architecture: Built on a model pre-trained with large-scale cross-lingual speech representation learning.
No language model required: Can be used directly, without an additional language model for decoding.
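
As an illustration of what decoding without a language model can look like, wav2vec2-style models are commonly trained with a CTC objective, and the simplest decoder takes the most likely token per frame, collapses repeats, and drops the blank token. The sketch below assumes that setup; the toy vocabulary, the frame IDs, and the choice of blank_id=0 are made up for illustration and are not taken from this model's actual vocabulary.

```python
from itertools import groupby


def ctc_greedy_decode(frame_ids, id_to_token, blank_id=0):
    """Collapse consecutive repeated IDs, then drop the CTC blank token."""
    # groupby merges runs of identical IDs, e.g. [1, 1, 0, 2] -> [1, 0, 2]
    collapsed = [token_id for token_id, _ in groupby(frame_ids)]
    return "".join(id_to_token[i] for i in collapsed if i != blank_id)


# Hypothetical vocabulary and per-frame argmax IDs (illustration only).
vocab = {0: "<pad>", 1: "こ", 2: "ん", 3: "に", 4: "ち", 5: "は"}
frames = [1, 1, 0, 2, 0, 3, 3, 4, 0, 0, 5]

print(ctc_greedy_decode(frames, vocab))
```

A blank between two identical IDs keeps them as separate characters, which is how repeated characters survive the collapse step.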

Model Capabilities

Japanese speech-to-text
Automatic speech recognition
16kHz audio processing
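
Because the model expects 16 kHz input, audio recorded at another rate needs to be resampled first. The following is a minimal pure-Python sketch using linear interpolation; the function name resample_linear is hypothetical, and the approach applies no anti-aliasing filter, so a dedicated audio library resampler would likely be preferable in practice.

```python
def resample_linear(samples, src_rate, dst_rate=16_000):
    """Resample a mono signal to dst_rate by linear interpolation (no filtering)."""
    if src_rate == dst_rate or not samples:
        return list(samples)
    n_out = int(len(samples) * dst_rate / src_rate)
    out = []
    for i in range(n_out):
        pos = i * src_rate / dst_rate          # fractional index into the source
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)     # clamp at the last sample
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out


# 48 samples at 48 kHz become 16 samples at 16 kHz.
downsampled = resample_linear(list(range(48)), src_rate=48_000)
print(len(downsampled))
```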

Use Cases

Speech transcription
Japanese speech transcription: Convert Japanese speech content into text. Reported word error rate: 70.18%.
Voice assistant
Japanese voice command recognition: Recognize Japanese voice commands.
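
For context on the word error rate figure above, an error rate of this kind is typically the edit distance between the reference and the hypothesis, divided by the reference length. The sketch below is a generic implementation, not necessarily how the reported figure was computed; the function names and sample strings are illustrative. Since Japanese text is not whitespace-delimited, the result depends heavily on tokenization, so the example also shows a character-level variant.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(ref_tokens, hyp_tokens):
    """Edit distance normalized by the reference length."""
    if not ref_tokens:
        raise ValueError("reference must not be empty")
    return edit_distance(ref_tokens, hyp_tokens) / len(ref_tokens)


# Word-level: tokens split on whitespace.
print(error_rate("a b c d".split(), "a x c".split()))
# Character-level: each character is a token.
print(error_rate(list("こんにちは"), list("こんにちわ")))
```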