Wav2vec2 Xls R 1b Japanese

Developed by vumichien
This model is a fine-tuned version of facebook/wav2vec2-xls-r-1b on public Japanese speech datasets, supporting automatic speech recognition tasks in Japanese.
Downloads 50
Release Time: 3/2/2022

Model Overview

An automatic speech recognition (ASR) model for Japanese, built by fine-tuning wav2vec2-xls-r-1b on Common Voice and other public Japanese speech datasets.

Model Features

High-Performance Japanese Recognition
Achieves 7.98% WER and 3.42% CER on the Common Voice 7.0 test set
Multi-dataset Training
Combines multiple Japanese speech datasets, including Common Voice, JSUT, JSSS, and CSS10
Language Model Support
Can be used with a 4-gram language model to significantly improve recognition accuracy
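
The WER and CER figures quoted above are both edit-distance ratios: the number of substitutions, deletions, and insertions needed to turn the model output into the reference, divided by the reference length in words or characters. The following is a minimal sketch of that calculation, not the evaluation script used for this model. Japanese text has no spaces, so WER depends on a word segmenter; the word lists in the usage note are hand-segmented and purely illustrative.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))  # distance from empty ref prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to empty hyp prefix
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference token
                curr[j - 1] + 1,     # insert a hypothesis token
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def error_rate(ref: Sequence[str], hyp: Sequence[str]) -> float:
    """Edit distance normalised by the reference length."""
    if len(ref) == 0:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: tokens are individual characters."""
    return error_rate(list(reference), list(hypothesis))


def wer(ref_words: Sequence[str], hyp_words: Sequence[str]) -> float:
    """Word error rate: tokens are pre-segmented words."""
    return error_rate(ref_words, hyp_words)
```

For example, `cer("今日は晴れです", "今日は雨です")` counts one substitution and one deletion against seven reference characters, and `wer(["今日", "は", "晴れ", "です"], ["今日", "は", "雨", "です"])` counts one substitution against four reference words.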

Model Capabilities

Japanese Speech Recognition
Speech-to-Text
Long Audio Processing Support
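
One way to try these capabilities is the Hugging Face `transformers` ASR pipeline, which can split long recordings into overlapping chunks. The sketch below rests on several assumptions: the Hub ID `vumichien/wav2vec2-xls-r-1b-japanese` is inferred from the developer and model names on this page, the chunk and stride lengths are illustrative rather than recommended values, and `build_asr_kwargs` and `transcribe` are hypothetical helper names. Running `transcribe` downloads the model and needs `transformers`, `torch`, and `ffmpeg` for audio decoding.

```python
from typing import Any, Dict

# Assumed Hub ID, inferred from the developer and model names on this page.
MODEL_ID = "vumichien/wav2vec2-xls-r-1b-japanese"


def build_asr_kwargs(
    model_id: str = MODEL_ID,
    chunk_length_s: float = 30.0,   # illustrative chunk size in seconds
    stride_length_s: float = 5.0,   # illustrative overlap on each side
) -> Dict[str, Any]:
    """Collect the arguments passed to transformers.pipeline."""
    # The stride is applied on both sides of a chunk, so two strides
    # must leave some non-overlapping audio in the middle.
    if 2 * stride_length_s >= chunk_length_s:
        raise ValueError("stride_length_s must be less than half of chunk_length_s")
    return {
        "task": "automatic-speech-recognition",
        "model": model_id,
        "chunk_length_s": chunk_length_s,
        "stride_length_s": stride_length_s,
    }


def transcribe(audio_path: str, **overrides: Any) -> str:
    """Transcribe one audio file and return the recognised text."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import pipeline

    asr = pipeline(**build_asr_kwargs(**overrides))
    return asr(audio_path)["text"]
```

A call such as `transcribe("sample.wav")` (a hypothetical file name) would return the transcript as a string. Chunking with overlap keeps memory use bounded on long recordings, at the cost of possible errors near chunk boundaries.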

Use Cases

Speech Transcription
Japanese Speech-to-Text
Convert Japanese speech content into text
Achieves a 7.88-7.98% word error rate (WER) on the Common Voice test set
Speech Analysis
Japanese Speech Content Analysis
Analyze Japanese speech content to extract key information