whisper-medium-jp: Open-Source Japanese Speech Recognition Model

Whisper Medium Jp

Developed by vumichien

Japanese speech recognition model based on openai/whisper-medium and fine-tuned on the common_voice_11_0 dataset

Speech Recognition

Transformers

Language: Japanese

Open Source License: Apache-2.0

Tags: #Japanese Speech Recognition #Low Word Error Rate #Multi-scenario Adaptation

Downloads 4,542

Release Time: 12/7/2022

Model Overview

This is an automatic speech recognition (ASR) model for Japanese, fine-tuned from openai/whisper-medium on the Common Voice 11.0 Japanese dataset. It converts Japanese speech into text.

Model Features

Japanese Optimization

Fine-tuned specifically for Japanese speech recognition and evaluated on Japanese test sets

Low Word Error Rate

Achieves a word error rate (WER) of only 9.04% on the Common Voice Japanese test set

Multi-dataset Validation

Performance evaluated on both Common Voice and Fleurs Japanese test sets
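The error rates quoted above are edit-distance metrics: the number of substitutions, insertions, and deletions needed to turn the model output into the reference, divided by the reference length. The following is a minimal sketch of that calculation, not the evaluation script used for this model. Japanese text has no spaces, so WER depends on how the text is split into words; the page does not say which segmentation was used, so the sketch assumes pre-tokenized lists for WER and individual characters for CER. All function names are hypothetical.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))  # distance from an empty reference
    for i, ref_tok in enumerate(ref, start=1):
        curr = [i]  # distance to an empty hypothesis
        for j, hyp_tok in enumerate(hyp, start=1):
            cost = 0 if ref_tok == hyp_tok else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def error_rate(ref: Sequence[str], hyp: Sequence[str]) -> float:
    """Edit distance as a percentage of the reference length."""
    if len(ref) == 0:
        raise ValueError("reference must not be empty")
    return 100.0 * edit_distance(ref, hyp) / len(ref)


def wer(ref_tokens: Sequence[str], hyp_tokens: Sequence[str]) -> float:
    """Word error rate over tokens (segmentation is the caller's choice)."""
    return error_rate(ref_tokens, hyp_tokens)


def cer(ref_text: str, hyp_text: str) -> float:
    """Character error rate over individual characters."""
    return error_rate(list(ref_text), list(hyp_text))


if __name__ == "__main__":
    # Assumed segmentation, for illustration only.
    reference = ["今日", "は", "晴れ", "です"]
    hypothesis = ["今日", "は", "雨", "です"]
    print(f"WER: {wer(reference, hypothesis):.2f}%")
    print(f"CER: {cer(''.join(reference), ''.join(hypothesis)):.2f}%")
```

Because the tokenizer changes the denominator, WER figures for Japanese are only comparable when the same segmentation is used on both sides; CER avoids that dependency.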

Model Capabilities

Japanese Speech Recognition

Speech-to-Text

Automatic Speech Transcription
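A minimal sketch of how these capabilities might be exercised with the Hugging Face Transformers `pipeline` API. The Hub ID `vumichien/whisper-medium-jp` is an assumption inferred from the developer and model name on this page, and the file name `meeting.wav` is a placeholder. Passing `language` through `generate_kwargs` requires a reasonably recent Transformers release, and decoding an audio file from a path requires ffmpeg to be installed.

```python
from typing import Any, Dict

# Assumed Hub ID, inferred from the developer and model name on this page.
MODEL_ID = "vumichien/whisper-medium-jp"


def build_pipeline_kwargs(model_id: str = MODEL_ID) -> Dict[str, Any]:
    """Keyword arguments for transformers.pipeline (illustrative defaults)."""
    return {
        "task": "automatic-speech-recognition",
        "model": model_id,
        # Whisper works on 30-second windows; longer audio is chunked.
        "chunk_length_s": 30,
        # Force Japanese transcription instead of language auto-detection.
        "generate_kwargs": {"language": "japanese", "task": "transcribe"},
    }


def transcribe(audio_path: str) -> str:
    """Transcribe one audio file; downloads the model on first use."""
    # Imported lazily so the module loads without transformers installed.
    from transformers import pipeline

    asr = pipeline(**build_pipeline_kwargs())
    return asr(audio_path)["text"]


if __name__ == "__main__":
    print(transcribe("meeting.wav"))  # placeholder file name
```

The pipeline is built inside `transcribe` for brevity; a long-running service would normally construct it once and reuse it across requests.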

Use Cases

Speech Transcription

Japanese Meeting Minutes

Automatically convert Japanese meeting recordings into text transcripts

Approximately 90% accuracy, extrapolated from the 9.04% WER on Common Voice; real meeting audio may score differently

Japanese Podcast Transcription

Transcribe Japanese podcast content into text

Voice Assistants

Japanese Voice Command Recognition

Used for command recognition systems in Japanese voice assistants

Property	Details
Model Type	Whisper Medium Japanese
Training Data	mozilla-foundation/common_voice_11_0
Metrics	WER, CER

Training Loss	Epoch	Step	Validation Loss	WER (%)
0.0392	3.03	1000	0.2023	10.1807
0.0036	7.01	2000	0.2478	9.4409
0.0013	10.04	3000	0.2791	9.1014
0.0002	14.01	4000	0.2970	9.0625
0.0002	17.04	5000	0.3029	9.0355
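The table above shows validation loss rising after step 1000 while WER keeps falling, so the choice of checkpoint depends on which metric is used for selection. The sketch below parses the rows and reports the best checkpoint under each criterion. It is an illustration only: the page does not state how the released checkpoint was chosen, and the names `Checkpoint` and `parse_log` are hypothetical.

```python
from typing import List, NamedTuple


class Checkpoint(NamedTuple):
    train_loss: float
    epoch: float
    step: int
    val_loss: float
    wer: float  # percent


# Rows copied from the training results table above.
RAW_LOG = """
0.0392 3.03 1000 0.2023 10.1807
0.0036 7.01 2000 0.2478 9.4409
0.0013 10.04 3000 0.2791 9.1014
0.0002 14.01 4000 0.2970 9.0625
0.0002 17.04 5000 0.3029 9.0355
"""


def parse_log(raw: str) -> List[Checkpoint]:
    """Parse whitespace-separated rows into Checkpoint records."""
    rows = []
    for line in raw.strip().splitlines():
        train_loss, epoch, step, val_loss, wer = line.split()
        rows.append(Checkpoint(float(train_loss), float(epoch), int(step),
                               float(val_loss), float(wer)))
    return rows


checkpoints = parse_log(RAW_LOG)
best_by_wer = min(checkpoints, key=lambda c: c.wer)
best_by_val_loss = min(checkpoints, key=lambda c: c.val_loss)

print(f"lowest WER:      step {best_by_wer.step} ({best_by_wer.wer:.2f}%)")
print(f"lowest val loss: step {best_by_val_loss.step} ({best_by_val_loss.val_loss:.4f})")
```

Selecting by WER picks step 5000 and selecting by validation loss picks step 1000. The final WER of 9.0355 matches the 9.04% figure quoted in the model features, which suggests the last checkpoint is the one reported.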
