Kotoba-whisper-v2.0 Open-source Japanese Speech Recognition Model - Free Deployment, Inference Speed Increased by 6.3 Times

Kotoba Whisper V2.0

Developed by kotoba-tech

Kotoba-Whisper is a Japanese automatic speech recognition distilled model developed by Asahi Ushio in collaboration with Kotoba Technologies, based on Whisper large-v3 distillation, achieving a 6.3x inference speed improvement.

Speech Recognition

Transformers

JapaneseOpen Source License:Apache-2.0 #Japanese speech recognition #Efficient distilled model #Low-latency inference

Downloads 8,108

Release Time : 9/17/2024

Model Overview

Japanese automatic speech recognition model optimized through knowledge distillation technology from Whisper large-v3, significantly improving inference speed while maintaining comparable error rates.

Model Features

Efficient inference

6.3x faster inference speed compared to the original Whisper large-v3

High performance

Superior CER/WER on Japanese datasets like ReazonSpeech compared to the original model

Large-scale training

Trained on over 7.2 million Japanese speech-text pairs

Model Capabilities

Japanese speech-to-text

Long audio segmentation processing

Supports Flash Attention 2 acceleration

Use Cases

Speech transcription

TV program subtitle generation

Process Japanese TV program audio to generate accurate subtitles

CER 11.6/WER 55.6 on ReazonSpeech test set

Voice assistant

Provides fast and accurate speech recognition for Japanese voice assistants

model	CommonVoice 8 (Japanese test set)	JSUT Basic 5000	ReazonSpeech (held out test set)
kotoba-tech/kotoba-whisper-v2.0	9.2	8.4	11.6
kotoba-tech/kotoba-whisper-v1.0	9.4	8.5	12.2
openai/whisper-large-v3	8.5	7.1	14.9
openai/whisper-large-v2	9.7	8.2	28.1
openai/whisper-large	10	8.9	34.1
openai/whisper-medium	11.5	10	33.2
openai/whisper-base	28.6	24.9	70.4
openai/whisper-small	15.1	14.2	41.5
openai/whisper-tiny	53.7	36.5	137.9

model	CommonVoice 8 (Japanese test set)	JSUT Basic 5000	ReazonSpeech (held out test set)
kotoba-tech/kotoba-whisper-v2.0	58.8	63.7	55.6
kotoba-tech/kotoba-whisper-v1.0	59.2	64.3	56.4
openai/whisper-large-v3	55.1	59.2	60.2
openai/whisper-large-v2	59.3	63.2	74.1
openai/whisper-large	61.1	66.4	74.9
openai/whisper-medium	63.4	69.5	76
openai/whisper-base	65.5	72.6	80.1
openai/whisper-small	70.2	77.3	90.5
openai/whisper-tiny	90.8	92.4	150.3

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Kotoba Whisper V2.0

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Kotoba-Whisper (v2.0)

✨ Features

📚 Documentation

Model Training

Evaluation Results

CER

WER

📄 License

Additional Information

Tags

Pipeline Tag

Metrics

Datasets

Widget Examples