K

Kotoba Whisper Bilingual V1.0

Developed by kotoba-tech
Kotoba-Whisper-Bilingual is a distilled model collection trained from the Whisper model, specifically designed for Japanese and English speech recognition and speech-to-text translation tasks.
Downloads 782
Release Time : 9/27/2024

Model Overview

This model supports automatic speech recognition (ASR) for Japanese and English, as well as speech-to-text translation tasks between Japanese and English.

Model Features

Bilingual Support
Supports both Japanese and English speech recognition and mutual translation
Efficient Inference
6.3 times faster than the original Whisper large-v3 model
Multitask Capability
Can perform both speech recognition and speech-to-text translation tasks simultaneously

Model Capabilities

Japanese Speech Recognition
English Speech Recognition
Japanese-to-English Speech Translation
English-to-Japanese Speech Translation

Use Cases

Speech Recognition
Japanese Speech Transcription
Convert Japanese speech into text
CER of 9.8 on the CommonVoice 8 Japanese test set
English Speech Transcription
Convert English speech into text
Performs well on the ESB dataset
Speech Translation
Japanese-to-English Translation
Real-time translation of Japanese speech into English text
WER of 73.9 on CoVoST2 (Ja->En)
English-to-Japanese Translation
Real-time translation of English speech into Japanese text
CER of 69.1 on CoVoST2 (En->Ja)
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase