Multilingual-distilwhisper-28k Open-source Multilingual Automatic Speech Recognition Model - Freely Improve Recognition Performance of Target Languages

Multilingual Distilwhisper 28k

Developed by naver

An improved multilingual automatic speech recognition model based on whisper-small, enhancing target language performance through CLSR module and knowledge distillation

Speech Recognition

Transformers

OtherOpen Source License:MIT #Multilingual speech recognition #Lightweight CLSR module #Knowledge distillation optimization

Downloads 47

Release Time : 11/30/2023

Model Overview

This model adds a lightweight CLSR module to whisper-small and employs a hybrid training approach combining cross-entropy and knowledge distillation, significantly improving automatic speech recognition accuracy for Catalan, Tamil, and Thai.

Model Features

Multilingual optimization

Specifically optimized for Catalan, Tamil, and Thai, significantly improving recognition accuracy for these languages

Knowledge distillation

Uses whisper-large-v2 as teacher model for knowledge distillation, retaining large model performance while reducing model size

Lightweight CLSR module

Added lightweight module effectively enhances target language performance while maintaining model efficiency

Model Capabilities

Automatic speech recognition

Multilingual speech-to-text

Language-specific optimization

Use Cases

Speech transcription

Multilingual meeting minutes

Convert meeting recordings in Catalan, Tamil, or Thai into text transcripts

Higher accuracy compared to original whisper-small

Voice assistant

Develop voice assistant applications for target language regions

Educational technology

Language learning applications

Used for pronunciation evaluation and transcription features in language learning apps

Property	Details
Model Type	Multilingual Distilwhisper
Training Data	mozilla - foundation/common_voice_13_0
Supported Languages	ca, ta, th
Tags	automatic - speech - recognition
Pipeline Tag	automatic - speech - recognition
Inference	false

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Multilingual Distilwhisper 28k

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Multilingual Distilwhisper

🚀 Quick Start

✨ Features

📄 License

📚 Documentation

Citation