M

Multilingual Distilwhisper 28k

Developed by naver
An improved multilingual automatic speech recognition model based on whisper-small, enhancing target language performance through CLSR module and knowledge distillation
Downloads 47
Release Time : 11/30/2023

Model Overview

This model adds a lightweight CLSR module to whisper-small and employs a hybrid training approach combining cross-entropy and knowledge distillation, significantly improving automatic speech recognition accuracy for Catalan, Tamil, and Thai.

Model Features

Multilingual optimization
Specifically optimized for Catalan, Tamil, and Thai, significantly improving recognition accuracy for these languages
Knowledge distillation
Uses whisper-large-v2 as teacher model for knowledge distillation, retaining large model performance while reducing model size
Lightweight CLSR module
Added lightweight module effectively enhances target language performance while maintaining model efficiency

Model Capabilities

Automatic speech recognition
Multilingual speech-to-text
Language-specific optimization

Use Cases

Speech transcription
Multilingual meeting minutes
Convert meeting recordings in Catalan, Tamil, or Thai into text transcripts
Higher accuracy compared to original whisper-small
Voice assistant
Develop voice assistant applications for target language regions
Educational technology
Language learning applications
Used for pronunciation evaluation and transcription features in language learning apps
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase