Whisper Tiny Vi

Developed by doof-ferb
A Vietnamese automatic speech recognition (ASR) model fine-tuned from OpenAI's Whisper-tiny architecture and evaluated on multiple Vietnamese datasets (18.7% WER on the VIVOS test set, 26.6% WER on the Common Voice test set).
Downloads 44
Release Time: 2/20/2024

Model Overview

This model is optimized for Vietnamese speech recognition. Extensive fine-tuning on Vietnamese speech data significantly improves on the accuracy of the original Whisper-tiny model for Vietnamese.

Model Features

Vietnamese optimization
Fine-tuned specifically for the characteristics of Vietnamese speech, significantly reducing word error rate (WER) compared to the original model.
Multi-dataset training
Trained on 10 different Vietnamese speech datasets covering a variety of speech scenarios.
Lightweight
Based on the Whisper-tiny architecture, making it suitable for deployment in resource-constrained environments.
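
The WER figures quoted on this page are word error rates: the number of word substitutions, deletions, and insertions needed to turn the model output into the reference transcript, divided by the number of reference words. The following is a minimal sketch of that calculation in plain Python. The function name word_error_rate is hypothetical, and the sketch splits on whitespace only. Published evaluations usually normalize case and punctuation first, so numbers from this sketch are not directly comparable to the figures reported here.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# One wrong word out of five reference words.
print(word_error_rate("tôi đang học tiếng việt", "tôi đang học tiếng anh"))
```

A WER of 18.7% therefore corresponds to roughly one word-level error for every five to six reference words.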

Model Capabilities

Vietnamese speech-to-text
Long audio transcription
Real-time speech recognition
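
The capabilities above could be exercised through the Hugging Face transformers ASR pipeline, sketched below. Several things here are assumptions rather than facts stated on this page: the Hub identifier doof-ferb/whisper-tiny-vi, the helper names build_asr and transcribe, and the choice of 30-second chunking for long audio. Running it for real also assumes that transformers, a PyTorch backend, and ffmpeg are installed.

```python
from typing import Callable, Optional

# Assumed Hub identifier; this page names the developer and model but not the repo ID.
MODEL_ID = "doof-ferb/whisper-tiny-vi"


def build_asr(model_id: str = MODEL_ID):
    """Create a speech recognition pipeline that processes audio in 30 s chunks."""
    # Imported lazily so the rest of this module loads without transformers installed.
    from transformers import pipeline

    return pipeline(
        "automatic-speech-recognition",
        model=model_id,
        chunk_length_s=30,  # lets the pipeline handle audio longer than one Whisper window
    )


def transcribe(audio_path: str, asr: Optional[Callable] = None) -> str:
    """Return the transcript for one audio file, building the pipeline if none is given."""
    if asr is None:
        asr = build_asr()
    result = asr(audio_path)
    return result["text"].strip()
```

Calling transcribe("sample.wav") downloads the model on first use. For repeated calls, build the pipeline once with build_asr() and pass it in as the asr argument.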

Use Cases

Speech transcription
Vietnamese video subtitle generation
Automatically generates subtitles for Vietnamese video content.
Achieves 18.7% WER on the VIVOS test set.
Voice assistant
Builds Vietnamese voice interaction systems.
Achieves 26.6% WER on the Common Voice test set.
Education
Language learning tool
Helps learners practice Vietnamese pronunciation and listening.
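
For the subtitle-generation use case listed above, timestamped transcript segments can be written out in the SubRip (SRT) format. The sketch below assumes segments shaped like the "chunks" list that the transformers ASR pipeline returns when called with return_timestamps=True: dictionaries with a "timestamp" pair of (start, end) seconds and a "text" string. The helper names srt_timestamp and chunks_to_srt are hypothetical, and the sketch assumes every segment has both a start and an end time.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as HH:MM:SS,mmm, the form SRT expects."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def chunks_to_srt(chunks) -> str:
    """Turn a list of {"timestamp": (start, end), "text": ...} dicts into SRT text."""
    blocks = []
    for index, chunk in enumerate(chunks, start=1):
        start, end = chunk["timestamp"]  # assumes both values are present
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(start)} --> {srt_timestamp(end)}\n"
            f"{chunk['text'].strip()}\n"
        )
    # A blank line separates consecutive subtitle entries.
    return "\n".join(blocks)


example = [
    {"timestamp": (0.0, 2.5), "text": " xin chào"},
    {"timestamp": (2.5, 5.0), "text": " cảm ơn bạn"},
]
print(chunks_to_srt(example))
```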