Phowhisper Small
PhoWhisper is a system specifically designed for Vietnamese automatic speech recognition, fine-tuned based on the Whisper model, supporting various Vietnamese accents.
Downloads 2,725
Release Time : 2/18/2024
Model Overview
PhoWhisper is a Vietnamese automatic speech recognition system that achieves state-of-the-art performance on Vietnamese ASR benchmark datasets by fine-tuning the multilingual Whisper model with an 844-hour Vietnamese dataset.
Model Features
Multi-accent support
PhoWhisper is fine-tuned using an 844-hour dataset covering various Vietnamese accents, enabling it to recognize different Vietnamese accents.
High performance
Achieves state-of-the-art performance on Vietnamese ASR benchmark datasets.
Based on Whisper model
Fine-tuned on the multilingual Whisper model, inheriting its robustness and multilingual capabilities.
Model Capabilities
Vietnamese speech recognition
Multi-accent speech recognition
Use Cases
Speech-to-text
Vietnamese speech transcription
Convert Vietnamese speech content into text, suitable for scenarios such as meeting minutes and voice notes.
High-accuracy Vietnamese speech recognition
Voice assistant
Vietnamese voice assistant
Provides voice interaction functionality for Vietnamese users, supporting multiple accents.
Enhances the Vietnamese recognition capability of voice assistants
Featured Recommended AI Models