Phowhisper Large
PhoWhisper is a system specifically designed for Vietnamese automatic speech recognition, fine-tuned based on the Whisper model, supporting various Vietnamese accents.
Downloads 2,373
Release Time : 12/19/2023
Model Overview
PhoWhisper is a Vietnamese automatic speech recognition system, achieved by fine-tuning the multilingual Whisper model using an 844-hour Vietnamese dataset, featuring robustness and high accuracy.
Model Features
Multi-accent support
Fine-tuned using an 844-hour dataset covering various Vietnamese accents, adapting to pronunciation characteristics from different regions.
High performance
Achieves state-of-the-art performance on Vietnamese ASR benchmark datasets.
Based on Whisper model
Fine-tuned on the multilingual Whisper model, inheriting Whisper's robustness and accuracy.
Model Capabilities
Vietnamese speech recognition
Multi-accent adaptation
Use Cases
Speech-to-text
Vietnamese meeting transcription
Automatically convert Vietnamese meeting recordings into text transcripts.
Highly accurate text output
Voice assistant
Used as the speech recognition module for Vietnamese voice assistants.
Improves the recognition accuracy of voice assistants
Featured Recommended AI Models
Š 2025AIbase