Erax WoW Turbo V1.1

Developed by erax-ai
A Whisper Large-v3 Turbo speech recognition model optimized for Vietnamese that also supports multiple other languages, with ultra-fast response and high accuracy.
Downloads 666
Release Date: 3/30/2025

Model Overview

A speech recognition model based on Whisper Large-v3 Turbo, localized specifically for Vietnamese while supporting 11 languages. It is suited to scenarios such as real-time transcription.

Model Features

Ultra-fast Response
Processes 30 seconds of audio in approximately 350 milliseconds, ideal for real-time transcription
Multilingual Support
Supports 11 languages, including all 8 regional accents of Vietnamese
High Accuracy
Word Error Rate (WER) of about 12% for major languages, capable of recognizing various accents
Large-scale Training
Trained on a dataset of 600,000 samples (approximately 1,000 hours) of real-world audio
Open Source and Free
Released under MIT license with no usage restrictions
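
To make the figures above concrete, the following minimal Python sketch derives the real-time speedup implied by the quoted latency and the average sample length implied by the dataset size, and shows how a word error rate is conventionally computed (word-level edit distance divided by the number of reference words). The function names and the example transcripts are illustrative assumptions, not part of the model's tooling, and the listing does not say how its own WER figure was measured.

```python
def speedup(audio_seconds: float, processing_seconds: float) -> float:
    """How many times faster than real time the audio is processed."""
    return audio_seconds / processing_seconds


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words seen so far
    # (one row behind the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


# Figures quoted in the listing: 30 s of audio in roughly 350 ms.
print(round(speedup(30.0, 0.350), 1))
# 1,000 hours spread over 600,000 samples: average sample length in seconds.
print(1000 * 3600 / 600_000)
# Hypothetical transcripts with one substituted word out of four.
print(word_error_rate("turn on the lights", "turn on the light"))
```

On the quoted numbers this works out to roughly 86 times faster than real time and an average sample length of about 6 seconds. A WER of about 12% corresponds to roughly one word error in every eight reference words.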

Model Capabilities

Speech recognition
Real-time transcription
Multilingual processing
Accent recognition

Use Cases

Real-time Transcription
Meeting Minutes
Real-time transcription of meeting content
Near-real-time text generation
Live Captioning
Generating instant subtitles for live events
Low-latency subtitle output
Voice Assistants
Voice-controlled Applications
Developing responsive voice control interfaces
High-accuracy voice command recognition
Accessibility Tools
Hearing Assistance
Providing speech-to-text services for the hearing impaired
Real-time speech-to-text conversion
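
The real-time use cases above generally depend on feeding audio to the model in short windows. As a rough sketch, assuming 16 kHz mono input (the sample rate Whisper models expect) and fixed-length, non-overlapping windows, audio could be split as follows. `split_into_windows` and the 5-second window length are hypothetical choices for illustration; the listing does not describe how the model is deployed for streaming.

```python
from typing import Iterator, List, Sequence

SAMPLE_RATE = 16_000  # Whisper models expect 16 kHz audio


def split_into_windows(samples: Sequence[float],
                       window_seconds: float = 5.0,
                       sample_rate: int = SAMPLE_RATE) -> Iterator[List[float]]:
    """Yield consecutive, non-overlapping windows; the last one may be shorter."""
    window_size = int(window_seconds * sample_rate)
    if window_size <= 0:
        raise ValueError("window_seconds must be positive")
    for start in range(0, len(samples), window_size):
        yield list(samples[start:start + window_size])


# 12 seconds of silence as stand-in audio.
audio = [0.0] * (12 * SAMPLE_RATE)
windows = list(split_into_windows(audio))
print([len(w) / SAMPLE_RATE for w in windows])
```

Each window would then be passed to whatever transcription call the application uses. Shorter windows reduce latency but give the model less context per call, so the window length is a trade-off to tune per use case.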