E

Erax WoW Turbo V1.1 CT2

Developed by erax-ai
A localized Vietnamese-enhanced version of Whisper Large-v3 Turbo optimized with CTranslate2, supporting multilingual speech recognition with high speed and accuracy
Downloads 1,283
Release Time : 3/31/2025

Model Overview

This is an optimized speech-to-text model based on the Whisper Large-v3 Turbo architecture, specially enhanced for Vietnamese while supporting multiple languages. The model is optimized with CTranslate2, providing ultra-fast transcription capabilities.

Model Features

Ultra-fast transcription
Processes 30 seconds of audio in approximately 350ms, supporting real-time transcription
Multilingual support
Supports 11 languages, with special optimization for 8 Vietnamese regional accents
High accuracy
Achieves a word error rate (WER) of about 12% for major languages, capable of handling various accents
CTranslate2 optimization
Achieves 2.5x speedup through CTranslate2 library, suitable for low-latency applications

Model Capabilities

Speech-to-text
Multilingual recognition
Real-time transcription
Accent adaptation

Use Cases

Real-time transcription
Meeting minutes
Real-time transcription of meeting content
Near real-time text records
Interview records
Automatically transcribe interview audio
Fast and accurate interview records
Accessibility tools
Hearing assistance
Provides real-time captions for hearing-impaired individuals
Improved communication accessibility
Media production
Video subtitles
Automatically generate subtitles for videos
Fast and accurate subtitle generation
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase