Whisper Large V3
Whisper-large-v3 is an open-source automatic speech recognition (ASR) model by OpenAI, supporting speech-to-text tasks in multiple languages.
Downloads 1,443
Release Time : 11/7/2023
Model Overview
Whisper-large-v3 is a powerful automatic speech recognition model that can convert speech into text and supports multiple languages. This project converts it to the ONNX format for running in a Web environment via the transformers.js library.
Model Features
Web compatibility
Adapted to transformers.js through ONNX conversion and can run directly in a Web environment
Multilingual support
Can recognize and transcribe speech in multiple languages
High accuracy
Performs excellently in automatic speech recognition tasks
Model Capabilities
Speech to text
Multilingual speech recognition
Real-time speech transcription
Use Cases
Speech transcription
Meeting minutes
Automatically convert meeting recordings into text records
Improve the efficiency of meeting minutes and facilitate later retrieval
Subtitle generation
Automatically generate subtitles for video content
Enhance the accessibility of video content
Voice assistant
Voice input
Add voice input functionality to Web applications
Enhance user experience and support barrier-free access
Featured Recommended AI Models