Whisper-large-v3 Open-source Speech Recognition Model - Free Support for Multilingual Speech-to-Text

Whisper Large V3

Developed by Xenova

Whisper-large-v3 is an open-source automatic speech recognition (ASR) model by OpenAI, supporting speech-to-text tasks in multiple languages.

Speech Recognition

Transformers

#Speech to text #Multilingual support #Web deployment

Downloads 1,443

Release Time : 11/7/2023

Model Overview

Whisper-large-v3 is a powerful automatic speech recognition model that can convert speech into text and supports multiple languages. This project converts it to the ONNX format for running in a Web environment via the transformers.js library.

Model Features

Web compatibility

Adapted to transformers.js through ONNX conversion and can run directly in a Web environment

Multilingual support

Can recognize and transcribe speech in multiple languages

High accuracy

Performs excellently in automatic speech recognition tasks

Model Capabilities

Speech to text

Multilingual speech recognition

Real-time speech transcription

Use Cases

Speech transcription

Meeting minutes

Automatically convert meeting recordings into text records

Improve the efficiency of meeting minutes and facilitate later retrieval

Subtitle generation

Automatically generate subtitles for video content

Enhance the accessibility of video content

Voice assistant

Voice input

Add voice input functionality to Web applications

Enhance user experience and support barrier-free access

Property	Details
Base Model	openai/whisper-large-v3
Library Name	transformers.js

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Whisper Large V3

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Whisper ONNX for Transformers.js

🚀 Quick Start