Wav2vec2 Base 960h
An ONNX conversion of Facebook's wav2vec2-base-960h model, packaged for Transformers.js to enable speech recognition directly in the browser.
Downloads 117
Release Time: 7/26/2023
Model Overview
An automatic speech recognition (ASR) model that converts English speech audio into text.
Model Features
Browser Compatibility
The ONNX format lets the model run directly in the browser, with no server-side processing required.
Lightweight
The base-size variant of the model is suitable for deployment in resource-constrained environments.
High Accuracy
Trained on 960 hours of English speech (the LibriSpeech corpus, sampled at 16 kHz), it achieves good recognition accuracy on clearly spoken English.
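Because wav2vec2-base-960h was trained on 16 kHz audio, browser recordings (commonly 44.1 or 48 kHz, often stereo) generally need to be downmixed and resampled before transcription. The following is a minimal sketch of that preprocessing step, not part of the model or of Transformers.js; the function names `downmixToMono` and `resampleLinear` are hypothetical, and linear interpolation is a simple stand-in for a proper resampler.

```typescript
/** Target sample rate: wav2vec2-base-960h was trained on 16 kHz audio. */
const TARGET_RATE = 16000;

/**
 * Average several channels into one mono channel.
 * Assumes every channel has the same length as the first one.
 */
function downmixToMono(channels: Float32Array[]): Float32Array {
  const first = channels[0];
  if (first === undefined) return new Float32Array(0);
  const mono = new Float32Array(first.length);
  for (const channel of channels) {
    for (let i = 0; i < mono.length; i++) {
      mono[i] += channel[i] / channels.length;
    }
  }
  return mono;
}

/**
 * Resample by linear interpolation between neighbouring samples.
 * Illustrative only: no low-pass filter is applied, so downsampling
 * may introduce aliasing.
 */
function resampleLinear(
  input: Float32Array,
  fromRate: number,
  toRate: number = TARGET_RATE,
): Float32Array {
  if (fromRate === toRate || input.length === 0) return input.slice();
  const outLength = Math.floor((input.length * toRate) / fromRate);
  const out = new Float32Array(outLength);
  const step = fromRate / toRate; // input samples advanced per output sample
  for (let i = 0; i < outLength; i++) {
    const pos = i * step;
    const left = Math.floor(pos);
    const right = Math.min(left + 1, input.length - 1);
    const frac = pos - left;
    out[i] = input[left] * (1 - frac) + input[right] * frac;
  }
  return out;
}
```

In a browser, the per-channel arrays would typically come from a decoded `AudioBuffer` via `getChannelData`. Where audio quality matters, a filtered resampler is likely a better choice than this sketch.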
Model Capabilities
English Speech Recognition
Real-time Audio Transcription
Browser-side Speech Processing
Use Cases
Speech Transcription
Automated Meeting Minutes
Automatically convert meeting recordings into text transcripts
In this scenario, transcription accuracy can exceed 90%.
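The page does not say how the accuracy figure above was measured. A common convention for ASR is to report word error rate (WER) and treat accuracy as roughly 1 minus WER. The sketch below computes WER as the word-level edit distance divided by the reference length; `wordErrorRate` is a hypothetical helper, and the lowercase, whitespace-only normalization is an assumption.

```typescript
/**
 * Word error rate: (substitutions + deletions + insertions) divided by
 * the number of words in the reference, computed with a word-level
 * Levenshtein distance. Comparison is case-insensitive.
 */
function wordErrorRate(reference: string, hypothesis: string): number {
  const tokenize = (s: string): string[] =>
    s.toLowerCase().trim().split(/\s+/).filter((w) => w.length > 0);

  const ref = tokenize(reference);
  const hyp = tokenize(hypothesis);
  if (ref.length === 0) {
    throw new RangeError("WER is undefined for an empty reference");
  }

  // prev[j] holds the distance between the first i-1 reference words
  // and the first j hypothesis words.
  let prev: number[] = Array.from({ length: hyp.length + 1 }, (_, j) => j);
  for (let i = 1; i <= ref.length; i++) {
    const curr: number[] = [i];
    for (let j = 1; j <= hyp.length; j++) {
      const cost = ref[i - 1] === hyp[j - 1] ? 0 : 1;
      curr[j] = Math.min(
        prev[j] + 1, // deletion
        curr[j - 1] + 1, // insertion
        prev[j - 1] + cost, // substitution or match
      );
    }
    prev = curr;
  }
  return prev[hyp.length] / ref.length;
}
```

Under this definition, one wrong word in a five-word reference gives a WER of 0.2, which would correspond to 80% word accuracy.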
Voice Control Applications
Add voice control functionality to web applications
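One way to build voice control on top of a transcript is to match it against a small table of known phrases. The sketch below assumes that approach; the command table, action names, and the functions `normalizeTranscript` and `matchCommand` are all hypothetical. The normalization step is there because wav2vec2-base-960h emits upper-case text without punctuation.

```typescript
/** A spoken phrase and the application action it should trigger. */
type Command = { phrase: string; action: string };

// Hypothetical command table; phrases are lower-case, single-spaced.
const COMMANDS: Command[] = [
  { phrase: "scroll down", action: "scrollDown" },
  { phrase: "scroll up", action: "scrollUp" },
  { phrase: "go back", action: "navigateBack" },
];

/** Lower-case the transcript and collapse anything but letters and apostrophes into single spaces. */
function normalizeTranscript(text: string): string {
  return text
    .toLowerCase()
    .replace(/[^a-z' ]+/g, " ")
    .replace(/\s+/g, " ")
    .trim();
}

/**
 * Return the action of the first command whose phrase occurs in the
 * transcript as whole words, or null when nothing matches.
 */
function matchCommand(
  transcript: string,
  commands: Command[] = COMMANDS,
): string | null {
  // Padding with spaces makes the substring test respect word boundaries.
  const padded = ` ${normalizeTranscript(transcript)} `;
  for (const { phrase, action } of commands) {
    if (padded.includes(` ${phrase} `)) return action;
  }
  return null;
}
```

Exact phrase matching is deliberately strict. An application that must tolerate recognition errors might instead compare phrases by edit distance.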
Assistive Tools
Real-time Caption Generation
Generate real-time captions for video or live streaming content
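Near-real-time captions are often produced by cutting the incoming audio into short windows and transcribing each one in turn. The following sketch illustrates that idea under stated assumptions: the `Transcriber` type is a hypothetical wrapper around whatever ASR call the application uses, and the 5-second default window is an arbitrary choice, not a recommendation from the model authors.

```typescript
/** Hypothetical ASR callback: takes 16 kHz mono samples, resolves to text. */
type Transcriber = (audio: Float32Array) => Promise<{ text: string }>;

interface AudioWindow {
  startSeconds: number;
  audio: Float32Array;
}

interface Caption {
  startSeconds: number;
  endSeconds: number;
  text: string;
}

/** Cut a sample buffer into consecutive, non-overlapping windows. */
function splitIntoWindows(
  samples: Float32Array,
  sampleRate: number,
  windowSeconds: number,
): AudioWindow[] {
  const windowSize = Math.floor(windowSeconds * sampleRate);
  if (windowSize <= 0) {
    throw new RangeError("window must contain at least one sample");
  }
  const windows: AudioWindow[] = [];
  for (let offset = 0; offset < samples.length; offset += windowSize) {
    const end = Math.min(offset + windowSize, samples.length);
    // subarray creates a view on the same memory, so nothing is copied.
    windows.push({
      startSeconds: offset / sampleRate,
      audio: samples.subarray(offset, end),
    });
  }
  return windows;
}

/** Transcribe each window in order and attach start and end timestamps. */
async function captionAudio(
  samples: Float32Array,
  sampleRate: number,
  transcribe: Transcriber,
  windowSeconds: number = 5,
): Promise<Caption[]> {
  const captions: Caption[] = [];
  for (const w of splitIntoWindows(samples, sampleRate, windowSeconds)) {
    const { text } = await transcribe(w.audio);
    captions.push({
      startSeconds: w.startSeconds,
      endSeconds: w.startSeconds + w.audio.length / sampleRate,
      text,
    });
  }
  return captions;
}
```

With Transformers.js, the `transcribe` argument would presumably wrap a pipeline created with `pipeline('automatic-speech-recognition', ...)`; the exact model identifier is not given on this page. Fixed windows can split a word across a boundary, so a production captioner would likely add overlap or cut at pauses in the speech.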