Wav2vec2 Base 960h

Developed by Xenova
An ONNX conversion of Facebook's wav2vec2-base-960h model, built for Transformers.js to support speech recognition in the browser.
Downloads: 117
Release Time: 7/26/2023

Model Overview

This is an automatic speech recognition (ASR) model that converts audio input into text. It is suited to English speech transcription.
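As a rough sketch of how a model like this is typically called: Transformers.js exposes a `pipeline` function, and its `automatic-speech-recognition` task resolves to an object with a `text` field. The helper below takes the pipeline factory as a parameter so the snippet stays self-contained. The helper name `transcribeWith`, the type aliases, and the model id `Xenova/wav2vec2-base-960h` are assumptions made for illustration and are not stated on this page.

```typescript
// Minimal shapes of the objects involved. These aliases are illustrative,
// not types exported by Transformers.js.
type AsrResult = { text: string };
type Transcriber = (audio: Float32Array | string) => Promise<AsrResult>;
type PipelineFactory = (task: string, model: string) => Promise<Transcriber>;

// Assumed Hugging Face model id for this ONNX conversion.
const MODEL_ID = "Xenova/wav2vec2-base-960h";

// Loads the ASR pipeline through the supplied factory and transcribes one clip.
// `audio` is either a URL or raw mono samples.
async function transcribeWith(
  createPipeline: PipelineFactory,
  audio: Float32Array | string,
): Promise<string> {
  const transcriber = await createPipeline("automatic-speech-recognition", MODEL_ID);
  const result = await transcriber(audio);
  return result.text.trim();
}
```

In an application, the factory passed in would be the `pipeline` export of `@xenova/transformers`, possibly with a type cast, since the library's own typings are broader than the aliases above.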

Model Features

Browser Compatibility
The ONNX format lets the model run directly in the browser, with no server-side processing.
Lightweight
The base-size model is suited to deployment in resource-constrained environments.
High Accuracy
Trained on 960 hours of English speech data, it achieves good recognition accuracy on English audio.

Model Capabilities

English Speech Recognition
Real-time Audio Transcription
Browser-side Speech Processing
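For browser-side processing, note that the original wav2vec2-base-960h model expects mono audio sampled at 16 kHz, while microphones and media files often deliver 44.1 or 48 kHz stereo. The sketch below shows one simple way to convert, assuming plain linear interpolation with no anti-aliasing filter. The function names are hypothetical, and a production app may prefer the browser's own audio resampling.

```typescript
const TARGET_RATE = 16000; // sample rate expected by wav2vec2-base-960h

// Average two channels into a single mono track.
function downmixToMono(left: Float32Array, right: Float32Array): Float32Array {
  const n = Math.min(left.length, right.length);
  const mono = new Float32Array(n);
  for (let i = 0; i < n; i++) {
    mono[i] = (left[i] + right[i]) / 2;
  }
  return mono;
}

// Resample by linear interpolation between neighbouring input samples.
function resampleLinear(
  input: Float32Array,
  fromRate: number,
  toRate: number = TARGET_RATE,
): Float32Array {
  if (fromRate === toRate) return input.slice();
  const outLength = Math.floor((input.length * toRate) / fromRate);
  const out = new Float32Array(outLength);
  const step = fromRate / toRate; // input samples advanced per output sample
  for (let i = 0; i < outLength; i++) {
    const pos = i * step;
    const i0 = Math.floor(pos);
    const i1 = Math.min(i0 + 1, input.length - 1); // clamp at the last sample
    const frac = pos - i0;
    out[i] = input[i0] * (1 - frac) + input[i1] * frac;
  }
  return out;
}
```

One second of 48 kHz stereo would pass through `downmixToMono` and then `resampleLinear(mono, 48000)` to produce 16,000 samples ready for transcription.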

Use Cases

Speech Transcription
Automated Meeting Minutes
Automatically convert meeting recordings into text transcripts.
Example: transcription accuracy can exceed 90%.
Voice Control Applications
Add voice control to web applications.
Assistive Tools
Real-time Caption Generation
Generate real-time captions for video or live-streamed content.
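The accuracy figure quoted for meeting transcription is not defined on this page. Speech recognition quality is commonly reported as word error rate (WER), so one way to check such a claim on your own recordings is sketched below. It assumes accuracy is read as 1 minus WER and that words are compared case-insensitively after splitting on whitespace. The function name is hypothetical.

```typescript
// Word error rate: word-level edit distance divided by the reference length.
function wordErrorRate(reference: string, hypothesis: string): number {
  const ref = reference.toLowerCase().split(/\s+/).filter(Boolean);
  const hyp = hypothesis.toLowerCase().split(/\s+/).filter(Boolean);
  // Convention chosen here: an empty reference scores 0 only if the hypothesis is empty too.
  if (ref.length === 0) return hyp.length === 0 ? 0 : 1;

  // prev[j] = edit distance between the reference words seen so far
  // and the first j hypothesis words.
  let prev: number[] = Array.from({ length: hyp.length + 1 }, (_, j) => j);
  for (let i = 1; i <= ref.length; i++) {
    const curr: number[] = [i];
    for (let j = 1; j <= hyp.length; j++) {
      const cost = ref[i - 1] === hyp[j - 1] ? 0 : 1;
      curr.push(
        Math.min(
          prev[j] + 1, // deletion
          curr[j - 1] + 1, // insertion
          prev[j - 1] + cost, // substitution or match
        ),
      );
    }
    prev = curr;
  }
  return prev[hyp.length] / ref.length;
}
```

Under that reading, an accuracy above 90% corresponds to a WER below 0.1 measured against a human-made reference transcript.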