Test
Developed by GleamEyeBeast
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-base-960h, achieving a word error rate of 21.61% on the evaluation set.
Downloads 21
Release Time: 3/2/2022
Model Overview
This model is fine-tuned for speech recognition from Facebook's wav2vec2 architecture and is suited to English speech-to-text tasks.
Model Features
Efficient Fine-tuning
Fine-tuned from the pre-trained wav2vec2 model, which allows adaptation with limited data
Low Word Error Rate
Achieved a word error rate of 21.61% on the evaluation set
Lightweight
Based on the wav2vec2-base architecture, requiring fewer computational resources than larger models
Model Capabilities
English Speech Recognition
Audio-to-Text Conversion
Speech Transcription
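The capabilities above could be exercised with a sketch like the following, which assumes the Hugging Face transformers library and ffmpeg are installed. The helper name transcribe and the audio path are hypothetical. The default model_id is the base checkpoint named on this page (facebook/wav2vec2-base-960h); the repository ID of the fine-tuned checkpoint is not given here, so it would need to be passed in explicitly.

```python
def transcribe(audio_path: str, model_id: str = "facebook/wav2vec2-base-960h") -> str:
    """Transcribe an English audio file with a wav2vec2 CTC checkpoint.

    `model_id` defaults to the base checkpoint named on this page; pass the
    fine-tuned checkpoint's repository ID instead (not stated here).
    """
    # Imported lazily so this module loads even if transformers is absent.
    from transformers import pipeline

    # Build a speech recognition pipeline around the chosen checkpoint.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    # Given a file path, the pipeline decodes the audio with ffmpeg and
    # resamples it to the rate the model expects before running inference.
    result = asr(audio_path)
    return result["text"]


if __name__ == "__main__":
    # "meeting.wav" is a placeholder path, not a file shipped with the model.
    print(transcribe("meeting.wav"))
```

Keeping the checkpoint as a parameter makes it straightforward to compare the base model and a fine-tuned one on the same recording.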
Use Cases
Speech Transcription
Meeting Minutes
Automatically transcribe English meeting recordings into text records
Word accuracy of approximately 78.39% (100% minus the 21.61% word error rate on the evaluation set)
Voice Notes
Convert English voice notes into searchable text
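For readers unfamiliar with the metric quoted on this page, here is a minimal sketch of how a word error rate is commonly computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name word_error_rate and the sample sentences are illustrative assumptions, not the evaluation code used for this model.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Illustrative sentences: one substituted word out of five reference words.
wer = word_error_rate("the meeting starts at nine", "the meeting start at nine")
word_accuracy = 1.0 - wer
```

Because insertions count as errors, a word error rate can exceed 100%, so "100% minus WER" is best read as an approximate word accuracy rather than a strict percentage of correct words.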