Wav2vec2 Base 20sec Timit And Dementiabank
W
Wav2vec2 Base 20sec Timit And Dementiabank
Developed by shields
A speech recognition model fine-tuned based on facebook/wav2vec2-base, trained on TIMIT and DementiaBank datasets, suitable for English speech recognition tasks.
Downloads 18
Release Time : 3/2/2022
Model Overview
This model is a fine-tuned version of wav2vec2-base, focusing on English speech recognition, specifically optimized for TIMIT and DementiaBank datasets.
Model Features
Efficient Fine-tuning
Fine-tuned based on the wav2vec2-base model, achieving good recognition performance on specific datasets.
Low Word Error Rate
Achieved a word error rate (WER) of 0.2313 on the evaluation set, demonstrating good performance.
Mixed Precision Training
Used native AMP for mixed precision training, improving training efficiency.
Model Capabilities
English Speech Recognition
Audio to Text Conversion
Use Cases
Healthcare
Dementia Voice Analysis
Can be used to analyze voice characteristics of dementia patients
Speech Recognition Applications
English Speech Transcription
Convert English speech content into text
Word error rate 0.2313
Featured Recommended AI Models