Wav2vec2 Base Dataset Asr Demo Colab
W
Wav2vec2 Base Dataset Asr Demo Colab
Developed by aminnaghavi
This is a speech recognition model fine-tuned on the superb dataset based on distilhubert, primarily used for Automatic Speech Recognition (ASR) tasks.
Downloads 34
Release Time : 6/17/2022
Model Overview
This model is a fine-tuned speech recognition model based on ntu-spml/distilhubert, trained on the superb dataset, capable of converting speech to text.
Model Features
Efficient Speech Recognition
Fine-tuned on the superb dataset, offering good speech recognition capabilities
Lightweight Model
Based on the distilhubert architecture, more lightweight compared to the full model
Mixed Precision Training
Uses native AMP for mixed precision training, improving training efficiency
Model Capabilities
Speech-to-Text
Automatic Speech Recognition
Use Cases
Speech Transcription
Meeting Minutes
Automatically convert meeting recordings into text transcripts
Subtitle Generation
Automatically generate subtitles for video content
Featured Recommended AI Models
Š 2025AIbase