Wav2vec2 Base Toy Train Data Augment 0.1

Developed by scasutt
A speech recognition model fine-tuned from facebook/wav2vec2-base on a toy dataset, with data augmentation applied at a ratio of 0.1.
Downloads 22
Release Time: 3/25/2022

Model Overview

This model is a fine-tuned version of facebook/wav2vec2-base intended for speech recognition. Its current performance is poor, with a reported word error rate (WER) of 0.9954 (about 99.5%).
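To put the reported WER in context, the sketch below computes word error rate as the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. It is a minimal illustration in plain Python, not the evaluation script used for this model; the function name `word_error_rate` and the whitespace tokenization are assumptions made for the example.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = word-level edit distance / number of reference words.

    Assumption: words are separated by whitespace and compared exactly
    (no case folding or punctuation stripping).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)
```

Under this definition a WER of 0.9954 means the number of word errors is nearly equal to the number of reference words. Because insertions count as errors, WER can also exceed 1.0.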

Model Features

Data augmentation training
Data augmentation was applied at a ratio of 0.1 during training
Based on wav2vec2 architecture
Uses facebook/wav2vec2-base as the base model

Model Capabilities

Speech recognition
Audio feature extraction

Use Cases

Speech processing
Speech-to-text
Convert speech content to text
The word error rate is currently high (WER = 0.9954), so transcripts should be treated as unreliable
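A speech-to-text call with this kind of checkpoint might look like the sketch below, which uses the Hugging Face `transformers` automatic-speech-recognition pipeline. The repository ID in `MODEL_ID` is an assumption inferred from the model name and author on this page and has not been verified. `transcribe` is a hypothetical helper name. Decoding an audio file from a path also requires `ffmpeg` to be installed, and wav2vec2-base expects 16 kHz audio.

```python
# Assumed repository ID (author/name inferred from this page, not verified).
MODEL_ID = "scasutt/wav2vec2-base_toy_train_data_augment_0.1"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file with a wav2vec2 CTC checkpoint.

    transformers is imported lazily so this module can be imported
    without the dependency installed.
    """
    from transformers import pipeline

    # Builds a feature extractor + model + CTC decoder for the checkpoint.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)  # returns a dict with a "text" key
    return result["text"]


if __name__ == "__main__":
    # "sample.wav" is a placeholder path for illustration only.
    print(transcribe("sample.wav"))
```

Given the reported WER, the output of such a call is unlikely to be usable as a transcript. The sketch is mainly useful for checking the model end to end.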