W

Wav2vec2 Base Toy Train Data Random Noise

Developed by scasutt
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-base using toy training data, primarily designed for speech recognition tasks in random noise environments.
Downloads 22
Release Time : 3/27/2022

Model Overview

This is a fine-tuned wav2vec2-base model specifically designed for speech recognition in random noise environments. The model achieved a word error rate of 0.7351 on the evaluation set.

Model Features

Noise Environment Adaptation
Optimized for random noise environments, improving speech recognition accuracy in noisy conditions
Based on wav2vec2 Architecture
Built upon the powerful wav2vec2-base model, inheriting its excellent speech feature extraction capabilities

Model Capabilities

Speech Recognition
Speech processing in noisy environments

Use Cases

Speech Transcription
Speech Transcription in Noisy Environments
Convert speech containing random noise into text
Word Error Rate 0.7351
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase