W

Wav2vec2 Base Toy Train Data Masked Audio 10ms

Developed by scasutt
A speech recognition model fine-tuned based on facebook/wav2vec2-base, trained on 10ms masked audio tasks
Downloads 22
Release Time : 3/26/2022

Model Overview

This model is a fine-tuned version of wav2vec2-base, focusing on processing masked audio data, suitable for speech recognition tasks.

Model Features

10ms masked audio processing
Specially optimized for training on masked audio data with 10ms intervals
Fine-tuned based on wav2vec2-base
Targeted optimization based on the mature wav2vec2-base architecture

Model Capabilities

Speech recognition
Masked audio processing

Use Cases

Speech processing
Incomplete audio recognition
Recognizing speech content that is partially masked or missing
WER 0.7145
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase