Wav2vec2 Base Toy Train Data Slow 10pct

Developed by scasutt
A speech recognition model fine-tuned from facebook/wav2vec2-base on an unspecified dataset, with a Word Error Rate (WER) of 0.7175 on the evaluation set
Downloads 22
Release Time: 3/27/2022

Model Overview

This model is a fine-tuned version of wav2vec2-base intended for speech recognition. It reaches a WER of 0.7175 on the evaluation set, which means roughly 72 word errors per 100 reference words, so accuracy is low and there is substantial room for improvement.
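
To make the WER figure concrete: WER is the word-level edit distance (substitutions, deletions, and insertions) between a hypothesis and its reference, divided by the number of reference words. The following is a minimal sketch of that definition. The function name and the sample sentences are illustrative only and are not taken from this model's evaluation code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref_words)


# Illustrative sentences: one substitution and one deletion over six words.
example_wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.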

Model Features

Fine-tuned from wav2vec2-base
Starts from the pretrained wav2vec2-base checkpoint and is fine-tuned for a specific speech recognition task
Linear Learning Rate Scheduling
Adopts a linear learning rate scheduling strategy with a 1000-step warm-up period
Gradient Accumulation Training
Uses gradient accumulation (steps=2) to increase effective batch size
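
The two training settings above can be sketched in plain Python. This is a minimal illustration, assuming the common "linear warm-up, then linear decay to zero" schedule shape. The 1000 warm-up steps and the 2 accumulation steps come from this page, while the peak learning rate, total step count, and per-device batch size used in the example are hypothetical placeholders.

```python
WARMUP_STEPS = 1000    # from this page
GRAD_ACCUM_STEPS = 2   # from this page


def linear_schedule_lr(step: int, peak_lr: float, total_steps: int,
                       warmup_steps: int = WARMUP_STEPS) -> float:
    """Learning rate at `step`: linear ramp-up, then linear decay to zero."""
    if step < warmup_steps:
        # Warm-up phase: grow linearly from 0 to peak_lr.
        return peak_lr * step / max(1, warmup_steps)
    # Decay phase: shrink linearly from peak_lr to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


def effective_batch_size(per_device_batch: int,
                         accum_steps: int = GRAD_ACCUM_STEPS,
                         num_devices: int = 1) -> int:
    """Samples contributing to each optimizer update."""
    return per_device_batch * accum_steps * num_devices


# Hypothetical values, chosen only to exercise the functions.
halfway_through_warmup = linear_schedule_lr(500, peak_lr=1e-4, total_steps=5000)
batch_per_update = effective_batch_size(per_device_batch=16)
```

With accumulation, gradients from two consecutive batches are summed before each optimizer step, so the update behaves like one computed on a batch twice as large without the extra memory cost.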

Model Capabilities

Speech-to-Text
Automatic Speech Recognition

Use Cases

Speech Transcription
Meeting Minutes Transcription
Convert meeting recordings into text transcripts
Word Error Rate 0.7175
Voice Command Recognition
Recognize simple voice commands