Alphadelay

Developed by renBaikau
A speech recognition model fine-tuned from facebook/wav2vec2-base, with a reported word error rate (WER) of 1.0
Downloads 17
Release Time: 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-base, suitable for tasks that convert speech to text.
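As a rough illustration of how a fine-tuned wav2vec2 checkpoint like this one is typically used, the sketch below wraps the Hugging Face transformers ASR pipeline in a small helper. The Hub identifier `renBaikau/alphadelay` and the helper name `transcribe` are assumptions made for illustration; neither is stated in this listing, so substitute the identifier you actually have.

```python
# Assumed Hub identifier; it is NOT stated in the listing and may differ.
MODEL_ID = "renBaikau/alphadelay"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Return the text transcribed from one audio file (hypothetical helper)."""
    # Imported lazily so that defining the helper does not require the dependency.
    from transformers import pipeline

    # Builds an ASR pipeline around the checkpoint; downloads it on first use.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)  # a dict that includes a "text" entry
    return result["text"]


# Example call (needs network access and a local audio file):
#   print(transcribe("meeting.wav"))
```

wav2vec2-base expects 16 kHz audio. When given a file path, the pipeline decodes and resamples the file itself, which requires ffmpeg to be installed.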

Model Features

Based on the wav2vec2 architecture
Uses the proven wav2vec2-base architecture, which provides strong speech feature extraction
Fine-tuning optimization
Fine-tuned for 15 rounds on top of the base model to optimize performance in specific scenarios
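To put the reported WER of 1.0 in context, here is a minimal sketch of the standard word error rate: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The listing does not say how its figure was computed, so the function name `word_error_rate` and the whitespace tokenization are assumptions. Under this definition, 1.0 means the number of errors equals the number of reference words.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


perfect = word_error_rate("the cat sat", "the cat sat")   # no errors
all_wrong = word_error_rate("the cat sat", "a dog ran")   # every word substituted
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.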

Model Capabilities

Speech-to-text
Automatic Speech Recognition

Use Cases

Speech Transcription
Meeting Minutes
Automatically convert meeting recordings into text transcripts
Voice Notes
Convert voice memos into searchable text