W

Wav2vec2 Base MIR ST500 ASR 109

Developed by gary109
A fine-tuned automatic speech recognition model based on facebook/wav2vec2-base on the MIR_ST500 dataset
Downloads 15
Release Time : 4/15/2022

Model Overview

This model is a fine-tuned version for automatic speech recognition (ASR) tasks, trained on the MIR_ST500 dataset, capable of converting speech to text.

Model Features

Based on wav2vec2 architecture
Uses facebook's wav2vec2-base as the foundational architecture with excellent speech feature extraction capabilities
Domain-specific fine-tuning
Fine-tuned on the MIR_ST500 dataset, potentially optimized for specific domains or accents
Multi-GPU training
Utilizes 2 GPUs for distributed training, improving training efficiency

Model Capabilities

Speech-to-text
Automatic speech recognition

Use Cases

Speech transcription
Meeting minutes
Automatically convert meeting recordings into written transcripts
Voice notes
Convert voice memos into searchable text
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase