Wav2vec2 Base Libir Zenodo
W
Wav2vec2 Base Libir Zenodo
Developed by samantharhay
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-base-960h on an unknown dataset, primarily used for automatic speech recognition tasks.
Downloads 25
Release Time : 3/2/2022
Model Overview
wav2vec2-base-libir-zenodo is a speech recognition model based on the wav2vec2 architecture, fine-tuned for converting speech to text.
Model Features
Based on wav2vec2 Architecture
Uses facebook/wav2vec2-base-960h as the base model, featuring robust speech feature extraction capabilities.
Fine-tuning Optimization
Underwent 30 epochs of fine-tuning on a specific dataset to optimize speech recognition performance.
Mixed Precision Training
Utilizes native AMP mixed precision training technology to enhance training efficiency.
Model Capabilities
Speech Recognition
Audio to Text Conversion
Use Cases
Speech Transcription
Meeting Minutes
Automatically converts meeting recordings into written transcripts.
Voice Notes
Converts voice memos into searchable text.
Featured Recommended AI Models
Š 2025AIbase