W

Wav2vec2 Base Libir Zenodo

Developed by samantharhay
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-base-960h on an unknown dataset, primarily used for automatic speech recognition tasks.
Downloads 25
Release Time : 3/2/2022

Model Overview

wav2vec2-base-libir-zenodo is a speech recognition model based on the wav2vec2 architecture, fine-tuned for converting speech to text.

Model Features

Based on wav2vec2 Architecture
Uses facebook/wav2vec2-base-960h as the base model, featuring robust speech feature extraction capabilities.
Fine-tuning Optimization
Underwent 30 epochs of fine-tuning on a specific dataset to optimize speech recognition performance.
Mixed Precision Training
Utilizes native AMP mixed precision training technology to enhance training efficiency.

Model Capabilities

Speech Recognition
Audio to Text Conversion

Use Cases

Speech Transcription
Meeting Minutes
Automatically converts meeting recordings into written transcripts.
Voice Notes
Converts voice memos into searchable text.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase