Wav2vec2-base-960h-finetuned Common Voice 3 Open-source Speech Recognition Model - Suitable for General Speech Recognition Tasks

Wav2vec2 Base 960h Finetuned Common Voice3

Developed by obokkkk

A speech recognition model fine-tuned based on facebook/wav2vec2-base-960h, suitable for general speech recognition tasks

Downloads 20

Release Time : 4/28/2022

Model Overview

This model is a fine-tuned version of wav2vec2-base-960h on the Common Voice dataset, primarily used for Automatic Speech Recognition (ASR) tasks.

Based on wav2vec2 Architecture

Utilizes the advanced wav2vec2 architecture to provide high-quality speech recognition capabilities

Fine-tuned on Common Voice Dataset

The model was fine-tuned on the Common Voice dataset, improving recognition accuracy

Supports Large-Scale Training

Used a total batch size of 1024 during training to ensure the model fully learns data features

Speech Recognition

Audio-to-Text Conversion

Speech Transcription

Meeting Minutes

Automatically convert meeting recordings into text transcripts

Subtitle Generation

Automatically generate subtitles for video content

Voice Assistants

Voice Command Recognition

Recognize and process user voice commands

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base