wav2vec2-base-dataset_asr-demo-colab: Open-Source Speech Recognition Model


Developed by aminnaghavi

A speech recognition model fine-tuned from distilhubert on the superb dataset, intended primarily for Automatic Speech Recognition (ASR) tasks.

Speech Recognition

Transformers

Open Source License: Apache-2.0 #Speech Recognition #Low Word Error Rate #Fine-tuned Model

Downloads 34

Release Time: 6/17/2022

Model Overview

This model is a fine-tuned version of ntu-spml/distilhubert, trained on the superb dataset to convert speech to text.
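As an illustration of how a checkpoint like this is typically loaded, here is a minimal sketch using the Transformers `pipeline` API. The Hub identifier `aminnaghavi/wav2vec2-base-dataset_asr-demo-colab` is an assumption pieced together from the developer and model names on this page, and the sketch also assumes the repository ships a tokenizer and feature extractor; neither is confirmed by this page. The helper name `transcribe` is hypothetical.

```python
# Assumed Hub identifier (developer name + model name); not confirmed by this page.
MODEL_ID = "aminnaghavi/wav2vec2-base-dataset_asr-demo-colab"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file with a Transformers ASR pipeline."""
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    # Downloads the checkpoint on first use; needs network access.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    # For a single input the pipeline returns a dict with a "text" field.
    return asr(audio_path)["text"]
```

Calling `transcribe("meeting.wav")` would need `transformers`, a backend such as PyTorch, and ffmpeg for decoding audio files; the file name is only an example.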

Model Features

Efficient Speech Recognition

Fine-tuned on the superb dataset; the last logged checkpoint (step 1500) reports an evaluation word error rate (WER) of 0.8282

Lightweight Model

Based on the distilhubert architecture, which is lighter than the full HuBERT model it is distilled from

Mixed Precision Training

Uses native AMP (automatic mixed precision) during training to improve training efficiency

Model Capabilities

Speech-to-Text

Automatic Speech Recognition

Use Cases

Speech Transcription

Meeting Minutes

Automatically convert meeting recordings into text transcripts

Subtitle Generation

Automatically generate subtitles for video content
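For the subtitle use case, transcribed text has to be paired with timestamps and written in a subtitle format. The sketch below formats already-timestamped segments as SubRip (SRT) text. It assumes the segments arrive as `(start_seconds, end_seconds, text)` tuples from some upstream step; how this model would produce those timestamps is not described on this page, and the function names are hypothetical.

```python
def format_timestamp(seconds: float) -> str:
    """Render seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments) -> str:
    """Build SRT text from (start_seconds, end_seconds, text) tuples."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        # Each cue: running number, time range, then the caption text.
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # Cues are separated by one blank line.
    return "\n".join(blocks)
```

Writing the returned string to a `.srt` file gives a subtitle track that most video players can load alongside the video.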

Training Results

Training Loss	Epoch	Step	Validation Loss	WER
5638.536	1.6	500	409.4785	0.8556
2258.6455	3.19	1000	326.0520	0.8369
1389.4919	4.79	1500	295.0834	0.8282
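To help read the WER column, here is a minimal sketch of how word error rate is conventionally computed: the word-level edit distance between reference and hypothesis, divided by the number of reference words. It assumes the table uses this standard definition, which this page does not state explicitly. Under that definition, a WER of 0.8282 corresponds to roughly 83 word errors per 100 reference words.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the processed ref prefix and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution or exact match
            )
        prev = curr
    return prev[-1] / len(ref)
```

For example, comparing the reference "the cat sat on the mat" with the hypothesis "the cat sit on mat" gives one substitution and one deletion over six reference words, a WER of about 0.33. Because insertions also count as errors, the value can exceed 1.0.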

