
Distil Wav2vec2

Developed by OthmaneJ
Distil-wav2vec2 is a distilled version of the wav2vec2 model: it is 45% smaller and roughly twice as fast at inference, making it suitable for automatic speech recognition tasks.
Downloads: 854
Released: 3/2/2022

Model Overview

This model is a lightweight version of wav2vec2 aimed at automatic speech recognition. Knowledge distillation yields a smaller model with faster inference while preserving most of the teacher's accuracy.

Model Features

Lightweight
The model size is 45% smaller than the original wav2vec2 base model, making it more suitable for resource-constrained environments.
Efficient Inference
Inference is roughly twice as fast as the base model: 0.4006 seconds on CPU and 0.0046 seconds on GPU (batch size 64).
Balanced Performance
Maintains a relatively low word error rate while significantly improving operational efficiency.

Model Capabilities

English Speech Recognition
Audio-to-Text Conversion
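A minimal transcription sketch, assuming the distilled model keeps the standard wav2vec2 CTC interface from Hugging Face transformers and that the repo id is `OthmaneJ/distil-wav2vec2` (taken from this card's author and model name; verify before use). The dummy zero waveform stands in for real 16 kHz mono speech samples.

```python
import numpy as np
import torch
from transformers import Wav2Vec2Processor, Wav2Vec2ForCTC

# Assumed repo id; the model is loaded with the standard wav2vec2 classes.
repo = "OthmaneJ/distil-wav2vec2"
processor = Wav2Vec2Processor.from_pretrained(repo)
model = Wav2Vec2ForCTC.from_pretrained(repo)
model.eval()

# One second of silence at 16 kHz; replace with real speech samples
# (a 1-D float array, mono, sampled at 16 kHz).
speech = np.zeros(16000, dtype=np.float32)
inputs = processor(speech, sampling_rate=16000, return_tensors="pt")

with torch.no_grad():
    logits = model(inputs.input_values).logits  # (batch, time, vocab)

# Greedy CTC decoding: take the most likely token at each frame,
# then collapse repeats and blanks via the processor.
pred_ids = torch.argmax(logits, dim=-1)
text = processor.batch_decode(pred_ids)[0]
print(text)
```

With real audio, `text` holds the transcript; for the silent input above it is typically an empty string.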

Use Cases

Speech Transcription
Meeting Minutes
Automatically transcribe meeting recordings into text
Word error rate on LibriSpeech test-clean is 0.0983
Voice Assistant
Used as the speech recognition module for lightweight voice assistants
Achieves fast response on resource-constrained devices
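The word error rate quoted above is the standard ASR metric: word-level edit distance (substitutions + deletions + insertions) divided by the number of reference words. A self-contained sketch of that computation (the example sentences are illustrative, not from the model's evaluation):

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level Levenshtein distance divided by
    the number of words in the reference."""
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edit distance between ref[:i] and hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i
    for j in range(len(hyp) + 1):
        dp[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,        # deletion
                           dp[i][j - 1] + 1,        # insertion
                           dp[i - 1][j - 1] + cost)  # substitution/match
    return dp[len(ref)][len(hyp)] / len(ref)

# One substitution ("starts" -> "start") over 5 reference words.
print(wer("the meeting starts at noon", "the meeting start at noon"))  # 0.2
```

A WER of 0.0983 thus means just under one word error per ten reference words.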