Wav2vec2-base-ft-keyword-spotting-int8 Open Source Model

Home

Wav2vec2 Base Ft Keyword Spotting Int8

Developed by dkurt

A speech keyword detection model based on the wav2vec2 architecture, optimized with OpenVINO quantization

Speech Recognition

Transformers

#Keyword Detection #Speech Recognition #OpenVINO Quantization

Downloads 14

Release Time : 3/2/2022

Model Overview

This model is a speech processing model based on the wav2vec2 architecture, specifically designed for keyword detection tasks. After Optimum OpenVINO quantization, the model improves inference efficiency while maintaining high accuracy.

Model Features

Efficient Quantization

Quantized with Optimum OpenVINO, significantly improving inference efficiency with only a 2.74% drop in accuracy

High Accuracy

Achieves a baseline accuracy of 0.9828 on the evaluation set, maintaining a high accuracy of 0.9553 after quantization

Lightweight

Based on the wav2vec2-base architecture, relatively lightweight and suitable for edge device deployment (inferred)

Model Capabilities

Speech keyword recognition

Real-time audio processing

Edge device deployment

Use Cases

Smart Home

Wake Word Detection

Used to detect device wake words such as 'Hey Siri', 'OK Google', etc.

High accuracy ensures reliable device response

Industrial Applications

Voice Command Recognition

Recognizes specific voice commands in noisy industrial environments

Quantized model is suitable for deployment on edge devices

Property	Details
Accuracy on eval (baseline)	0.9828
Accuracy on eval (quantized)	0.9553 (-0.0274)

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Base Ft Keyword Spotting Int8

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 anton-l/wav2vec2-base-ft-keyword-spotting Quantized Model

📚 Documentation

Accuracy Comparison