wav2vec2-keyword-spotting-int8 Open-Source Speech Keyword Detection Model

Home

Wav2vec2 Keyword Spotting Int8

Developed by sampras343

A speech keyword detection model based on the wav2vec2 architecture, optimized with Optimum OpenVINO quantization

Speech Recognition

Transformers

#Keyword Detection #Speech Recognition #OpenVINO Quantization

Downloads 17

Release Time : 6/13/2022

Model Overview

This model is based on the wav2vec2 architecture, specifically designed for speech keyword detection tasks, capable of identifying specific keywords in audio.

Model Features

Quantization Optimization

Enhanced inference efficiency through Optimum OpenVINO quantization

High Accuracy

Achieves a benchmark accuracy of 0.9828 on the evaluation set

Lightweight

Based on the wav2vec2-base architecture, relatively lightweight

Model Capabilities

Speech Keyword Detection

Real-time Audio Processing

Use Cases

Voice Interaction

Wake Word Detection

Used for wake word detection in smart devices

High accuracy in recognizing specific wake words

Voice Command Recognition

Recognizes simple voice commands

Quick response to voice instructions

Property	Details
Accuracy on eval (baseline)	0.9828
Accuracy on eval (quantized)	0.9553 (-0.0274)

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Keyword Spotting Int8

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Wav2Vec2 Base Fine-Tuned for Keyword Spotting (Quantized)

📊 Accuracy Comparison