Wav2vec2 Keyword Spotting Int8
A speech keyword detection model based on the wav2vec2 architecture, quantized to INT8 with Optimum OpenVINO
Downloads 17
Release Time: 6/13/2022
Model Overview
This model is built on the wav2vec2 architecture and is designed for speech keyword detection, that is, identifying specific keywords in audio.
Model Features
Quantization Optimization
Improves inference efficiency through INT8 quantization with Optimum OpenVINO
High Accuracy
Achieves an accuracy of 0.9828 on the evaluation set
Lightweight
Built on the relatively lightweight wav2vec2-base architecture
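To give a feel for what INT8 quantization does to a model's weights, here is a minimal sketch of symmetric per-tensor quantization in plain Python. It is a generic illustration only: the function names `quantize_int8` and `dequantize_int8` are hypothetical, and the actual Optimum OpenVINO pipeline (calibration, per-channel scales, activation quantization) is not described on this page and is likely more involved.

```python
from typing import List, Tuple


def quantize_int8(values: List[float]) -> Tuple[List[int], float]:
    """Symmetric per-tensor quantization of floats to int8 codes in [-127, 127]."""
    max_abs = max((abs(v) for v in values), default=0.0)
    # Fall back to a scale of 1.0 for an empty or all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest code and clamp to the int8 range.
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize_int8(codes: List[int], scale: float) -> List[float]:
    """Map int8 codes back to approximate float values."""
    return [c * scale for c in codes]


weights = [0.42, -1.27, 0.0, 0.64]  # made-up example values
codes, scale = quantize_int8(weights)
restored = dequantize_int8(codes, scale)
print(codes)     # integer codes stored in place of the float weights
print(restored)  # approximations of the original weights
```

Each weight is stored as one byte instead of four, at the cost of a rounding error of at most half a quantization step (`scale / 2`) per value.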
Model Capabilities
Speech Keyword Detection
Real-time Audio Processing
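Real-time processing usually means cutting a continuous audio stream into short, overlapping windows and classifying each one. The sketch below shows one way to do that in plain Python. The 1-second window and 0.5-second hop are assumptions chosen for illustration, and `sliding_windows` is a hypothetical helper; the 16 kHz sample rate matches the audio that wav2vec2-base models are pretrained on.

```python
from typing import Iterator, List, Sequence

SAMPLE_RATE = 16_000  # wav2vec2-base models expect 16 kHz audio


def sliding_windows(
    samples: Sequence[float],
    window_s: float = 1.0,  # assumed window length in seconds
    hop_s: float = 0.5,     # assumed hop between window starts in seconds
    sample_rate: int = SAMPLE_RATE,
) -> Iterator[List[float]]:
    """Yield fixed-length, overlapping windows from a buffer of audio samples."""
    window = int(window_s * sample_rate)
    hop = int(hop_s * sample_rate)
    if window <= 0 or hop <= 0:
        raise ValueError("window_s and hop_s must be positive")
    # Stop before a window would run past the end of the buffer.
    for start in range(0, len(samples) - window + 1, hop):
        yield list(samples[start:start + window])


# Two seconds of silence stands in for captured microphone audio.
audio = [0.0] * (2 * SAMPLE_RATE)
windows = list(sliding_windows(audio))
print(len(windows), len(windows[0]))
```

Each yielded window would then be passed to the classifier; a shorter hop lowers detection latency but increases the number of inference calls per second.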
Use Cases
Voice Interaction
Wake Word Detection
Used for wake word detection in smart devices
High accuracy in recognizing specific wake words
Voice Command Recognition
Recognizes simple voice commands
Responds quickly to voice instructions
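For wake word and command use cases like those above, raw per-window classifier scores are often smoothed and thresholded before a detection is reported, so that a single noisy window does not trigger the device. The following is a minimal sketch of that decision step under stated assumptions: `detect_keyword` is a hypothetical helper, the scores are made-up keyword probabilities rather than output from this model, and the threshold, smoothing length, and cooldown values are illustrative.

```python
from collections import deque
from typing import Deque, Iterable, List


def detect_keyword(
    scores: Iterable[float],
    threshold: float = 0.8,  # assumed decision threshold
    smooth: int = 3,         # number of recent windows to average
    cooldown: int = 2,       # windows to ignore after a detection fires
) -> List[int]:
    """Return the indices of the windows at which a detection fires."""
    recent: Deque[float] = deque(maxlen=smooth)
    detections: List[int] = []
    wait = 0
    for index, score in enumerate(scores):
        recent.append(score)
        average = sum(recent) / len(recent)
        if wait > 0:
            # Still in the cooldown period after the previous detection.
            wait -= 1
            continue
        if average >= threshold:
            detections.append(index)
            wait = cooldown
    return detections


# Hypothetical per-window probabilities for the target keyword.
scores = [0.1, 0.2, 0.9, 0.95, 0.97, 0.3, 0.1, 0.1]
print(detect_keyword(scores))  # → [4]
```

Averaging over more windows reduces false triggers but delays the response, so the smoothing length is a trade-off between accuracy and the quick response expected of voice interfaces.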