Wav2vec2 Base Superb Ks

Developed by superb
SUPERB keyword spotting model based on wav2vec2-base; expects speech input sampled at 16 kHz
Downloads 5,820
Release Time: 3/2/2022

Model Overview

This model is ported from S3PRL for the SUPERB keyword spotting task. It classifies each utterance into a predefined vocabulary in order to recognize preregistered keywords.
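As a rough illustration of that classification step, the sketch below calls the model through the Hugging Face `transformers` audio-classification pipeline. The Hub id `superb/wav2vec2-base-superb-ks` is an assumption inferred from the model name and developer listed here, and `classify_file` and `best_prediction` are hypothetical helper names, not part of any library.

```python
from typing import Dict, List

# Assumed Hub id, inferred from the model name and developer; verify before use.
MODEL_ID = "superb/wav2vec2-base-superb-ks"


def classify_file(audio_path: str, top_k: int = 5) -> List[Dict]:
    """Run the audio-classification pipeline on one audio file."""
    # Imported lazily so best_prediction() works without transformers installed.
    from transformers import pipeline

    classifier = pipeline("audio-classification", model=MODEL_ID)
    # Each result is a dict with a "label" string and a "score" float.
    return classifier(audio_path, top_k=top_k)


def best_prediction(predictions: List[Dict]) -> Dict:
    """Return the highest-scoring entry from a pipeline-style result list."""
    if not predictions:
        raise ValueError("no predictions to choose from")
    return max(predictions, key=lambda p: p["score"])
```

Calling `best_prediction(classify_file("command.wav"))` would then give the single most likely class for a recording, where `command.wav` is a placeholder file name.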

Model Features

High Accuracy
Achieves 96.43% accuracy on the Speech Commands v1.0 test set
On-device Friendly
Targets keyword spotting, which usually runs on-device, where accuracy, model size, and inference speed all matter
Standardized Processing
Expects speech input sampled at 16 kHz; audio recorded at other rates should be resampled first
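One way to guard against a sample-rate mismatch is to read the rate from the file header before classification. The sketch below uses only Python's standard `wave` module and assumes uncompressed PCM WAV input; `wav_sample_rate` and `needs_resampling` are hypothetical helper names.

```python
import wave

EXPECTED_RATE_HZ = 16_000  # the sampling rate stated for this model


def wav_sample_rate(path: str) -> int:
    """Read the sample rate from a PCM WAV file header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def needs_resampling(path: str, expected_hz: int = EXPECTED_RATE_HZ) -> bool:
    """Return True when the file's sample rate differs from the expected rate."""
    return wav_sample_rate(path) != expected_hz
```

A file that fails this check can be resampled with an audio library of your choice before it is passed to the model.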

Model Capabilities

Speech Classification
Keyword Recognition
Silence Detection
Unknown Word Detection
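The capabilities above can be combined into a single decision per utterance: act on a keyword only when the predicted class is neither silence nor unknown and the score is high enough. This is a minimal sketch; the label strings `_silence_` and `_unknown_` and the 0.8 threshold are assumptions to be checked against the model's own label map and tuned on real data.

```python
from typing import Optional

# Assumed label strings for the two non-keyword classes; check the model's
# own label map before relying on them.
SILENCE_LABEL = "_silence_"
UNKNOWN_LABEL = "_unknown_"


def accepted_keyword(label: str, score: float, min_score: float = 0.8) -> Optional[str]:
    """Return the keyword if it should be acted on, otherwise None."""
    if label in (SILENCE_LABEL, UNKNOWN_LABEL):
        return None  # silence or a word outside the keyword vocabulary
    if score < min_score:
        return None  # prediction too uncertain to act on
    return label
```

For example, `accepted_keyword("stop", 0.95)` returns `"stop"`, while a silence or unknown prediction returns `None` regardless of its score.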

Use Cases

Smart Device Control
Voice Assistant Wake Word Detection
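Can serve as a starting point for detecting wake words such as 'Hey Siri' or 'OK Google'; the released model is trained on the Speech Commands v1.0 keywords, so other phrases would require retraining
High-accuracy recognition reduces false triggers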
Accessibility Technology
Voice Control Interface
Provides voice command recognition for users with mobility impairments
Enables efficient and accurate command recognition
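In both use cases, false triggers can be reduced further by requiring the same keyword in several consecutive audio windows before acting on it. The sketch below assumes the audio stream is classified window by window and that each window yields either a keyword string or `None`; `debounce` and `min_repeats` are hypothetical names, and the default of 2 is only a starting point.

```python
from typing import Iterable, List, Optional


def debounce(labels: Iterable[Optional[str]], min_repeats: int = 2) -> List[str]:
    """Emit a keyword once it appears in min_repeats consecutive windows."""
    triggers: List[str] = []
    previous: Optional[str] = None
    run_length = 0
    for label in labels:
        if label is not None and label == previous:
            run_length += 1
        else:
            # A new keyword starts a run of one; None resets the run.
            run_length = 1 if label is not None else 0
        previous = label
        # Fire exactly once, at the moment the run reaches the required length.
        if label is not None and run_length == min_repeats:
            triggers.append(label)
    return triggers
```

With the default setting, an isolated single-window detection is ignored, while a keyword held across two or more windows triggers once.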