Ast Finetuned Speech Commands V2
A voice command recognition model based on AST architecture, optimized for web deployment in ONNX format
Downloads 15
Release Time : 6/27/2023
Model Overview
This model is a voice command recognition model released by MIT, fine-tuned on the Audio Spectrogram Transformer (AST) architecture and converted to ONNX format to meet the web deployment requirements of the Transformers.js library
Model Features
Web Optimization
Converted to ONNX format to adapt to Transformers.js, supporting direct operation in browser environments
Lightweight Deployment
Designed for edge computing scenarios, suitable for client applications with limited resources
Real-time Processing
Optimized for voice command recognition scenarios with low latency characteristics
Model Capabilities
Voice Command Recognition
Audio Classification
Real-time Voice Processing
Use Cases
Smart Home
Voice-controlled Devices
Control smart home devices through voice commands
Achieve high-accuracy touch-free control
Assistive Technology
Voice Assistive Systems
Provide voice interaction interfaces for users with mobility impairments
Lower the barrier to device operation
Featured Recommended AI Models
Š 2025AIbase