Ast Finetuned Speech Commands V2

Developed by Xenova
A voice command recognition model based on the AST architecture, converted to ONNX format for web deployment.
Downloads 15
Release Time: 6/27/2023

Model Overview

This is a voice command recognition model. It is an ONNX conversion of the Audio Spectrogram Transformer (AST) checkpoint released by MIT and fine-tuned on Speech Commands v2, packaged so that it can be loaded by the Transformers.js library for web deployment.

Model Features

Web Optimization
Converted to ONNX format for Transformers.js, so it can run directly in the browser.
Lightweight Deployment
Designed for edge computing scenarios and client applications with limited resources.
Real-time Processing
Aimed at voice command recognition, where low latency matters.
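
For real-time use, a client usually has to cut a continuous audio stream into short clips before classifying each one. The sketch below shows one way to do that with overlapping windows. It assumes 16 kHz mono audio, 1-second windows, and a 0.5-second hop; these values and the sliceWindows helper are illustrative assumptions, not settings taken from this model's documentation.

```typescript
// Assumed values for illustration: 16 kHz mono input, 1 s windows, 0.5 s hop.
const SAMPLE_RATE = 16000;
const WINDOW_SAMPLES = SAMPLE_RATE;   // 1 second of audio
const HOP_SAMPLES = SAMPLE_RATE / 2;  // advance by 0.5 seconds each step

// Hypothetical helper: split a buffer of samples into overlapping windows.
// Trailing samples that do not fill a whole window are left for the next call.
function sliceWindows(
  samples: Float32Array,
  windowSize: number = WINDOW_SAMPLES,
  hop: number = HOP_SAMPLES,
): Float32Array[] {
  if (windowSize <= 0 || hop <= 0) {
    throw new RangeError("windowSize and hop must be positive");
  }
  const windows: Float32Array[] = [];
  for (let start = 0; start + windowSize <= samples.length; start += hop) {
    // subarray creates a view, so no audio data is copied here.
    windows.push(samples.subarray(start, start + windowSize));
  }
  return windows;
}
```

Each returned window could then be handed to the classifier. A shorter hop lowers the delay before a command is noticed, at the cost of more inference calls per second.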

Model Capabilities

Voice Command Recognition
Audio Classification
Real-time Voice Processing

Use Cases

Smart Home
Voice-controlled Devices
Control smart home devices through voice commands, enabling touch-free control.
Assistive Technology
Voice Assistive Systems
Provide voice interaction interfaces for users with mobility impairments, lowering the barrier to device operation.
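
In use cases like these, the application still has to decide when a prediction is trustworthy enough to act on. The following is a minimal sketch of that step, assuming the classifier returns a list of label and score pairs (the shape that audio classification pipelines in Transformers.js typically produce). The pickCommand helper, the 0.8 threshold, and the "on"/"off" labels in the usage note are assumptions chosen for illustration.

```typescript
// Assumed result shape: one entry per candidate label with a confidence score.
interface Prediction {
  label: string;
  score: number;
}

// Hypothetical helper: return the top label only if it is a command the
// application handles and its score reaches the (assumed) threshold.
function pickCommand(
  predictions: Prediction[],
  allowed: ReadonlySet<string>,
  minScore: number = 0.8,
): string | null {
  let best: Prediction | null = null;
  for (const p of predictions) {
    if (best === null || p.score > best.score) {
      best = p;
    }
  }
  if (best === null || best.score < minScore || !allowed.has(best.label)) {
    return null; // nothing confident enough: ignore this clip
  }
  return best.label;
}
```

For example, with allowed set to new Set(["on", "off"]), a result of [{ label: "on", score: 0.93 }, { label: "off", score: 0.04 }] yields "on", while a top score below the threshold yields null so the device does nothing. The threshold would need tuning against real recordings.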