AST VoxCelebSpoof Synthetic Voice Detection
A
AST VoxCelebSpoof Synthetic Voice Detection
Developed by MattyB95
A synthetic speech detection model fine-tuned based on MIT/ast-finetuned-audioset-10-10-0.4593, demonstrating outstanding performance on the VoxCelebSpoof dataset
Downloads 9,518
Release Time : 1/16/2024
Model Overview
This model is used to detect synthetic speech, fine-tuned on audio classification tasks based on the AST architecture, specifically optimized for voice spoofing detection scenarios
Model Features
High accuracy
Achieves 99.99% accuracy and F1 score on the evaluation set
AST-based architecture
Utilizes the Audio Spectrogram Transformer architecture, excelling in processing audio spectral features
Specialized for synthetic speech detection
Optimized for the VoxCelebSpoof dataset, particularly suitable for voice spoofing detection scenarios
Model Capabilities
Audio classification
Synthetic speech detection
Voice spoofing recognition
Use Cases
Security verification
Voice authentication systems
Used to detect synthetic speech attacks in voice authentication systems
Can effectively identify 99.99% of synthetic speech samples
Content moderation
Fake audio detection
Identifies AI-generated fake audio content on social media
Featured Recommended AI Models
Š 2025AIbase