A

AST VoxCelebSpoof Synthetic Voice Detection

Developed by MattyB95
A synthetic speech detection model fine-tuned based on MIT/ast-finetuned-audioset-10-10-0.4593, demonstrating outstanding performance on the VoxCelebSpoof dataset
Downloads 9,518
Release Time : 1/16/2024

Model Overview

This model is used to detect synthetic speech, fine-tuned on audio classification tasks based on the AST architecture, specifically optimized for voice spoofing detection scenarios

Model Features

High accuracy
Achieves 99.99% accuracy and F1 score on the evaluation set
AST-based architecture
Utilizes the Audio Spectrogram Transformer architecture, excelling in processing audio spectral features
Specialized for synthetic speech detection
Optimized for the VoxCelebSpoof dataset, particularly suitable for voice spoofing detection scenarios

Model Capabilities

Audio classification
Synthetic speech detection
Voice spoofing recognition

Use Cases

Security verification
Voice authentication systems
Used to detect synthetic speech attacks in voice authentication systems
Can effectively identify 99.99% of synthetic speech samples
Content moderation
Fake audio detection
Identifies AI-generated fake audio content on social media
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase