W

Whisper Large V3 Speech Flow

Developed by tiantiaf
A speech fluency classification model based on Whisper Large v3, capable of detecting speech fluency and disfluency types
Downloads 157
Release Time : 5/22/2025

Model Overview

This model implements a speech fluency classification method, first detecting whether speech is fluent, and if not, further classifying the disfluency type (blocking, prolongation, sound repetition, word repetition, interjection).

Model Features

Fluency Detection
Accurately distinguishes between fluent and disfluent speech segments
Disfluency Type Classification
Further classifies disfluent speech into 5 specific types
Windowed Processing
Uses 3-second window size and 1-second step size for processing long speech

Model Capabilities

Speech Fluency Detection
Disfluency Type Classification
Long Speech Segmentation Processing

Use Cases

Speech Therapy
Stuttering Assessment
Assists speech therapists in evaluating the severity and types of stuttering in patients
Quantitative analysis of the frequency and type distribution of disfluent speech
Speech Quality Analysis
Speech Fluency Scoring
Provides fluency metrics for speech quality assessment systems
Automatically generates speech fluency reports
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase