A

Asr Hubert Cluster Bart Base

Developed by voidful
An automatic speech recognition model based on Hubert and BART architecture, achieving speech-to-text conversion through clustered feature transformation
Downloads 13
Release Time : 3/2/2022

Model Overview

This model combines Hubert's speech feature extraction capability with BART's sequence-to-sequence transformation ability, specifically designed for automatic speech recognition (ASR) tasks.

Model Features

Hubert Feature Clustering
Uses Hubert to extract speech features and encodes them through k-means clustering
BART Sequence Transformation
Utilizes the BART model to convert clustered feature sequences into text sequences
Efficient Speech Processing
Capable of processing speech inputs at various sample rates and converting them into text

Model Capabilities

English Speech Recognition
Speech Feature Extraction
Sequence-to-Text Conversion

Use Cases

Speech Transcription
Lecture Transcription
Convert recorded lectures into written transcripts
Example result: 'Moving along the muddy country roads, speaking for two weeks in damp schoolhouses to damp audiences...'
Voice Assistants
Voice Command Recognition
Recognize and convert user voice commands into executable commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase