
HuBERT XLarge LS960 FT

Developed by Facebook
A HuBERT extra-large speech recognition model fine-tuned on 960 hours of LibriSpeech data, achieving a word error rate (WER) of 1.8% on the LibriSpeech clean test set
Downloads 8,160
Release Time: 3/2/2022

Model Overview

This model is a fine-tuned version of Facebook's HuBERT, a self-supervised speech representation learning model, adapted for English automatic speech recognition (ASR).

Model Features

Self-supervised Learning
Uses HuBERT's self-supervised learning approach, in which an offline clustering step provides the target labels for a BERT-style prediction loss
High Performance
Achieves a word error rate (WER) of 1.8% on the LibriSpeech clean test set
Large-scale Training
Fine-tuned on 960 hours of LibriSpeech audio data
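
To make the WER figure above concrete: WER is the word-level edit distance (substitutions, deletions, and insertions) between the model output and a reference transcript, divided by the number of reference words. A WER of 1.8% therefore corresponds to roughly 18 word errors per 1,000 reference words. The following is a minimal sketch of that calculation. The function name word_error_rate is illustrative and not part of any library, and published evaluations typically apply text normalization that is omitted here.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,         # deletion (reference word missing)
                    curr[j - 1] + 1,     # insertion (extra hypothesis word)
                    prev[j - 1] + cost,  # match or substitution
                )
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("sat" -> "sit") and one deletion ("the").
    wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
    print(f"WER: {wer:.1%}")
```

Because insertions count as errors, WER can exceed 100% when the hypothesis is much longer than the reference.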

Model Capabilities

English speech recognition
Processes audio sampled at 16 kHz
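
The two capabilities above can be sketched as a short transcription helper. This is a hedged example, not the official usage: it assumes the checkpoint is published on the Hugging Face Hub as facebook/hubert-xlarge-ls960-ft, that torch, transformers, and soundfile are installed, and that the Wav2Vec2Processor and HubertForCTC classes from transformers are appropriate for this checkpoint. The helper names check_sample_rate and transcribe are illustrative.

```python
MODEL_ID = "facebook/hubert-xlarge-ls960-ft"  # assumed Hub identifier
EXPECTED_RATE = 16_000  # Hz, the sample rate the model expects


def check_sample_rate(rate: int) -> None:
    """Raise if the audio is not sampled at the rate the model expects."""
    if rate != EXPECTED_RATE:
        raise ValueError(
            f"expected {EXPECTED_RATE} Hz audio, got {rate} Hz; resample first"
        )


def transcribe(path: str) -> str:
    """Transcribe one audio file using greedy (argmax) decoding."""
    # Heavy dependencies are imported lazily so check_sample_rate stays
    # usable without them. Assumes these packages are installed.
    import soundfile as sf
    import torch
    from transformers import HubertForCTC, Wav2Vec2Processor

    speech, rate = sf.read(path, dtype="float32")
    check_sample_rate(rate)
    if speech.ndim > 1:
        speech = speech.mean(axis=1)  # down-mix multi-channel audio to mono

    processor = Wav2Vec2Processor.from_pretrained(MODEL_ID)
    model = HubertForCTC.from_pretrained(MODEL_ID)
    model.eval()

    inputs = processor(speech, sampling_rate=rate, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits

    # Pick the most likely token per frame; the processor then collapses
    # repeated tokens and removes blanks when decoding to text.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

Calling transcribe("meeting.wav") (a hypothetical file name) downloads the checkpoint on first use. Given the extra-large model size, a GPU or a machine with ample memory is advisable.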

Use Cases

Speech Transcription
Meeting Minutes Transcription
Automatically transcribes English meeting recordings into text
Produces highly accurate transcripts
Audio Content Indexing
Creates searchable text indexes for audio content
Makes audio content easier to search
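
As an illustration of the audio indexing use case, the following minimal sketch builds an inverted index from transcripts to recording identifiers and answers simple keyword queries. It assumes the transcripts have already been produced by the model. The function names build_index and search and the sample recording IDs are hypothetical.

```python
from collections import defaultdict


def build_index(transcripts: dict[str, str]) -> dict[str, set[str]]:
    """Map each lowercased word to the IDs of recordings that contain it."""
    index: dict[str, set[str]] = defaultdict(set)
    for recording_id, text in transcripts.items():
        for word in text.lower().split():
            index[word].add(recording_id)
    return dict(index)


def search(index: dict[str, set[str]], query: str) -> set[str]:
    """Return the recordings that contain every word in the query."""
    words = query.lower().split()
    if not words:
        return set()
    results = set(index.get(words[0], set()))
    for word in words[1:]:
        results &= index.get(word, set())
    return results


if __name__ == "__main__":
    # Hypothetical transcripts keyed by recording ID.
    transcripts = {
        "standup-01": "BUDGET REVIEW FOR THE THIRD QUARTER",
        "standup-02": "hiring plan and budget",
    }
    index = build_index(transcripts)
    print(sorted(search(index, "budget")))
    print(sorted(search(index, "hiring budget")))
```

Lowercasing both the transcripts and the query keeps the lookup case-insensitive. A production index would more likely also store timestamps so that a hit can point to a position within the recording.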