Assignment1 Jane

Developed by Classroom-workshop
s2t-small-librispeech-asr is a speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture.
Downloads 29
Release Time: 6/2/2022

Model Overview

This model is an end-to-end sequence-to-sequence transformer. It is trained with a standard autoregressive cross-entropy loss and, at inference time, generates the transcription one token at a time.
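As a rough illustration of the training objective named above, the following is a minimal pure-Python sketch of a per-token cross-entropy loss computed under teacher forcing. It is not the model's actual training code: the function names (softmax, sequence_cross_entropy), the four-entry toy vocabulary, and the hand-written scores are assumptions made for the example.

```python
import math


def softmax(logits):
    """Turn raw scores into probabilities (shifted by the max for stability)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sequence_cross_entropy(step_logits, target_ids):
    """Mean negative log-likelihood of the reference tokens.

    step_logits[t] stands for the decoder's scores over the vocabulary at
    step t, assumed to be computed with the reference tokens before t as
    input (teacher forcing). target_ids[t] is the reference token at step t.
    """
    if len(step_logits) != len(target_ids):
        raise ValueError("need one score vector per target token")
    total = 0.0
    for logits, target in zip(step_logits, target_ids):
        total -= math.log(softmax(logits)[target])
    return total / len(target_ids)


# Hypothetical toy data: a 4-token vocabulary and hand-written scores.
uniform_scores = [[0.0, 0.0, 0.0, 0.0] for _ in range(3)]
loss_uniform = sequence_cross_entropy(uniform_scores, [1, 2, 3])

confident_scores = [[5.0, 0.0, 0.0, 0.0]]
loss_confident = sequence_cross_entropy(confident_scores, [0])

print(loss_uniform)    # log(4), about 1.386: the model is guessing uniformly
print(loss_confident)  # much smaller: most probability is on the right token
```

The loss falls as the scores put more probability on the reference token at each step, which is what training pushes the model toward.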

Model Features

End-to-end speech recognition
Directly generates text output from speech input without intermediate processing steps.
Autoregressive generation
Generates the transcription one token at a time, with each token conditioned on the audio and on the tokens already produced.
Trained on LibriSpeech
Trained on the LibriSpeech dataset of read English speech, which makes it suitable for English speech recognition tasks.
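The autoregressive generation described above can be sketched as a greedy decoding loop. This is a simplified illustration, not the model's real inference code: greedy_decode, toy_scores, the token ids, and the tiny vocabulary are all hypothetical, and the scoring function is a stand-in that ignores audio entirely. Real systems often use beam search rather than the greedy choice shown here.

```python
from typing import Callable, List, Sequence

# Hypothetical toy vocabulary; ids 0 and 1 play the start and end markers.
VOCAB = ["<s>", "</s>", "hello", "world"]
BOS_ID, EOS_ID = 0, 1


def greedy_decode(
    next_token_scores: Callable[[Sequence[int]], Sequence[float]],
    bos_id: int,
    eos_id: int,
    max_len: int,
) -> List[int]:
    """Build the output one token at a time.

    At each step the scorer sees every token chosen so far, and the
    highest-scoring token is appended. Decoding stops at the end marker
    or after max_len steps.
    """
    tokens = [bos_id]
    for _ in range(max_len):
        scores = next_token_scores(tokens)
        best = max(range(len(scores)), key=scores.__getitem__)
        if best == eos_id:
            break
        tokens.append(best)
    return tokens[1:]  # drop the start marker


def toy_scores(prefix: Sequence[int]) -> List[float]:
    """Stand-in for a trained decoder: always spells 'hello world'."""
    script = [2, 3, EOS_ID]
    step = len(prefix) - 1
    target = script[step] if step < len(script) else EOS_ID
    return [1.0 if i == target else 0.0 for i in range(len(VOCAB))]


decoded_ids = greedy_decode(toy_scores, BOS_ID, EOS_ID, max_len=10)
decoded_words = [VOCAB[i] for i in decoded_ids]
print(decoded_words)  # ['hello', 'world']
```

In a real speech model the scoring function would also take the encoded audio as input. The loop structure, where each new token depends on the prefix generated so far, is the part that carries over.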

Model Capabilities

Speech recognition
English transcription

Use Cases

Speech-to-text
Meeting minutes
Automatically convert meeting recordings into text transcripts.
Voice notes
Convert voice notes into editable text format.