Kamo Naoyuki Mini An4 Asr Train Raw Bpe Valid.acc.best

Developed by espnet
This is a pretrained automatic speech recognition (ASR) model built with the ESPnet2 framework. It was trained on the mini-an4 dataset and supports English speech recognition.
Downloads 425
Release Time: 3/2/2022

Model Overview

This model is an end-to-end automatic speech recognition model capable of converting input speech signals into corresponding text content.
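As a rough illustration of the speech-to-text flow described above, the sketch below loads a pretrained ESPnet2 model and decodes one audio file. It assumes the `espnet`, `espnet_model_zoo`, and `soundfile` packages are installed. The `transcribe` helper, the `wav_path` argument, and the `model_tag` argument are illustrative names that do not come from this page, and the exact model tag to pass must be taken from wherever the model is hosted.

```python
def transcribe(wav_path, model_tag):
    """Return the best transcript for one audio file (illustrative helper)."""
    # Imported lazily so this module can be loaded without ESPnet installed.
    import soundfile
    from espnet2.bin.asr_inference import Speech2Text

    # Downloads (if needed) and loads the pretrained model for the given tag.
    speech2text = Speech2Text.from_pretrained(model_tag)

    # Read the waveform. Assumption: the file's sample rate matches the rate
    # the model was trained on; resample beforehand if it does not.
    speech, sample_rate = soundfile.read(wav_path)

    # The call returns an n-best list; each entry starts with the text.
    nbests = speech2text(speech)
    text, *_ = nbests[0]
    return text
```

A call would look like `transcribe("speech.wav", model_tag)`, where both the file name and the tag are placeholders. Because mini-an4 is a very small dataset, transcripts from this model should be treated as a pipeline check, not as production-quality output.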

Model Features

End-to-end speech recognition
Adopts an end-to-end architecture, directly converting speech signals to text
Based on ESPnet framework
Trained using ESPnet, a mature end-to-end speech processing toolkit
BPE tokenization
Uses Byte Pair Encoding (BPE) for text processing
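To make the BPE idea concrete, here is a toy sketch of how merge rules can be learned and applied. It is not the tokenizer shipped with this model: the page does not state which BPE implementation or vocabulary size was used, and the function names (`learn_bpe`, `merge_pair`, `encode`) are made up for illustration.

```python
from collections import Counter


def merge_pair(symbols, pair):
    """Replace every adjacent occurrence of `pair` with one merged symbol."""
    out, i = [], 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def learn_bpe(words, num_merges):
    """Learn up to `num_merges` merge rules from a list of words."""
    # Each distinct word starts as a tuple of characters with a frequency.
    vocab = Counter(tuple(w) for w in words)
    merges = []
    for _ in range(num_merges):
        # Count adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        # Most frequent pair; ties are broken by the pair itself so the
        # result is deterministic.
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            new_vocab[merge_pair(symbols, best)] += freq
        vocab = new_vocab
    return merges


def encode(word, merges):
    """Split a word into subword units by replaying the learned merges."""
    symbols = tuple(word)
    for pair in merges:
        symbols = merge_pair(symbols, pair)
    return list(symbols)


merges = learn_bpe(["low", "low", "lower", "lowest"], num_merges=2)
tokens = encode("lowest", merges)
```

Frequent character sequences such as "low" end up as single units, while rarer endings stay split into smaller pieces. This is what lets a BPE vocabulary cover unseen words with a fixed number of tokens.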

Model Capabilities

English speech recognition
End-to-end speech-to-text

Use Cases

Speech transcription
Meeting transcription
Automatically converts English meeting recordings into text transcripts
Voice command recognition
Transcribes English voice commands into text, which downstream software can then map to executable commands
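The recognizer itself only produces text, so a voice-command application needs a separate step that maps each transcript to an action. The sketch below shows one simple way to do that with phrase matching. The command table, the action names, and the `normalize` and `match_command` helpers are all hypothetical and only illustrate the idea.

```python
import re

# Hypothetical command table: spoken phrase -> action identifier.
COMMANDS = {
    "lights on": "LIGHTS_ON",
    "lights off": "LIGHTS_OFF",
    "stop": "STOP",
}


def normalize(transcript):
    """Lowercase the transcript and strip punctuation and extra spaces."""
    text = transcript.lower()
    text = re.sub(r"[^a-z0-9' ]+", " ", text)
    return " ".join(text.split())


def match_command(transcript):
    """Return the action for the first matching phrase, or None."""
    text = normalize(transcript)
    # Try longer phrases first so a more specific command wins.
    for phrase in sorted(COMMANDS, key=len, reverse=True):
        if re.search(rf"\b{re.escape(phrase)}\b", text):
            return COMMANDS[phrase]
    return None
```

Matching on whole words avoids false triggers such as "nonstop" firing the "stop" command. A real system would usually add confidence thresholds or fuzzy matching to tolerate recognition errors.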