Wav2vec2 Conformer Rel Pos Large 960h Ft
Apache-2.0
A Wav2Vec2-Conformer model with relative position embeddings, pre-trained and fine-tuned on 960 hours of LibriSpeech speech audio sampled at 16 kHz.
Speech Recognition
Transformers English
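
The description above implies that input audio must be sampled at 16 kHz before it reaches the model. A minimal sketch of how transcription might look with the Transformers library follows. It rests on assumptions the card does not state: the Hub identifier in `MODEL_ID` is a guess based on the model name, and the fine-tuned checkpoint is assumed to load with `Wav2Vec2ConformerForCTC` and `Wav2Vec2Processor`. The helper names `check_sample_rate` and `transcribe` are hypothetical.

```python
# Assumed Hub identifier, inferred from the model name; the card does not state it.
MODEL_ID = "facebook/wav2vec2-conformer-rel-pos-large-960h-ft"
# From the card: the model is based on 16 kHz sampled speech audio.
EXPECTED_SAMPLE_RATE = 16_000


def check_sample_rate(sample_rate: int) -> None:
    """Raise ValueError if the audio is not at the rate the model expects."""
    if sample_rate != EXPECTED_SAMPLE_RATE:
        raise ValueError(
            f"expected {EXPECTED_SAMPLE_RATE} Hz audio, got {sample_rate} Hz; "
            "resample first"
        )


def transcribe(waveform, sample_rate: int, model_id: str = MODEL_ID) -> str:
    """Transcribe a mono waveform (1-D float array) to English text."""
    check_sample_rate(sample_rate)

    # Imported lazily so the helper above works without the heavy dependencies.
    import torch
    from transformers import Wav2Vec2ConformerForCTC, Wav2Vec2Processor

    # Assumption: the checkpoint ships a processor and a CTC head.
    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ConformerForCTC.from_pretrained(model_id)
    model.eval()

    inputs = processor(waveform, sampling_rate=sample_rate, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits

    # Greedy decoding: take the most likely token per frame; the processor
    # then collapses repeated tokens and removes blanks.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

Calling `transcribe(waveform, 16_000)` downloads the checkpoint on first use, so it needs network access and the `torch` and `transformers` packages. Audio recorded at another rate should be resampled to 16 kHz beforehand, since the sketch rejects it rather than resampling silently.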