Wav2vec2 Xls R 300m Timit Phoneme
Apache-2.0
This is an automatic phoneme recognition model, obtained by fine-tuning facebook/wav2vec2-xls-r-300m on the TIMIT dataset. It is primarily used for phoneme-level recognition of English speech.
Speech Recognition
Transformers English
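
The card does not describe how frame-level predictions become a phoneme sequence. The sketch below illustrates one common approach, greedy CTC decoding, on the assumption that this fine-tune uses a CTC output head, as wav2vec2 fine-tunes typically do. The vocabulary, blank index, and function name `ctc_greedy_decode` are hypothetical placeholders for illustration. The real model ships its own label set, which may differ.

```python
from itertools import groupby

# Hypothetical TIMIT-style label set; index 0 is assumed to be the CTC blank.
VOCAB = ["<pad>", "hh", "eh", "l", "ow"]
BLANK_ID = 0


def ctc_greedy_decode(frame_ids, vocab=VOCAB, blank_id=BLANK_ID):
    """Turn per-frame argmax label ids into a phoneme sequence.

    Step 1: collapse runs of identical consecutive labels.
    Step 2: drop the blank label.
    """
    collapsed = [label for label, _ in groupby(frame_ids)]
    return [vocab[i] for i in collapsed if i != blank_id]


# Made-up per-frame predictions for illustration only.
frames = [0, 1, 1, 0, 2, 2, 3, 3, 0, 4, 4]
phonemes = ctc_greedy_decode(frames)
print(" ".join(phonemes))
```

A blank between two identical labels keeps them as separate phonemes, while an unbroken run of the same label collapses to one. This is why the order of the two steps matters.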