# Multitask speech processing
Canary 1b Flash
NVIDIA NeMo Canary Flash is a family of multilingual multitask models that achieves state-of-the-art performance across multiple speech benchmarks. Supports automatic speech recognition and translation tasks in four languages.
Speech Recognition Supports Multiple Languages
C
nvidia
125.22k
186
Owsm Ctc V3.2 Ft 1B
OWSM-CTC is an encoder-only speech foundation model based on hierarchical multitask self-conditioned CTC, supporting multilingual speech recognition, speech translation, and language identification.
Speech Recognition Other
O
espnet
110
4
Featured Recommended AI Models