AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Multitask speech processing

# Multitask speech processing

Canary 1b Flash
NVIDIA NeMo Canary Flash is a family of multilingual multitask models that achieves state-of-the-art performance across multiple speech benchmarks. Supports automatic speech recognition and translation tasks in four languages.
Speech Recognition Supports Multiple Languages
C
nvidia
125.22k
186
Owsm Ctc V3.2 Ft 1B
OWSM-CTC is an encoder-only speech foundation model based on hierarchical multitask self-conditioned CTC, supporting multilingual speech recognition, speech translation, and language identification.
Speech Recognition Other
O
espnet
110
4
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase