C

Canary 180m Flash

Developed by nvidia
NVIDIA NeMo Canary Flash is a multilingual multitask speech model supporting automatic speech recognition and translation tasks in English, German, French, and Spanish.
Downloads 15.17k
Release Time : 3/11/2025

Model Overview

Canary 180M Flash is a multilingual multitask model based on the Canary architecture, achieving state-of-the-art performance in multiple speech benchmarks. It supports automatic speech-to-text recognition (ASR) in 4 languages and translation between multiple languages.

Model Features

Multilingual Support
Supports speech recognition and translation in four languages: English, German, French, and Spanish
Multitask Processing
Capable of handling both automatic speech recognition and automatic speech translation tasks simultaneously
Timestamp Functionality
Provides experimental word-level and segment-level timestamp features
Efficient Inference
Achieves over 1200 RTFx inference speed, suitable for real-time applications

Model Capabilities

Speech Recognition
Speech Translation
Timestamp Generation
Multilingual Processing

Use Cases

Speech Transcription
Meeting Minutes
Automatically transcribes meeting recordings into text
Supports accurate transcription in multiple languages
Subtitle Generation
Automatically generates subtitles for video content
Can produce subtitles with timestamps
Speech Translation
Real-time Translation
Translates speech from one language to another in real-time
Supports mutual translation between multiple languages
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase