C

Canary 1b Flash

Developed by nvidia
NVIDIA NeMo Canary Flash is a family of multilingual multitask models that achieves state-of-the-art performance across multiple speech benchmarks. Supports automatic speech recognition and translation tasks in four languages.
Downloads 125.22k
Release Time : 3/7/2025

Model Overview

Canary 1B Flash is a multilingual multitask model based on the Canary architecture, supporting automatic speech-to-text recognition (ASR) for English, German, French, and Spanish, as well as translation between these languages. The model also provides experimental timestamp functionality.

Model Features

Multilingual support
Supports speech recognition and translation in four languages: English, German, French, and Spanish
Multitask capability
Simultaneously supports automatic speech recognition and speech translation tasks
Timestamp functionality
Provides experimental word-level and segment-level timestamp functionality
Efficient inference
Achieves inference speeds exceeding 1000 RTFx on the open-asr-leaderboard dataset

Model Capabilities

English speech recognition
German speech recognition
French speech recognition
Spanish speech recognition
English-German translation
English-French translation
English-Spanish translation
German-English translation
French-English translation
Spanish-English translation
Timestamp generation

Use Cases

Speech transcription
Meeting minutes
Automatically transcribe meeting recordings into text
Supports accurate transcription in four languages
Subtitle generation
Generate subtitles for video content
Can generate subtitles with timestamps
Speech translation
Real-time translation
Translate speech from one language to text in another language in real time
Supports mutual translation between four languages
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase