
Canary-1B

Developed by NVIDIA
Canary-1B is a multilingual multi-task model developed by NVIDIA NeMo, supporting automatic speech recognition and speech translation tasks in English, German, French, and Spanish.
Downloads 7,734
Release Time: 2/7/2024

Model Overview

Canary-1B is an encoder-decoder model that pairs a FastConformer encoder with a Transformer decoder. It is designed for high-accuracy automatic speech recognition (ASR) and speech-to-text translation (AST).

Model Features

Multilingual support
Supports speech recognition and translation in four languages: English, German, French, and Spanish
Multi-task capability
A single model handles both automatic speech recognition and speech-to-text translation
High performance
Achieves state-of-the-art results on multiple benchmarks
Flexible configuration
Allows selection of output with or without punctuation and capitalization (PnC)
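
The task and PnC options listed above are typically chosen per request. The sketch below shows one way such a request could be recorded as a JSON-lines manifest entry. The field names (audio_filepath, taskname, source_lang, target_lang, pnc), the task values, and the helper make_manifest_entry are assumptions made for illustration and are not taken from this page; check them against the NeMo documentation before relying on them.

```python
import json


def make_manifest_entry(audio_path, duration, task="asr",
                        source_lang="en", target_lang="en", pnc=True):
    """Build one manifest record for a transcription or translation request.

    All field names and the task values are assumed for illustration;
    they are not confirmed by this page.
    """
    if task not in ("asr", "s2t_translation"):
        raise ValueError(f"unknown task: {task!r}")
    return {
        "audio_filepath": audio_path,
        "duration": duration,          # length of the clip in seconds
        "taskname": task,              # recognition or translation
        "source_lang": source_lang,    # language spoken in the audio
        "target_lang": target_lang,    # language of the output text
        "pnc": "yes" if pnc else "no", # punctuation and capitalization
    }


# English recognition without punctuation and capitalization.
entry = make_manifest_entry("meeting.wav", 12.5, pnc=False)
line = json.dumps(entry)
print(line)
```

Keeping the PnC choice as an explicit per-entry flag means punctuated and unpunctuated requests can share one manifest file.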

Model Capabilities

English speech recognition
German speech recognition
French speech recognition
Spanish speech recognition
English to German translation
English to French translation
English to Spanish translation
German to English translation
French to English translation
Spanish to English translation
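
The ten capabilities above follow a simple pattern: recognition in each of the four languages, plus translation between English and each of the other three in both directions. A minimal sketch of that pattern, using hypothetical helper names (supported_pairs, is_supported) and two-letter language codes chosen for illustration:

```python
LANGUAGES = ("en", "de", "fr", "es")


def supported_pairs():
    """Return (source, target, task) tuples matching the capability list."""
    # Recognition: source and target language are the same.
    pairs = [(lang, lang, "asr") for lang in LANGUAGES]
    others = [lang for lang in LANGUAGES if lang != "en"]
    # Translation from English into each other language.
    pairs += [("en", lang, "translation") for lang in others]
    # Translation from each other language into English.
    pairs += [(lang, "en", "translation") for lang in others]
    return pairs


def is_supported(source, target):
    """Check whether a source/target combination appears in the list."""
    return any(s == source and t == target for s, t, _ in supported_pairs())


print(len(supported_pairs()))    # 4 recognition + 6 translation entries
print(is_supported("de", "en"))  # German to English is listed
print(is_supported("de", "fr"))  # German to French is not listed
```

Note that pairs not involving English, such as German to French, do not appear in the list above.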

Use Cases

Speech transcription
Meeting record transcription
Converts English meeting recordings into text
Reaches a word error rate (WER) of 2.89% on the LibriSpeech test set
Multilingual subtitle generation
Generate subtitles in multiple languages for video content
Real-time translation
Cross-lingual conference translation
Translates the speaker's speech into text in another language in real time
BLEU scores: 32.15 for English-to-German translation and 40.76 for English-to-French translation
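
For readers unfamiliar with the WER figure quoted in the use cases, the sketch below computes word error rate in its standard form: the word-level edit distance between reference and hypothesis, divided by the number of reference words, expressed as a percentage. The function name word_error_rate and the whitespace tokenization are assumptions for illustration. This page does not describe the text normalization used in the reported evaluation, so results from this sketch are not directly comparable to the published number.

```python
def word_error_rate(reference, hypothesis):
    """Return WER in percent using word-level Levenshtein distance."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing in hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return 100.0 * prev[-1] / len(ref)


# One substituted word out of six reference words.
wer = word_error_rate("the cat sat on the mat", "the cat sat on a mat")
print(round(wer, 2))
```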