F

Fireredasr AED L

Developed by FireRedTeam
FireRedASR is a series of open-source, industrial-grade automatic speech recognition (ASR) models supporting Mandarin, Chinese dialects, and English. It achieves state-of-the-art (SOTA) performance on public Mandarin ASR benchmarks while also excelling in lyrics recognition.
Downloads 216
Release Time : 1/24/2025

Model Overview

To meet diverse application needs for superior performance and optimal efficiency, FireRedASR offers two variants: FireRedASR-LLM and FireRedASR-AED. The former adopts an encoder-adapter-large language model framework, aiming for SOTA performance and supporting end-to-end speech interaction. The latter is based on an attention-based encoder-decoder architecture, balancing high performance with computational efficiency, serving as an efficient speech representation module in LLM-based speech models.

Model Features

Multilingual support
Supports automatic speech recognition for Mandarin, Chinese dialects, and English
Industrial-grade performance
Achieves SOTA level on public Mandarin ASR benchmarks
Excellent lyrics recognition
Delivers outstanding performance in lyrics recognition
Two architecture options
Offers both LLM and AED architectures to meet diverse scenario requirements

Model Capabilities

Mandarin speech recognition
Chinese dialect speech recognition
English speech recognition
Lyrics recognition

Use Cases

Speech-to-text
Meeting transcription
Convert meeting recordings into text transcripts
4.67% CER on the ws_meeting dataset
Voice assistant
Used as the speech recognition module in smart voice assistants
Multimedia processing
Subtitle generation
Automatically generate subtitles for video content
Lyrics recognition
Identify lyrics from music
Delivers outstanding lyrics recognition performance
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase