Reverb Asr
Rev's Reverb ASR model is trained on 200,000 hours of professionally transcribed English speech data, making it one of the most accurate open-source automatic speech recognition systems for English.
Downloads 17
Release Time : 8/26/2024
Model Overview
Reverb ASR is an efficient automatic speech recognition system that can run on CPU or GPU, allowing users to customize the verbatim fidelity of the output text.
Model Features
High-quality training data
Trained on 200,000 hours of professionally transcribed English speech data, it is currently the largest manually transcribed audio corpus used for training open-source models.
Adjustable transcription style
Control the output style with the verbatimicity parameter, supporting fully verbatim, non-verbatim, or intermediate transcription styles.
Multiple decoding modes
Supports various decoding modes, including attention, ctc_greedy_search, ctc_prefix_beam_search, attention_rescoring, and joint_decoding.
Efficient architecture
The model architecture is efficient and can run on CPU or GPU, making it suitable for various application scenarios.
Model Capabilities
English speech recognition
Verbatim transcription
Non-verbatim transcription
Semi-verbatim transcription
Use Cases
Speech transcription
Professional meeting minutes
Used for recording professional meeting content, supporting fully verbatim transcription to ensure no details are missed.
High-precision transcription, suitable for occasions requiring complete records.
Audio editing
Ideal for audio editing scenarios, generating clear and readable transcriptions or fully verbatim records.
Adjustable transcription style to meet different editing needs.
Speech analysis
Spoken language analysis
Used to analyze hesitations, repetitions, and corrections in spoken language, helping to improve oral expression.
Provides detailed verbatim records for analyzing speech habits.
Featured Recommended AI Models