W

Wav2vec2 Xls R 300m En Atc Atcosim

Developed by Jzuluaga
This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the ATCOSIM corpus, specifically designed for automatic speech recognition tasks in air traffic control communications.
Downloads 104
Release Time : 11/16/2022

Model Overview

An automatic speech recognition (ASR) model optimized for Air Traffic Control (ATC) communications, demonstrating excellent performance in domain shift scenarios.

Model Features

Strong Domain Adaptability
Optimized for the specific scenarios of air traffic control communications, maintaining high performance even under domain shifts.
Efficient Fine-tuning
Significantly improves recognition accuracy in the ATC domain with only a small amount of labeled data.
Noise Robustness
Maintains stable performance in challenging environments with signal-to-noise ratios of 5-20dB.

Model Capabilities

Air traffic control speech recognition
English speech-to-text
Noisy environment speech processing

Use Cases

Air Traffic Control
Control Command Recognition
Converts voice communications between pilots and controllers into text
Reduces word error rate by 20-40% compared to traditional methods
Communication Log Analysis
Automatically transcribes ATC communications for subsequent analysis and archiving
Achieves a word error rate of 7.36% on the test set
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase