Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding

05/02/2023
by   Juan Zuluaga-Gomez, et al.
0

Voice communication between air traffic controllers (ATCos) and pilots is critical for ensuring safe and efficient air traffic control (ATC). This task requires high levels of awareness from ATCos and can be tedious and error-prone. Recent attempts have been made to integrate artificial intelligence (AI) into ATC in order to reduce the workload of ATCos. However, the development of data-driven AI systems for ATC demands large-scale annotated datasets, which are currently lacking in the field. This paper explores the lessons learned from the ATCO2 project, a project that aimed to develop a unique platform to collect and preprocess large amounts of ATC data from airspace in real time. Audio and surveillance data were collected from publicly accessible radio frequency channels with VHF receivers owned by a community of volunteers and later uploaded to Opensky Network servers, which can be considered an "unlimited source" of data. In addition, this paper reviews previous work from ATCO2 partners, including (i) robust automatic speech recognition, (ii) natural language processing, (iii) English language identification of ATC communications, and (iv) the integration of surveillance data such as ADS-B. We believe that the pipeline developed during the ATCO2 project, along with the open-sourcing of its data, will encourage research in the ATC field. A sample of the ATCO2 corpus is available on the following website: https://www.atco2.org/data, while the full corpus can be purchased through ELDA at http://catalog.elra.info/en-us/repository/browse/ELRA-S0484. We demonstrated that ATCO2 is an appropriate dataset to develop ASR engines when little or near to no ATC in-domain data is available. For instance, with the CNN-TDNNf kaldi model, we reached the performance of as low as 17.9 WER on public ATC datasets which is 6.6/7.6 supervised CNN-TDNNf model.

READ FULL TEXT

page 5

page 8

page 11

page 12

page 16

page 17

page 20

page 21

research
06/18/2020

Automatic Speech Recognition Benchmark for Air-Traffic Communications

Advances in Automatic Speech Recognition (ASR) over the last decade open...
research
12/14/2022

Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator

This paper describes a simple yet efficient repetition-based modular sys...
research
04/16/2023

A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers

In this paper we propose a novel virtual simulation-pilot engine for spe...
research
03/06/2022

Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset

Automatic speech recognition (ASR) on low resource languages improves t...
research
02/08/2022

A two-step approach to leverage contextual data: speech recognition in air-traffic communications

Automatic Speech Recognition (ASR), as the assistance of speech communic...
research
10/30/2018

The Airbus Air Traffic Control speech recognition 2018 challenge: towards ATC automatic transcription and call sign detection

In this paper, we describe the outcomes of the challenge organized and r...

Please sign up or login with your details

Forgot password? Click here to reset