Constrained Contextual Bandit Learning for Adaptive Radar Waveform Selection

03/09/2021
by Charles E. Thornton, et al.

This paper studies a sequential decision process in which an adaptive radar system repeatedly interacts with a finite-state target channel. The radar can passively sense the spectrum at regular intervals, which provides side information for the waveform selection process. The radar transmitter uses the sequence of spectrum observations, together with feedback from a collocated receiver, to select waveforms that yield accurate estimates of target parameters. It is shown that the waveform selection problem can be effectively addressed using a linear contextual bandit formulation in a manner that is both computationally feasible and sample efficient. Stochastic and adversarial linear contextual bandit models are introduced, allowing the radar to achieve effective performance in broad classes of physical environments. Simulations in a radar-communication coexistence scenario, as well as in an adversarial radar-jammer scenario, demonstrate that the proposed formulation provides a substantial improvement in target detection performance when Thompson Sampling and EXP3 algorithms are used to drive the waveform selection process. Further, it is shown that the harmful impacts of pulse-agile behavior on coherently processed radar data can be mitigated by adopting a time-varying constraint on the radar's waveform catalog.
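
To make the linear contextual bandit idea concrete, the following is a minimal sketch of Thompson Sampling with one Bayesian linear model per waveform. It is not the authors' implementation. The class name `LinearThompsonSampler`, the parameters `noise_scale` and `allowed`, the toy environment in `run_demo`, and its reward probabilities are all assumptions made for illustration. The `allowed` argument is a hypothetical stand-in for a time-varying constraint on the waveform catalog: the caller passes the subset of waveform indices that may be used on the current pulse.

```python
import numpy as np


class LinearThompsonSampler:
    """Per-waveform Bayesian linear regression with posterior sampling."""

    def __init__(self, n_waveforms, context_dim, noise_scale=0.5, seed=0):
        self.rng = np.random.default_rng(seed)
        self.noise_scale = noise_scale  # assumed posterior scaling factor
        # B[a] is the regularized design matrix of waveform a (starts as identity).
        self.B = np.stack([np.eye(context_dim) for _ in range(n_waveforms)])
        # f[a] accumulates reward-weighted contexts for waveform a.
        self.f = np.zeros((n_waveforms, context_dim))

    def select(self, context, allowed=None):
        """Sample a parameter vector per candidate and pick the best score."""
        candidates = range(self.f.shape[0]) if allowed is None else allowed
        best_arm, best_score = None, -np.inf
        for a in candidates:
            cov = np.linalg.inv(self.B[a])
            mean = cov @ self.f[a]
            # Draw theta ~ N(mean, noise_scale^2 * cov) via a Cholesky factor.
            chol = np.linalg.cholesky(cov)
            z = self.rng.standard_normal(mean.shape[0])
            theta = mean + self.noise_scale * (chol @ z)
            score = float(context @ theta)
            if score > best_score:
                best_arm, best_score = a, score
        return best_arm

    def update(self, arm, context, reward):
        """Rank-one posterior update for the waveform that was transmitted."""
        self.B[arm] += np.outer(context, context)
        self.f[arm] += reward * context


def run_demo(n_rounds=2000, seed=0):
    """Toy environment (an assumption): 3 interference states, 3 waveforms.

    The context is a one-hot encoding of the sensed interference state, and
    waveform (state + 1) % 3 succeeds with probability 0.9, others with 0.1.
    Returns the fraction of best-waveform choices over the last 500 rounds.
    """
    env_rng = np.random.default_rng(seed)
    agent = LinearThompsonSampler(n_waveforms=3, context_dim=3, seed=seed + 1)
    hits = []
    for _ in range(n_rounds):
        state = env_rng.integers(3)
        context = np.eye(3)[state]
        arm = agent.select(context)
        best = (state + 1) % 3
        reward = float(env_rng.random() < (0.9 if arm == best else 0.1))
        agent.update(arm, context, reward)
        hits.append(arm == best)
    return float(np.mean(hits[-500:]))
```

Calling `agent.select(context, allowed=[1, 2])` restricts the choice to waveforms 1 and 2 for that pulse. How the permitted subset should evolve over a coherent processing interval is not specified in the abstract, so that rule is left to the caller here.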

research
10/29/2020

Constrained Online Learning to Mitigate Distortion Effects in Pulse-Agile Cognitive Radar

Pulse-agile radar systems have demonstrated favorable performance in dyn...
research
08/02/2021

Waveform Selection for Radar Tracking in Target Channels With Memory via Universal Learning

In tracking radar, the sensing environment often varies significantly ov...
research
10/21/2021

Online Meta-Learning for Scene-Diverse Waveform-Agile Radar Target Tracking

A fundamental problem for waveform-agile radar systems is that the true ...
research
02/10/2022

Universal Learning Waveform Selection Strategies for Adaptive Target Tracking

Online selection of optimal waveforms for target tracking with active se...
research
01/30/2021

Multi-player Bandits for Distributed Cognitive Radar

With new applications for radar networks such as automotive control or i...
research
08/24/2020

Efficient Online Learning for Cognitive Radar-Cellular Coexistence via Contextual Thompson Sampling

This paper describes a sequential, or online, learning scheme for adapti...
research
07/07/2022

Online Bayesian Meta-Learning for Cognitive Tracking Radar

A key component of cognitive radar is the ability to generalize, or achi...