Lifelong Learning for Minimizing Age of Information in Internet of Things Networks

by Zhenzhen Gong, et al.

In this paper, a lifelong learning problem is studied for an Internet of Things (IoT) system. In the considered model, each IoT device aims to balance the tradeoff between its information freshness and its energy consumption by controlling its computational resource allocation at each time slot under dynamic environments. An unmanned aerial vehicle (UAV) is deployed as a flying base station to enable the IoT devices to adapt to novel environments. To this end, a new lifelong reinforcement learning algorithm, run by the UAV, is proposed to adapt the operation of the devices at each visit by the UAV. By using the experience from previously visited devices and environments, the UAV can help devices adapt faster to future states of their environment. To do so, a knowledge base shared by all devices is maintained at the UAV. Simulation results show that the proposed algorithm converges 25% to 50% faster than a policy gradient baseline algorithm that optimizes each device's decision-making problem in isolation.
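The freshness-versus-energy tradeoff mentioned above can be illustrated with a toy calculation. The sketch below is not the paper's system model, which the abstract does not specify. It assumes the common discrete-time convention in which a device's age of information grows by one each slot and resets to one when a fresh update is produced, and it assumes a fixed, hypothetical energy cost per update. The function name `simulate_aoi` and its parameters are illustrative only.

```python
def simulate_aoi(update_slots, num_slots, energy_per_update=1.0):
    """Toy single-device model (an assumption, not the paper's model).

    update_slots: set of slot indices in which the device produces an update.
    Returns (average age over the horizon, total energy spent).
    """
    age = 1            # age at the start of the horizon (assumed)
    total_age = 0
    total_energy = 0.0
    for t in range(num_slots):
        if t in update_slots:
            age = 1                          # fresh update resets the age
            total_energy += energy_per_update
        else:
            age += 1                         # information grows staler
        total_age += age
    return total_age / num_slots, total_energy


# Updating every slot keeps information fresh but costs the most energy;
# never updating costs nothing but lets the age grow without bound.
always = simulate_aoi(set(range(4)), 4)
never = simulate_aoi(set(), 4)
sometimes = simulate_aoi({0, 2}, 4)
```

Under these assumptions, more frequent updates lower the average age at the price of higher energy use, which is the balance each device's policy has to strike.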





Deep Reinforcement Learning for Delay-Oriented IoT Task Scheduling in Space-Air-Ground Integrated Network

In this paper, we investigate a computing task scheduling problem in spa...

Reinforcement Learning for Minimizing Age of Information in Real-time Internet of Things Systems with Realistic Physical Dynamics

In this paper, the problem of minimizing the weighted sum of age of info...

RIS-assisted UAV Communications for IoT with Wireless Power Transfer Using Deep Reinforcement Learning

Many of the devices used in Internet-of-Things (IoT) applications are en...

AoI-minimizing Scheduling in UAV-relayed IoT Networks

Due to flexibility, autonomy and low operational cost, unmanned aerial v...

Meta-Reinforcement Learning for Trajectory Design in Wireless UAV Networks

In this paper, the design of an optimal trajectory for an energy-constra...

Multi-Agent Deep Stochastic Policy Gradient for Event Based Dynamic Spectrum Access

We consider the dynamic spectrum access (DSA) problem where K Internet o...

Joint Cluster Head Selection and Trajectory Planning in UAV-Aided IoT Networks by Reinforcement Learning with Sequential Model

Employing unmanned aerial vehicles (UAVs) has attracted growing interest...