Value Iteration Networks with Gated Summarization Module

05/11/2023
by   Jinyu Cai, et al.
0

In this paper, we address the challenges faced by Value Iteration Networks (VIN) in handling larger input maps and mitigating the impact of accumulated errors caused by increased iterations. We propose a novel approach, Value Iteration Networks with Gated Summarization Module (GS-VIN), which incorporates two main improvements: (1) employing an Adaptive Iteration Strategy in the Value Iteration module to reduce the number of iterations, and (2) introducing a Gated Summarization module to summarize the iterative process. The adaptive iteration strategy uses larger convolution kernels with fewer iteration times, reducing network depth and increasing training stability while maintaining the accuracy of the planning process. The gated summarization module enables the network to emphasize the entire planning process, rather than solely relying on the final global planning outcome, by temporally and spatially resampling the entire planning process within the VI module. We conduct experiments on 2D grid world path-finding problems and the Atari Mr. Pac-man environment, demonstrating that GS-VIN outperforms the baseline in terms of single-step accuracy, planning success rate, and overall performance across different map sizes. Additionally, we provide an analysis of the relationship between input size, kernel size, and the number of iterations in VI-based models, which is applicable to a majority of VI-based models and offers valuable insights for researchers and industrial deployment.

READ FULL TEXT

page 1

page 3

page 6

page 7

page 9

page 11

research
06/08/2017

Generalized Value Iteration Networks: Life Beyond Lattices

In this paper, we introduce a generalized value iteration network (GVIN)...
research
04/29/2021

Capability Iteration Network for Robot Path Planning

Path planning is an important topic in robotics. Recently, value iterati...
research
06/17/2018

Gated Path Planning Networks

Value Iteration Networks (VINs) are effective differentiable path planni...
research
05/28/2018

Value Propagation Networks

We present Value Propagation (VProp), a parameter-efficient differentiab...
research
01/28/2022

Planning and Learning with Adaptive Lookahead

The classical Policy Iteration (PI) algorithm alternates between greedy ...
research
05/27/2019

Value Iteration Networks on Multiple Levels of Abstraction

Learning-based methods are promising to plan robot motion without perfor...
research
07/20/2021

Into Summarization Techniques for IoT Data Discovery Routing

In this paper, we consider the IoT data discovery problem in very large ...

Please sign up or login with your details

Forgot password? Click here to reset