Task-Relevant Object Discovery and Categorization for Playing First-person Shooter Games

06/17/2018
by   Junchi Liang, et al.
0

We consider the problem of learning to play first-person shooter (FPS) video games using raw screen images as observations and keyboard inputs as actions. The high-dimensionality of the observations in this type of applications leads to prohibitive needs of training data for model-free methods, such as the deep Q-network (DQN), and its recurrent variant DRQN. Thus, recent works focused on learning low-dimensional representations that may reduce the need for data. This paper presents a new and efficient method for learning such representations. Salient segments of consecutive frames are detected from their optical flow, and clustered based on their feature descriptors. The clusters typically correspond to different discovered categories of objects. Segments detected in new frames are then classified based on their nearest clusters. Because only a few categories are relevant to a given task, the importance of a category is defined as the correlation between its occurrence and the agent's performance. The result is encoded as a vector indicating objects that are in the frame and their locations, and used as a side input to DRQN. Experiments on the game Doom provide a good evidence for the benefit of this approach.

READ FULL TEXT

page 1

page 4

page 7

research
06/22/2022

Optical Flow Regularization of Implicit Neural Representations for Video Frame Interpolation

Recent works have shown the ability of Implicit Neural Representations (...
research
04/22/2022

Deep Reinforcement Learning Using a Low-Dimensional Observation Filter for Visual Complex Video Game Playing

Deep Reinforcement Learning (DRL) has produced great achievements since ...
research
11/10/2020

A Self-supervised Learning System for Object Detection in Videos Using Random Walks on Graphs

This paper presents a new self-supervised system for learning to detect ...
research
05/16/2018

Fast Retinomorphic Event Stream for Video Recognition and ReinforcementLearning

Good temporal representations are crucial for video understanding, and t...
research
12/17/2016

Learning to predict where to look in interactive environments using deep recurrent q-learning

Bottom-Up (BU) saliency models do not perform well in complex interactiv...
research
07/27/2023

Using Gameplay Videos for Detecting Issues in Video Games

Context. The game industry is increasingly growing in recent years. Ever...

Please sign up or login with your details

Forgot password? Click here to reset