Content Masked Loss: Human-Like Brush Stroke Planning in a Reinforcement Learning Painting Agent

12/18/2020
by   Peter Schaldenbrand, et al.
8

The objective of most Reinforcement Learning painting agents is to minimize the loss between a target image and the paint canvas. Human painter artistry emphasizes important features of the target image rather than simply reproducing it (DiPaola 2007). Using adversarial or L2 losses in the RL painting models, although its final output is generally a work of finesse, produces a stroke sequence that is vastly different from that which a human would produce since the model does not have knowledge about the abstract features in the target image. In order to increase the human-like planning of the model without the use of expensive human data, we introduce a new loss function for use with the model's reward function: Content Masked Loss. In the context of robot painting, Content Masked Loss employs an object detection model to extract features which are used to assign higher weight to regions of the canvas that a human would find important for recognizing content. The results, based on 332 human evaluators, show that the digital paintings produced by our Content Masked model show detectable subject matter earlier in the stroke sequence than existing methods without compromising on the quality of the final painting.

READ FULL TEXT

page 2

page 4

page 5

page 6

page 7

research
06/27/2019

Demonstration-Guided Deep Reinforcement Learning of Control Policies for Dexterous Human-Robot Interaction

In this paper, we propose a method for training control policies for hum...
research
12/15/2017

Impossibility of deducing preferences and rationality from human policy

Inverse reinforcement learning (IRL) attempts to infer human rewards or ...
research
05/15/2017

Repeated Inverse Reinforcement Learning

We introduce a novel repeated Inverse Reinforcement Learning problem: th...
research
09/23/2020

What is the Reward for Handwriting? – Handwriting Generation by Imitation Learning

Analyzing the handwriting generation process is an important issue and h...
research
12/19/2022

Inverse Reinforcement Learning for Text Summarization

Current state-of-the-art summarization models are trained with either ma...
research
11/27/2018

Quality-Aware Multimodal Saliency Detection via Deep Reinforcement Learning

Incorporating various modes of information into the machine learning pro...
research
07/08/2023

Improving Prototypical Part Networks with Reward Reweighing, Reselection, and Retraining

In recent years, work has gone into developing deep interpretable method...

Please sign up or login with your details

Forgot password? Click here to reset