Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads

06/12/2016
by   Ji He, et al.
0

We introduce an online popularity prediction and tracking task as a benchmark task for reinforcement learning with a combinatorial, natural language action space. A specified number of discussion threads predicted to be popular are recommended, chosen from a fixed window of recent comments to track. Novel deep reinforcement learning architectures are studied for effective modeling of the value function associated with actions comprised of interdependent sub-actions. The proposed model, which represents dependence between sub-actions through a bi-directional LSTM, gives the best performance across different experimental configurations and domains, and it also generalizes well with varying numbers of recommendation requests.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/20/2017

Reinforcement Learning with External Knowledge and Two-Stage Q-functions for Predicting Popular Reddit Threads

This paper addresses the problem of predicting popularity of comments in...
research
11/14/2015

Deep Reinforcement Learning with a Natural Language Action Space

This paper introduces a novel architecture for reinforcement learning wi...
research
10/22/2020

Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing

Value-function-based methods have long played an important role in reinf...
research
10/29/2021

Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning

Discovering a solution in a combinatorial space is prevalent in many rea...
research
02/04/2019

The Natural Language of Actions

We introduce Act2Vec, a general framework for learning context-based act...
research
03/07/2018

Extracting Action Sequences from Texts Based on Deep Reinforcement Learning

Extracting action sequences from texts in natural language is challengin...
research
11/11/2020

Adaptive Neural Architectures for Recommender Systems

Deep learning has proved an effective means to capture the non-linear as...

Please sign up or login with your details

Forgot password? Click here to reset