Reinforcement Learning with Policy Mixture Model for Temporal Point Processes Clustering

05/29/2019
by   Weichang Wu, et al.
4

Temporal point process is an expressive tool for modeling event sequences over time. In this paper, we take a reinforcement learning view whereby the observed sequences are assumed to be generated from a mixture of latent policies. The purpose is to cluster the sequences with different temporal patterns into the underlying policies while learning each of the policy model. The flexibility of our model lies in: i) all the components are networks including the policy network for modeling the intensity function of temporal point process; ii) to handle varying-length event sequences, we resort to inverse reinforcement learning by decomposing the observed sequence into states (RNN hidden embedding of history) and actions (time interval to next event) in order to learn the reward function, thus achieving better performance or increasing efficiency compared to existing methods using rewards over the entire sequence such as log-likelihood or Wasserstein distance. We adopt an expectation-maximization framework with the E-step estimating the cluster labels for each sequence, and the M-step aiming to learn the respective policy. Extensive experiments show the efficacy of our method against state-of-the-arts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/12/2018

Learning Temporal Point Processes via Reinforcement Learning

Social goods, such as healthcare, smart city, and information networks, ...
research
05/23/2021

THP: Topological Hawkes Processes for Learning Granger Causality on Event Sequences

Learning Granger causality among event types on multi-type event sequenc...
research
05/24/2017

Modeling The Intensity Function Of Point Process Via Recurrent Neural Networks

Event sequence, asynchronously generated with random timestamp, is ubiqu...
research
08/11/2023

Reinforcement Logic Rule Learning for Temporal Point Processes

We propose a framework that can incrementally expand the explanatory tem...
research
01/31/2017

A Dirichlet Mixture Model of Hawkes Processes for Event Sequence Clustering

We propose an effective method to solve the event sequence clustering pr...
research
09/04/2019

Meta Learning with Relational Information for Short Sequences

This paper proposes a new meta-learning method -- named HARMLESS (HAwkes...
research
10/28/2019

Learning Latent Process from High-Dimensional Event Sequences via Efficient Sampling

We target modeling latent dynamics in high-dimension marked event sequen...

Please sign up or login with your details

Forgot password? Click here to reset