Automatic Detection of Trends in Dynamical Text: An Evolutionary Approach

by Lourdes Araujo, et al.

This paper presents an evolutionary algorithm for modeling the arrival dates of document streams, that is, time-stamped collections of documents such as newscasts, e-mails, IRC conversations, scientific journal archives and weblog postings. The algorithm assigns frequencies (number of document arrivals per time unit) to time intervals so as to produce an optimal fit to the data. The optimization is a trade-off between fitting the data accurately and avoiding too many frequency changes; in this way the analysis finds fits that ignore noise. Classical dynamic programming algorithms are limited by their memory and efficiency requirements, which can be a problem when dealing with long streams. This suggests exploring alternative search methods that accept some degree of uncertainty in exchange for tractability. Experiments have shown that the designed evolutionary algorithm reaches the same solution quality as those classical dynamic programming algorithms in a shorter time. We have also explored different probabilistic models to optimize the fitting of the date streams, and applied these algorithms to infer whether a new arrival increases or decreases interest in the topic the document stream is about.
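The trade-off described in the abstract can be illustrated with a small sketch. It is not the paper's actual objective: the exponential model for gaps between arrivals, the function name `fit_cost` and the parameter `change_penalty` are all assumptions made for illustration. The cost adds a data term (how poorly the assigned frequencies explain the observed gaps) to a penalty for every change of frequency between consecutive intervals.

```python
import math

def fit_cost(gaps, rates, change_penalty=1.0):
    """Hypothetical cost of assigning one arrival rate to each inter-arrival gap.

    Assumption: each gap is modeled with an exponential density
    rate * exp(-rate * gap); the abstract does not specify the model.
    """
    if len(gaps) != len(rates):
        raise ValueError("need exactly one rate per gap")
    # Data term: negative log-likelihood of every gap under its assigned rate.
    data_cost = sum(r * g - math.log(r) for g, r in zip(gaps, rates))
    # Smoothness term: count the positions where the rate changes.
    changes = sum(1 for a, b in zip(rates, rates[1:]) if a != b)
    return data_cost + change_penalty * changes

# Five gaps between document arrivals; the short middle gap may be noise.
gaps = [5.0, 5.0, 1.0, 5.0, 5.0]
flat = [0.2, 0.2, 0.2, 0.2, 0.2]    # one frequency throughout
bursty = [0.2, 0.2, 1.0, 0.2, 0.2]  # frequency follows the short gap
```

With `change_penalty=0.0` the `bursty` assignment has the lower cost, because it fits the short gap more closely. With `change_penalty=1.0` the `flat` assignment is cheaper, since the two frequency changes cost more than the improvement in fit. A search procedure, whether dynamic programming or an evolutionary algorithm, would then look for the rate assignment that minimizes a cost of this kind.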




