Canonical Trends: Detecting Trend Setters in Web Data

06/27/2012
by   Felix Biessmann, et al.
0

Much information available on the web is copied, reused or rephrased. The phenomenon that multiple web sources pick up certain information is often called trend. A central problem in the context of web data mining is to detect those web sources that are first to publish information which will give rise to a trend. We present a simple and efficient method for finding trends dominating a pool of web sources and identifying those web sources that publish the information relevant to a trend before others. We validate our approach on real data collected from influential technology news feeds.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/11/2011

"Improved FCM algorithm for Clustering on Web Usage Mining"

In this paper we present clustering method is very sensitive to the init...
research
06/20/2011

Intelligent Self-Repairable Web Wrappers

The amount of information available on the Web grows at an incredible hi...
research
10/22/2020

What is Web Scraping: Introduction, Applications and Best Practices

Web scraping typically extracts large amounts of #data from #websites fo...
research
06/24/2011

Wrapper Maintenance: A Machine Learning Approach

The proliferation of online information sources has led to an increased ...
research
10/22/2020

Transform Data Complexity into Profitability through Data Mining Services

Data Mining experts are able to efficiently search and extract data from...
research
05/01/2022

Conventions and Mutual Expectations – understanding sources for web genres

Genres can be understood in many different ways. They are often perceive...
research
07/23/2019

Uncertainty in the MAN Data Calibration & Trend Estimates

We investigate trend identification in the LML and MAN atmospheric ammon...

Please sign up or login with your details

Forgot password? Click here to reset