Evolutionary Biclustering of Clickstream Data

06/12/2011
by   R. Rathipriya, et al.
0

Biclustering is a two way clustering approach involving simultaneous clustering along two dimensions of the data matrix. Finding biclusters of web objects (i.e. web users and web pages) is an emerging topic in the context of web usage mining. It overcomes the problem associated with traditional clustering methods by allowing automatic discovery of browsing pattern based on a subset of attributes. A coherent bicluster of clickstream data is a local browsing pattern such that users in bicluster exhibit correlated browsing pattern through a subset of pages of a web site. This paper proposed a new application of biclustering to web data using a combination of heuristics and meta-heuristics such as K-means, Greedy Search Procedure and Genetic Algorithms to identify the coherent browsing pattern. Experiment is conducted on the benchmark clickstream msnbc dataset from UCI repository. Results demonstrate the efficiency and beneficial outcome of the proposed method by correlating the users and pages of a web site in high degree.This approach shows excellent performance at finding high degree of overlapped coherent biclusters from web data.

READ FULL TEXT
research
03/25/2011

User Modeling Combining Access Logs, Page Content and Semantics

The paper proposes an approach to modeling users of large Web sites base...
research
03/30/2021

Local and Global Topics in Text Modeling of Web Pages Nested in Web Sites

Topic models are popular models for analyzing a collection of text docum...
research
04/27/2018

Modified Apriori Graph Algorithm for Frequent Pattern Mining

Web Usage Mining is an application of Data Mining Techniques to discover...
research
05/08/2014

Integrating Vague Association Mining with Markov Model

The increasing demand of world wide web raises the need of predicting th...
research
04/26/2021

Boolean Reasoning-Based Biclustering for Shifting Pattern Extraction

Biclustering is a powerful approach to search for patterns in data, as i...
research
04/27/2018

Extracting Parallel Paragraphs from Common Crawl

Most of the current methods for mining parallel texts from the web assum...
research
09/06/2011

An Efficient Preprocessing Methodology for Discovering Patterns and Clustering of Web Users using a Dynamic ART1 Neural Network

In this paper, a complete preprocessing methodology for discovering patt...

Please sign up or login with your details

Forgot password? Click here to reset