Web Page Categorization Using Artificial Neural Networks

09/25/2010
by S. M. Kamruzzaman, et al.

Web page categorization is one of the challenging tasks in the world of ever-increasing web technologies. Web pages can be categorized in many ways, based on different approaches and features. This paper proposes a new approach to web page categorization that uses an artificial neural network (ANN) with automatically extracted features. Eight major categories of web pages have been selected for categorization: business & economy, education, government, entertainment, sports, news & media, job search, and science. The proposed system works in three successive stages. In the first stage, the features are extracted automatically by analyzing the source of the web pages. The second stage fixes the input values of the neural network; all values lie between 0 and 1, and variations in those values affect the output. Finally, the third stage determines the class of a given web page out of the eight predefined classes, using the backpropagation algorithm of the artificial neural network. The proposed concept will facilitate web mining, information retrieval from the web, and search engines.
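The three stages described in the abstract can be illustrated with a minimal sketch. The abstract does not say which features are extracted or how the network is sized, so everything specific here is an assumption made for illustration: the keyword lists in `KEYWORDS`, the max-count scaling in `normalize`, the `TinyMLP` class with six hidden units, and the learning rate are all hypothetical and are not the authors' method.

```python
import math
import random
import re

CATEGORIES = ["business & economy", "education", "government", "entertainment",
              "sports", "news & media", "job search", "science"]

# Hypothetical keyword cues, one list per category (same order as CATEGORIES).
KEYWORDS = [["market", "finance"], ["school", "course"], ["ministry", "policy"],
            ["movie", "music"], ["football", "score"], ["headline", "breaking"],
            ["vacancy", "resume"], ["research", "physics"]]

def extract_features(html):
    """Stage 1 (assumed features): strip tags, count keyword hits per category."""
    text = re.sub(r"<[^>]+>", " ", html).lower()
    words = re.findall(r"[a-z]+", text)
    return [sum(words.count(k) for k in keys) for keys in KEYWORDS]

def normalize(counts):
    """Stage 2: scale counts into [0, 1] by dividing by the largest count."""
    top = max(counts)
    return [c / top if top else 0.0 for c in counts]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

class TinyMLP:
    """Stage 3: one-hidden-layer network trained with backpropagation."""

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)
        # The last weight in each row is the bias.
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)]
                   for _ in range(n_hidden)]
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]
                   for _ in range(n_out)]

    def forward(self, x):
        h = [sigmoid(sum(w * v for w, v in zip(row, x + [1.0]))) for row in self.w1]
        y = [sigmoid(sum(w * v for w, v in zip(row, h + [1.0]))) for row in self.w2]
        return h, y

    def train_step(self, x, target, lr=0.5):
        h, y = self.forward(x)
        # Output deltas for squared error with sigmoid units.
        d_out = [(y[k] - target[k]) * y[k] * (1 - y[k]) for k in range(len(y))]
        # Hidden deltas: push output deltas back through w2 before updating it.
        d_hid = [h[j] * (1 - h[j]) * sum(d_out[k] * self.w2[k][j] for k in range(len(y)))
                 for j in range(len(h))]
        for k, row in enumerate(self.w2):
            for j, v in enumerate(h + [1.0]):
                row[j] -= lr * d_out[k] * v
        for j, row in enumerate(self.w1):
            for i, v in enumerate(x + [1.0]):
                row[i] -= lr * d_hid[j] * v

    def error(self, samples):
        return sum((y - t) ** 2 for x, target in samples
                   for y, t in zip(self.forward(x)[1], target))

    def predict(self, x):
        y = self.forward(x)[1]
        return CATEGORIES[y.index(max(y))]

def one_hot(category):
    return [1.0 if c == category else 0.0 for c in CATEGORIES]

# Toy pages invented for this sketch.
pages = [("<h1>Football score</h1><p>The football final</p>", "sports"),
         ("<title>School course list</title><p>Each course</p>", "education")]
samples = [(normalize(extract_features(html)), one_hot(cat)) for html, cat in pages]

net = TinyMLP(n_in=8, n_hidden=6, n_out=8)
error_before = net.error(samples)
for _ in range(500):
    for x, t in samples:
        net.train_step(x, t)
error_after = net.error(samples)
print(error_before, error_after, net.predict(samples[0][0]))
```

Dividing by the largest count is only one way to keep every input between 0 and 1; a real system would need richer features and far more training pages than this toy set.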

research
12/08/2017

Difficulties of Timestamping Archived Web Pages

We show that state-of-the-art services for creating trusted timestamps i...
research
05/15/2021

A Large Visual, Qualitative and Quantitative Dataset of Web Pages

The World Wide Web is not only one of the most important platforms of co...
research
03/29/2020

Clickbait Detection using Multiple Categorization Techniques

Clickbaits are online articles with deliberately designed misleading tit...
research
10/11/2017

Explaining Trained Neural Networks with Semantic Web Technologies: First Steps

The ever increasing prevalence of publicly available structured data on ...
research
05/30/2020

Web page classification with Google Image Search results

In this paper, we introduce a novel method that combines multiple neural...
research
03/07/2011

Design of Automatically Adaptable Web Wrappers

Nowadays, the huge amount of information distributed through the Web mot...
research
12/03/2021

User-click Modelling for Predicting Purchase Intent

This thesis contributes a structured inquiry into the open actuarial mat...
