Comparative analysis of various web crawler algorithms

06/21/2023
by   Nithin T K, et al.
0

This presentation focuses on the importance of web crawling and page ranking algorithms in dealing with the massive amount of data present on the World Wide Web. As the web continues to grow exponentially, efficient search and retrieval methods become crucial. Web crawling is a process that converts unstructured data into structured data, enabling effective information retrieval. Additionally, page ranking algorithms play a significant role in assessing the quality and popularity of web pages. The presentation explores the background of these algorithms and evaluates five different crawling algorithms: Shark Search, Priority-Based Queue, Naive Bayes, Breadth-First, and Depth-First. The goal is to identify the most effective algorithm for crawling web pages. By understanding these algorithms, we can enhance our ability to navigate the web and extract valuable information efficiently.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/03/2015

Intelligent Search Optimization using Artificial Fuzzy Logics

Information on the web is prodigious; searching relevant information is ...
research
10/18/2022

CPS-MEBR: Click Feedback-Aware Web Page Summarization for Multi-Embedding-Based Retrieval

Embedding-based retrieval (EBR) is a technique to use embeddings to repr...
research
08/01/2019

A Hessenberg-type Algorithm for Computing PageRank Problems

PageRank is a greatly essential ranking algorithm in web information ret...
research
08/10/2012

Analysis of Statistical Hypothesis based Learning Mechanism for Faster Crawling

The growth of world-wide-web (WWW) spreads its wings from an intangible ...
research
05/30/2018

DATA:SEARCH'18 – Searching Data on the Web

This half day workshop explores challenges in data search, with a partic...
research
09/07/2017

Advanced Page Rank Algorithm with Semantics, In Links, Out Links and Google Analytics

In this paper we have modified the existing page ranking mechanism as an...
research
11/05/2020

Infer XPath

We propose reformulation of discovery of data structure within a web pag...

Please sign up or login with your details

Forgot password? Click here to reset