Parsisanj: a semi-automatic component-based approach towards search engine evaluation

09/25/2020
by   Amin Heydari Alashti, et al.
0

Accessing to required data on the internet is wide via search engines in the last two decades owing to the huge amount of available data and the high rate of new data is generating daily. Accordingly, search engines are encouraged to make the most valuable existing data on the web searchable. Knowing how to handle a large amount of data in each step of a search engines' procedure from crawling to indexing and ranking is just one of the challenges that a professional search engine should solve. Moreover, it should also have the best practices in handling users' traffics, state-of-the-art natural language processing tools, and should also address many other challenges on the edge of science and technology. As a result, evaluating these systems is too challenging due to the level of internal complexity they have, and is crucial for finding the improvement path of the existing system. Therefore, an evaluation procedure is a normal subsystem of a search engine that has the role of building its roadmap. Recently, several countries have developed national search engine programs to build an infrastructure to provide special services based on their needs on the available data of their language on the web. This research is conducted accordingly to enlighten the advancement path of two Iranian national search engines: Yooz and Parsijoo in comparison with two international ones, Google and Bing. Unlike related work, it is a semi-automatic method to evaluate the search engines at the first pace. Eventually, we obtained some interesting results which based on them the component-based improvement roadmap of national search engines could be illustrated concretely.

READ FULL TEXT

page 12

page 14

page 16

research
02/04/2011

Intelligent Semantic Web Search Engines: A Brief Survey

The World Wide Web (WWW) allows the people to share the information (dat...
research
03/09/2019

The Web is missing an essential part of infrastructure: an Open Web Index

A proposal for building an index of the Web that separates the infrastru...
research
05/27/2021

A functionality taxonomy for document search engines

In this paper a functionality taxonomy for document search engines is pr...
research
09/08/2022

Data Management Challenges for Internet-scale 3D Search Engines

This paper describes some of the major challenges encountered by Physna ...
research
10/26/2022

NeuralSearchX: Serving a Multi-billion-parameter Reranker for Multilingual Metasearch at a Low Cost

The widespread availability of search API's (both free and commercial) b...
research
08/15/2023

Delphic Costs and Benefits in Web Search: A utilitarian and historical analysis

We present a new framework to conceptualize and operationalize the total...
research
07/26/2018

Judging the Judges: Evaluating the Performance of International Gymnastics Judges

Judging a gymnastics routine is a noisy process, and the performance of ...

Please sign up or login with your details

Forgot password? Click here to reset