Towards an Open Science Platform for the Evaluation of Data Fusion

02/12/2018
by Weinan Huang, et al.

Combining the results of different search engines to improve upon their individual performance has been the subject of many research papers. This task has become known as "Data Fusion", and it holds great promise for dealing with the vast quantity of unstructured textual data that is a feature of many Big Data scenarios. However, no universally accepted evaluation methodology has emerged in the community, which makes it difficult to compare the various proposed techniques meaningfully from the literature alone. Variations in datasets, metrics, and baseline results have all contributed to this difficulty. This paper argues that a more unified approach is required, and that a centralised software platform should be developed to help researchers compare their algorithms with those of others. The desirable qualities of such a system are identified, and an early prototype has been developed. Re-implementing algorithms published by other researchers is a great burden on those proposing new techniques. The prototype has the potential to greatly reduce this burden and thus make it easier to generate and publish comparable results.


