State of the Art on the Quality of Big Data: A Systematic Literature Review and Classification Framework

by   Mostafa Mirzaie, et al.

One of the most significant problems of Big Data is to extract knowledge through the huge amount of data. The usefulness of the extracted information depends strongly on data quality. In addition to the importance, data quality has recently been taken into consideration by the big data community and there is not any comprehensive review conducted in this area. Therefore, the purpose of this study is to review and present the state of the art on the quality of big data research through a hierarchical framework. The dimensions of the proposed framework cover various aspects in the quality assessment of Big Data including 1) the processing types of big data, i.e. stream, batch, and hybrid, 2) the main task, and 3) the method used to conduct the task. We compare and critically review all of the studies reported during the last ten years through our proposed framework to identify which of the available data quality assessment methods have been successfully adopted by the big data community. Finally, we provide a critical discussion on the limitations of existing methods and offer suggestions on potential valuable research directions that can be taken in future research in this domain.


page 23

page 25


Contextualization of Big Data Quality: A framework for comparison

With the advent of big data applications and the increasing amount of da...

Big Data Quality: A systematic literature review and future research directions

One of the challenges manifested after global growth of social networks ...

Quality Assurance Technologies of Big Data Applications: A Systematic Literature Review

Big data applications are currently used in many application domains, ra...

Architectural Tactics for Big Data Cybersecurity Analytic Systems: A Review

Context: Big Data Cybersecurity Analytics is aimed at protecting network...

A Survey of Community Search Over Big Graphs

With the rapid development of information technologies, various big grap...

Big Data Privacy Context: Literature Effects On Secure Informational Assets

This article's objective is the identification of research opportunities...

Social Credibility Incorporating Semantic Analysis and Machine Learning: A Survey of the State-of-the-Art and Future Research Directions

The wealth of Social Big Data (SBD) represents a unique opportunity for ...

Please sign up or login with your details

Forgot password? Click here to reset