Greening Big Data Networks: The Impact of Veracity

by   Ali M. Al-Salim, et al.

The continuous increase in big data applications, in number and types, creates new challenges that should be tackled by the green ICT community. Big data is mainly characterized by 4 Vs volume, variety, velocity, and veracity. Each V poses a number of challenges that have implications on the energy efficiency of the underlying networks carrying the big data. Addressing the veracity of the data is a more serious challenge to data scientists, since they need to distinguish between the meaningful data and the dirty data. In this article, we investigate the impact of big data veracity on greening IP by developing a Mixed Integer Linear Programming, MILP, model that encapsulates the distinctive features of veracity. In our analyses, the big data network was greened by cleansing the raw big data before processing and then progressively processing the cleansed big data at strategic locations, dubbed processing nodes, PNs. The PNs are built into the network along the path from the sources to the centralized datacenters. At each PN, the cleansed data was processed and smaller volume of useful information was extracted progressively, thereby, reducing the network power consumption. Furthermore, a backup for the cleansed data was stored in an optimally selected Backup Node, BN. We evaluated the network power saving that can be achieved by a green big data network compared to the classical non-progressive approach. We obtained up to 52 percent network power savings, on average, in the green big data approach compared to the classical approach.



There are no comments yet.


page 1

page 2

page 4

page 5

page 11

page 12


Standards for Energy Efficient Virtualization, Content Distribution and Big Data in Beyond 5G Networks

Power consumption in communication networks and the supporting computing...

DV-DVFS: Merging Data Variety and DVFS Technique to Manage the Energy Consumption of Big Data Processing

Data variety is one of the most important features of Big Data. Data var...

Big Data Refinement

"Big data" has become a major area of research and associated funding, a...

A Survey of Big Data Machine Learning Applications Optimization in Cloud Data Centers and Networks

This survey article reviews the challenges associated with deploying and...

Big Data Challenges in Genome Informatics

In recent years, we have witnessed a dramatic data explosion in genomics...

Node Centrality Metrics for Hotspots Analysis in Telecom Big Data

In this work, we are interested in the applications of big data in the t...

On the Scalability of Big Data Cyber Security Analytics Systems

Big Data Cyber Security Analytics (BDCA) systems use big data technologi...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.