Greening Big Data Networks: The Impact of Veracity

12/26/2018
by   Ali M. Al-Salim, et al.
0

The continuous increase in big data applications, in number and types, creates new challenges that should be tackled by the green ICT community. Big data is mainly characterized by 4 Vs volume, variety, velocity, and veracity. Each V poses a number of challenges that have implications on the energy efficiency of the underlying networks carrying the big data. Addressing the veracity of the data is a more serious challenge to data scientists, since they need to distinguish between the meaningful data and the dirty data. In this article, we investigate the impact of big data veracity on greening IP by developing a Mixed Integer Linear Programming, MILP, model that encapsulates the distinctive features of veracity. In our analyses, the big data network was greened by cleansing the raw big data before processing and then progressively processing the cleansed big data at strategic locations, dubbed processing nodes, PNs. The PNs are built into the network along the path from the sources to the centralized datacenters. At each PN, the cleansed data was processed and smaller volume of useful information was extracted progressively, thereby, reducing the network power consumption. Furthermore, a backup for the cleansed data was stored in an optimally selected Backup Node, BN. We evaluated the network power saving that can be achieved by a green big data network compared to the classical non-progressive approach. We obtained up to 52 percent network power savings, on average, in the green big data approach compared to the classical approach.

READ FULL TEXT

page 1

page 2

page 4

page 5

page 11

page 12

research
05/04/2020

Standards for Energy Efficient Virtualization, Content Distribution and Big Data in Beyond 5G Networks

Power consumption in communication networks and the supporting computing...
research
02/07/2021

DV-DVFS: Merging Data Variety and DVFS Technique to Manage the Energy Consumption of Big Data Processing

Data variety is one of the most important features of Big Data. Data var...
research
06/07/2016

Big Data Refinement

"Big data" has become a major area of research and associated funding, a...
research
10/01/2019

A Survey of Big Data Machine Learning Applications Optimization in Cloud Data Centers and Networks

This survey article reviews the challenges associated with deploying and...
research
03/13/2019

Node Centrality Metrics for Hotspots Analysis in Telecom Big Data

In this work, we are interested in the applications of big data in the t...
research
02/02/2019

Big Data and Geospatial Analysis

Perhaps one of the mostly hotly debated topics in recent years has been ...
research
03/20/2018

Big Data Challenges in Genome Informatics

In recent years, we have witnessed a dramatic data explosion in genomics...

Please sign up or login with your details

Forgot password? Click here to reset