Using Deep Neural Networks to Automate Large Scale Statistical Analysis for Big Data Applications

08/09/2017
by   Rongrong Zhang, et al.
0

Statistical analysis (SA) is a complex process to deduce population properties from analysis of data. It usually takes a well-trained analyst to successfully perform SA, and it becomes extremely challenging to apply SA to big data applications. We propose to use deep neural networks to automate the SA process. In particular, we propose to construct convolutional neural networks (CNNs) to perform automatic model selection and parameter estimation, two most important SA tasks. We refer to the resulting CNNs as the neural model selector and the neural model estimator, respectively, which can be properly trained using labeled data systematically generated from candidate models. Simulation study shows that both the selector and estimator demonstrate excellent performances. The idea and proposed framework can be further extended to automate the entire SA process and have the potential to revolutionize how SA is performed in big data analytics.

READ FULL TEXT
research
03/29/2018

Statistical Validity and Consistency of Big Data Analytics: A General Framework

Informatics and technological advancements have triggered generation of ...
research
06/28/2023

Integrating Big Data and Survey Data for Efficient Estimation of the Median

An ever-increasing deluge of big data is becoming available to national ...
research
07/22/2020

Big Issues for Big Data: challenges for critical spatial data analytics

In this paper we consider some of the issues of working with big data an...
research
07/29/2022

Big Data and Analytics Implementation in Tertiary Institutions to Predict Students Performance in Nigeria

The term Big Data has been coined to refer to the gargantuan bulk of dat...
research
01/13/2020

Towards Automated Swimming Analytics Using Deep Neural Networks

Methods for creating a system to automate the collection of swimming ana...
research
07/21/2023

Transferability of Convolutional Neural Networks in Stationary Learning Tasks

Recent advances in hardware and big data acquisition have accelerated th...
research
09/14/2020

A Hybrid Framework for Topology Identification of Distribution Grid with Renewables Integration

Topology identification (TI) is a key task for state estimation (SE) in ...

Please sign up or login with your details

Forgot password? Click here to reset