Analysis of Large Heterogeneous Repairable System Reliability Data with Static System Attributes and Dynamic Sensor Measurement in Big Data Environment

04/01/2019
by   Xiao Liu, et al.
0

In Big Data environment, one pressing challenge facing engineers is to perform reliability analysis for a large fleet of heterogeneous repairable systems with covariates. In addition to static covariates, which include time-invariant system attributes such as nominal operating conditions, geo-locations, etc., the recent advances of sensing technologies have also made it possible to obtain dynamic sensor measurement of system operating and environmental conditions. As a common practice in the Big Data environment, the massive reliability data are typically stored in some distributed storage systems. Leveraging the power of modern statistical learning, this paper investigates a statistical approach which integrates the Random Forests algorithm and the classical data analysis methodologies for repairable system reliability, such as the nonparametric estimator for the Mean Cumulative Function and the parametric models based on the Nonhomogeneous Poisson Process. We show that the proposed approach effectively addresses some common challenges arising from practice, including system heterogeneity, covariate selection, model specification and data locality due to the distributed data storage. The large sample properties as well as the uniform consistency of the proposed estimator is established. Two numerical examples and a case study are presented to illustrate the application of the proposed approach. The strengths of the proposed approach are demonstrated by comparison studies.

READ FULL TEXT
research
03/16/2018

Big Data and Reliability Applications: The Complexity Dimension

Big data features not only large volumes of data but also data with comp...
research
04/25/2019

Storage Solutions for Big Data Systems: A Qualitative Study and Comparison

Big data systems development is full of challenges in view of the variet...
research
08/26/2019

Statistical Analysis of Modern Reliability Data

Traditional reliability analysis has been using time to event data, degr...
research
02/05/2020

Quality Assurance Technologies of Big Data Applications: A Systematic Literature Review

Big data applications are currently used in many application domains, ra...
research
11/26/2015

Random Forests for Big Data

Big Data is one of the major challenges of statistical science and has n...
research
11/01/2019

Bivariate, Cluster and Suitability Analysis of NoSQL Solutions for Different Application Areas

Big data systems development is full of challenges in view of the variet...
research
12/23/2022

Balanced Subsampling for Big Data with Categorical Covariates

The use and analysis of massive data are challenging due to the high sto...

Please sign up or login with your details

Forgot password? Click here to reset