SAIC: Identifying Configuration Files for System Configuration Management

11/06/2017
by   Zhen Huang, et al.
0

Systems can become misconfigured for a variety of reasons such as operator errors or buggy patches. When a misconfiguration is discovered, usually the first order of business is to restore availability, often by undoing the misconfiguration. To simplify this task, we propose the Statistical Analysis for Identifying Configuration Files (SAIC), which analyzes how the contents of a file changes over time to automatically determine which files contain configuration state. In this way, SAIC reduces the number of files a user must manually examine during recovery and allows versioning file systems to make more efficient use of their versioning storage. The two key insights that enable SAIC to identify configuration files are that configuration state must persist across executions of an application and that configuration state changes at a slower rate than other types of application state. SAIC applies these insights through a set of filters, which eliminate non-persistent files from consideration, and a novel similarity metric, which measures how similar a file's versions are to each other. Together, these two mechanisms enable SAIC to identify all 72 configuration files out of 2363 versioned files from 6 common applications in two user traces, while mistaking only 33 non-configuration files as configuration files, which allows a versioning file system to eliminate roughly 66 non-configuration file versions from its logs, thus reducing the number of file versions that a user must try to recover from a misconfiguration.

READ FULL TEXT
research
03/17/2020

An Exploratory Study of Bot Commits

Background: Bots help automate many of the tasks performed by software d...
research
04/19/2021

Inferring Drop-in Binary Parsers from Program Executions

We present BIEBER (Byte-IdEntical Binary parsER), the first system to mo...
research
05/20/2020

Information Freshness in Cache Updating Systems with Limited Cache Storage Capacity

We consider a cache updating system with a source, a cache with limited ...
research
11/02/2017

Ocasta: Clustering Configuration Settings For Error Recovery

Effective machine-aided diagnosis and repair of configuration errors con...
research
05/25/2016

As-exact-as-possible repair of unprintable STL files

The class of models that can be represented by STL files is larger than ...
research
05/18/2018

Extending Dynamic Bayesian Networks for Anomaly Detection in Complex Logs

Checking various log files from different processes can be a tedious tas...
research
04/23/2018

Forensic Analysis of the exFAT artefacts

Although keeping some basic concepts inherited from FAT32, the exFAT fil...

Please sign up or login with your details

Forgot password? Click here to reset