Feature Subset Selection for Software Cost Modelling and Estimation

10/03/2012
by   Efi Papatheocharous, et al.
0

Feature selection has been recently used in the area of software engineering for improving the accuracy and robustness of software cost models. The idea behind selecting the most informative subset of features from a pool of available cost drivers stems from the hypothesis that reducing the dimensionality of datasets will significantly minimise the complexity and time required to reach to an estimation using a particular modelling technique. This work investigates the appropriateness of attributes, obtained from empirical project databases and aims to reduce the cost drivers used while preserving performance. Finding suitable subset selections that may cater improved predictions may be considered as a pre-processing step of a particular technique employed for cost estimation (filter or wrapper) or an internal (embedded) step to minimise the fitting error. This paper compares nine relatively popular feature selection methods and uses the empirical values of selected attributes recorded in the ISBSG and Desharnais datasets to estimate software development effort.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/28/2010

Software Effort Estimation with Ridge Regression and Evolutionary Attribute Selection

Software cost estimation is one of the prerequisite managerial activitie...
research
05/10/2020

Embedded Chaotic Whale Survival Algorithm for Filter-Wrapper Feature Selection

Classification accuracy provided by a machine learning model depends a l...
research
01/05/2014

Feature Selection Using Classifier in High Dimensional Data

Feature selection is frequently used as a pre-processing step to machine...
research
01/23/2021

Feature Selection Using Reinforcement Learning

With the decreasing cost of data collection, the space of variables or f...
research
06/06/2023

A Review Of Progress for Component Based Software Cost Estimation From 1965 to 2023

Component Based Software Engineering (CBSE) is used to develop software ...
research
12/23/2018

A determinantal point process for column subset selection

Dimensionality reduction is a first step of many machine learning pipeli...
research
01/10/2019

Modified Jaccard Index Analysis and Adaptive Feature Selection for Location Fingerprinting with Limited Computational Complexity

We propose an approach for fingerprinting-based positioning which reduce...

Please sign up or login with your details

Forgot password? Click here to reset