Using Genetic Programming to Improve Software Effort Estimation Based on General Data Sets

Software organizations rely on the estimation of efforts required for the development of software to negotiate customers and plan the schedule of the project. Proper estimation of efforts reduces the chances of project failures. Historical data of projects have been used to predict the effort required for software development. In recent years, various ensemble of machine learning techniques have been used to predict software effort. In the present work, a novel ensemble technique of hybrid search-based algorithms (EHSBA) is used for software effort estimation. Four HSBAs—fuzzy and random sets-based modeling (FRSBM-R), symbolic fuzzy learning based on genetic programming (GFS-GP-R), symbolic fuzzy learning based on genetic programming grammar operators and simulated annealing (GFS_GSP_R), and least mean squares linear regression (LinearLMS_R)—are used to create an ensemble (EHSBA). The EHSBA is compared with machine learning-based ensemble bagging, vote, and stacking on datasets obtained from PROMISE repository. Obtained results reported that EHSBA outperformed all other techniques.

Download Full-text

Evolving Software Effort Estimation Models Using Multigene Symbolic Regression Genetic Programming

INTERNATIONAL JOURNAL OF ADVANCED RESEARCH IN ARTIFICIAL INTELLIGENCE ◽

10.14569/ijarai.2013.021207 ◽

2013 ◽

Vol 2 (12) ◽

Cited By ~ 5

Author(s):

Sultan Aljahdali ◽

Alaa Sheta

Keyword(s):

Genetic Programming ◽

Symbolic Regression ◽

Effort Estimation ◽

Software Effort Estimation ◽

Estimation Models

Download Full-text

Can genetic programming improve software effort estimation? A comparative evaluation

Information and Software Technology ◽

10.1016/s0950-5849(01)00192-6 ◽

2001 ◽

Vol 43 (14) ◽

pp. 863-873 ◽

Cited By ~ 154

Author(s):

Colin J Burgess ◽

Martin Lefley

Keyword(s):

Genetic Programming ◽

Comparative Evaluation ◽

Effort Estimation ◽

Software Effort Estimation

Download Full-text

Estimating Software Development Efforts Using a Random Forest-Based Stacked Ensemble Approach

Electronics ◽

10.3390/electronics10101195 ◽

2021 ◽

Vol 10 (10) ◽

pp. 1195

Author(s):

Priya Varshini A G ◽

Anitha Kumari K ◽

Vijayakumar Varadarajan

Keyword(s):

Deep Learning ◽

Random Forest ◽

Software Development ◽

Weighted Averaging ◽

Software Project ◽

Effort Estimation ◽

Software Effort Estimation ◽

Single Model ◽

Ensemble Techniques ◽

Project Estimation

Software Project Estimation is a challenging and important activity in developing software projects. Software Project Estimation includes Software Time Estimation, Software Resource Estimation, Software Cost Estimation, and Software Effort Estimation. Software Effort Estimation focuses on predicting the number of hours of work (effort in terms of person-hours or person-months) required to develop or maintain a software application. It is difficult to forecast effort during the initial stages of software development. Various machine learning and deep learning models have been developed to predict the effort estimation. In this paper, single model approaches and ensemble approaches were considered for estimation. Ensemble techniques are the combination of several single models. Ensemble techniques considered for estimation were averaging, weighted averaging, bagging, boosting, and stacking. Various stacking models considered and evaluated were stacking using a generalized linear model, stacking using decision tree, stacking using a support vector machine, and stacking using random forest. Datasets considered for estimation were Albrecht, China, Desharnais, Kemerer, Kitchenham, Maxwell, and Cocomo81. Evaluation measures used were mean absolute error, root mean squared error, and R-squared. The results proved that the proposed stacking using random forest provides the best results compared with single model approaches using the machine or deep learning algorithms and other ensemble techniques.

Download Full-text

Software Effort Estimation Using Early COSMIC to Substitute Use Case Weight

2020 International Seminar on Application for Technology of Information and Communication (iSemantic) ◽

10.1109/isemantic50169.2020.9234227 ◽

2020 ◽

Author(s):

Renny Sari Dewi ◽

Riyanarto Sarno

Keyword(s):

Use Case ◽

Effort Estimation ◽

Software Effort Estimation

Download Full-text

Software Effort Estimation from Use Case Diagrams Using Nonlinear Regression Analysis

2020 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE) ◽

10.1109/ccece47787.2020.9255712 ◽

2020 ◽

Author(s):

Ali Bou Nassif ◽

Manar AbuTalib ◽

Luiz Fernando Capretz

Keyword(s):

Regression Analysis ◽

Nonlinear Regression ◽

Use Case ◽

Effort Estimation ◽

Nonlinear Regression Analysis ◽

Software Effort Estimation

Download Full-text

Binary Image Classification: A Genetic Programming Approach to the Problem of Limited Training Instances

Evolutionary Computation ◽

10.1162/evco_a_00146 ◽

2016 ◽

Vol 24 (1) ◽

pp. 143-182 ◽

Cited By ~ 10

Author(s):

Harith Al-Sahaf ◽

Mengjie Zhang ◽

Mark Johnston

Keyword(s):

Computer Vision ◽

Pattern Recognition ◽

Genetic Programming ◽

Visual System ◽

Image Classification ◽

Human Visual System ◽

Binary Classification ◽

Programming Approach ◽

Data Sets ◽

New Class

In the computer vision and pattern recognition fields, image classification represents an important yet difficult task. It is a challenge to build effective computer models to replicate the remarkable ability of the human visual system, which relies on only one or a few instances to learn a completely new class or an object of a class. Recently we proposed two genetic programming (GP) methods, one-shot GP and compound-GP, that aim to evolve a program for the task of binary classification in images. The two methods are designed to use only one or a few instances per class to evolve the model. In this study, we investigate these two methods in terms of performance, robustness, and complexity of the evolved programs. We use ten data sets that vary in difficulty to evaluate these two methods. We also compare them with two other GP and six non-GP methods. The results show that one-shot GP and compound-GP outperform or achieve results comparable to competitor methods. Moreover, the features extracted by these two methods improve the performance of other classifiers with handcrafted features and those extracted by a recently developed GP-based method in most cases.

Download Full-text

Class point approach for software effort estimation using stochastic gradient boosting technique

ACM SIGSOFT Software Engineering Notes ◽

10.1145/2597716.2597725 ◽

2014 ◽

Vol 39 (3) ◽

pp. 1-6 ◽

Cited By ~ 7

Author(s):

Shashank Mouli Satapathy ◽

Barada Prasanna Acharya ◽

Santanu Kumar Rath

Keyword(s):

Stochastic Gradient ◽

Gradient Boosting ◽

Effort Estimation ◽

Software Effort Estimation ◽

Stochastic Gradient Boosting ◽

Boosting Technique ◽

Class Point

Download Full-text