Performance Assessment of Ensemble Learning Model for Prediction of Cardiac Disease Among Smokers Based on HRV Features

Smoking impacts the pattern of heart rate variability (HRV); HRV therefore acts as a predictor of cardiac diseases (CD). In this study, to predict CD non-invasively among smokers, ensemble machine learning methods have been used. A single model is created based on ensemble voting classifier with a combined boosting technique to improve the accuracy of predictive model. The final ensemble model shows an accuracy of 95.20%, precision of 97.27%, sensitivity of 92.35%, specificity of 98.07%, F1 score of 0.95, AUC of 0.961, MCE of 0.0479, kappa statistics value of 0.9041, and MSE of 0.2189. The obtained accuracy by using the proposed method is the highest value achieved so far for the prediction of CD among smokers using HRV data.

Download Full-text

An Optimized Stacking Ensemble Model for Phishing Websites Detection

Electronics ◽

10.3390/electronics10111285 ◽

2021 ◽

Vol 10 (11) ◽

pp. 1285

Author(s):

Mohammed Al-Sarem ◽

Faisal Saeed ◽

Zeyad Ghaleb Al-Mekhlafi ◽

Badiea Abdulkarem Mohammed ◽

Tawfik Al-Hadhrami ◽

...

Keyword(s):

Machine Learning ◽

Random Forests ◽

Ensemble Method ◽

Detection Methods ◽

Detection Accuracy ◽

Ensemble Model ◽

Security Attacks ◽

Data Set ◽

Machine Learning Methods ◽

Ensemble Machine Learning

Security attacks on legitimate websites to steal users’ information, known as phishing attacks, have been increasing. This kind of attack does not just affect individuals’ or organisations’ websites. Although several detection methods for phishing websites have been proposed using machine learning, deep learning, and other approaches, their detection accuracy still needs to be enhanced. This paper proposes an optimized stacking ensemble method for phishing website detection. The optimisation was carried out using a genetic algorithm (GA) to tune the parameters of several ensemble machine learning methods, including random forests, AdaBoost, XGBoost, Bagging, GradientBoost, and LightGBM. The optimized classifiers were then ranked, and the best three models were chosen as base classifiers of a stacking ensemble method. The experiments were conducted on three phishing website datasets that consisted of both phishing websites and legitimate websites—the Phishing Websites Data Set from UCI (Dataset 1); Phishing Dataset for Machine Learning from Mendeley (Dataset 2, and Datasets for Phishing Websites Detection from Mendeley (Dataset 3). The experimental results showed an improvement using the optimized stacking ensemble method, where the detection accuracy reached 97.16%, 98.58%, and 97.39% for Dataset 1, Dataset 2, and Dataset 3, respectively.

Download Full-text

A novel multi-stage ensemble model with multiple K-means-based selective undersampling: An application in credit scoring

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-201954 ◽

2021 ◽

Vol 40 (5) ◽

pp. 9471-9484

Author(s):

Yilun Jin ◽

Yanan Liu ◽

Wenyu Zhang ◽

Shuai Zhang ◽

Yu Lou

Keyword(s):

Machine Learning ◽

Predictive Accuracy ◽

Credit Scoring ◽

Imbalanced Data ◽

Ensemble Model ◽

Selective Sampling ◽

Machine Learning Methods ◽

Multi Stage ◽

Proposed Model ◽

New Feature

With the advancement of machine learning, credit scoring can be performed better. As one of the widely recognized machine learning methods, ensemble learning has demonstrated significant improvements in the predictive accuracy over individual machine learning models for credit scoring. This study proposes a novel multi-stage ensemble model with multiple K-means-based selective undersampling for credit scoring. First, a new multiple K-means-based undersampling method is proposed to deal with the imbalanced data. Then, a new selective sampling mechanism is proposed to select the better-performing base classifiers adaptively. Finally, a new feature-enhanced stacking method is proposed to construct an effective ensemble model by composing the shortlisted base classifiers. In the experiments, four datasets with four evaluation indicators are used to evaluate the performance of the proposed model, and the experimental results prove the superiority of the proposed model over other benchmark models.

Download Full-text

Prediction of Cesarean Childbirth using Ensemble Machine Learning Methods

Proceedings of the 22nd International Conference on Information Integration and Web-based Applications & Services ◽

10.1145/3428757.3429138 ◽

2020 ◽

Cited By ~ 1

Author(s):

Nafiz Imtiaz Khan ◽

Tahasin Mahmud ◽

Muhammad Nazrul Islam ◽

Sumaiya Nuha Mustafina

Keyword(s):

Machine Learning ◽

Learning Methods ◽

Machine Learning Methods ◽

Ensemble Machine Learning

Download Full-text

Prediction of Change-Prone Classes Using Machine Learning and Statistical Techniques

Advanced Research and Trends in New Technologies, Software, Human-Computer Interaction, and Communicability - Advances in Human and Social Aspects of Technology ◽

10.4018/978-1-4666-4490-8.ch019 ◽

2014 ◽

pp. 193-202

Author(s):

LinRuchika Malhotra ◽

Ankita Jain Bansal

Keyword(s):

Machine Learning ◽

Software Quality ◽

Software Maintenance ◽

Learning Methods ◽

Machine Learning Methods ◽

Change Proneness ◽

Regression Techniques ◽

Reducing Costs ◽

Boosting Technique ◽

The Relationship

For software development, availability of resources is limited, thereby necessitating efficient and effective utilization of resources. This can be achieved through prediction of key attributes, which affect software quality such as fault proneness, change proneness, effort, maintainability, etc. The primary aim of this chapter is to investigate the relationship between object-oriented metrics and change proneness. Predicting the classes that are prone to changes can help in maintenance and testing. Developers can focus on the classes that are more change prone by appropriately allocating resources. This will help in reducing costs associated with software maintenance activities. The authors have constructed models to predict change proneness using various machine-learning methods and one statistical method. They have evaluated and compared the performance of these methods. The proposed models are validated using open source software, Frinika, and the results are evaluated using Receiver Operating Characteristic (ROC) analysis. The study shows that machine-learning methods are more efficient than regression techniques. Among the machine-learning methods, boosting technique (i.e. Logitboost) outperformed all the other models. Thus, the authors conclude that the developed models can be used to predict the change proneness of classes, leading to improved software quality.

Download Full-text

Prediction of Concrete Properties Using Ensemble Machine Learning Methods

Journal of Physics Conference Series ◽

10.1088/1742-6596/1625/1/012024 ◽

2020 ◽

Vol 1625 ◽

pp. 012024

Author(s):

D Prayogo ◽

D I Santoso ◽

D Wijaya ◽

T Gunawan ◽

J A Widjaja

Keyword(s):

Machine Learning ◽

Learning Methods ◽

Machine Learning Methods ◽

Concrete Properties ◽

Ensemble Machine Learning

Download Full-text

Wearable ECG Recorder with MATLAB

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.a9473.109119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 442-444

Keyword(s):

Heart Rate ◽

Electrical Activity ◽

Cardiac Disease ◽

Wearable Device ◽

Cardiac Diseases ◽

Ecg Signal ◽

Cardiac Health ◽

Current Scenario

Looking at the current scenario and lifestyle of individual, cardiac diseases have become common problem irrespective of person’s age. In some cases if this kind of cardiac disease is at severe level than it become the reason for death .Electrocardiograph is electrical activity of heart. By acquiring it through device and analysis, the cardiac health of person can be diagnosed. In this paper we are utilizing 3-lead wet electrode to acquire ECG Signal The ECG signal is conditioned and filtered by AD8232 IC and acquired in MATLAB through microcontroller. It is simply wearable device and heart rate of person is displayed in MATLAB.

Download Full-text

Investigating Algorithmic Stock Market Trading using Ensemble Machine Learning Methods

Informatica ◽

10.31449/inf.v44i3.2904 ◽

2020 ◽

Vol 44 (3) ◽

Author(s):

Ramzi Saifan ◽

Khaled Sharif ◽

Mohammad Abu-Ghazaleh ◽

Mohammad Abdel-Majeed

Keyword(s):

Machine Learning ◽

Stock Market ◽

Learning Methods ◽

Machine Learning Methods ◽

Ensemble Machine Learning ◽

Stock Market Trading

Download Full-text

Modelling and Forecasting Temporal PM2.5 Concentration Using Ensemble Machine Learning Methods

Buildings ◽

10.3390/buildings12010046 ◽

2022 ◽

Vol 12 (1) ◽

pp. 46

Author(s):

Obuks Augustine Ejohwomu ◽

Olakekan Shamsideen Oshodi ◽

Majeed Oladokun ◽

Oyegoke Teslim Bukoye ◽

Nwabueze Emekwuru ◽

...

Keyword(s):

Machine Learning ◽

Air Pollutants ◽

Data Science ◽

Climatic Factors ◽

Extreme Weather Events ◽

Learning Methods ◽

Rapid Urbanization ◽

Machine Learning Methods ◽

Ensemble Machine Learning ◽

Pm2.5 Concentration

Exposure of humans to high concentrations of PM2.5 has adverse effects on their health. Researchers estimate that exposure to particulate matter from fossil fuel emissions accounted for 18% of deaths in 2018—a challenge policymakers argue is being exacerbated by the increase in the number of extreme weather events and rapid urbanization as they tinker with strategies for reducing air pollutants. Drawing on a number of ensemble machine learning methods that have emerged as a result of advancements in data science, this study examines the effectiveness of using ensemble models for forecasting the concentrations of air pollutants, using PM2.5 as a representative case. A comprehensive evaluation of the ensemble methods was carried out by comparing their predictive performance with that of other standalone algorithms. The findings suggest that hybrid models provide useful tools for PM2.5 concentration forecasting. The developed models show that machine learning models are efficient in predicting air particulate concentrations, and can be used for air pollution forecasting. This study also provides insights into how climatic factors influence the concentrations of pollutants found in the air.

Download Full-text

APPLICATION OF ENSEMBLE MACHINE LEARNING METHODS FOR PREDICTING THE TECHNICAL STATE OF AN OBJECT

Izvestiya of Samara Scientific Center of the Russian Academy of Sciences ◽

10.37313/1990-5378-2021-23-1-111-114 ◽

2021 ◽

Vol 23 (1) ◽

pp. 111-114

Author(s):

Yu.E. Kuvayskova ◽

Keyword(s):

Machine Learning ◽

The State ◽

Technical State ◽

Supervised Machine Learning ◽

Time Interval ◽

Technical Documentation ◽

Technical Object ◽

Learning Methods ◽

Machine Learning Methods ◽

Ensemble Machine Learning

To ensure the reliable functioning of a technical object, it is necessary to predict its state for the upcoming time interval. Let the technical state of the object be characterized at a certain point in time by a set of parameters established by the technical documentation for the object. It is assumed that for certain values of these parameters, the object may be in a good or faulty state. It is required by the values of these parameters to estimate the state of the object in the upcoming time interval. Supervised machine learning methods can be applied to solve this problem. However, to obtain good results in predicting the state of an object, it is necessary to choose the correct training model. One of the disadvantages of machine learning models is high bias and too much scatter. In this paper, to reduce the scatter of the model, it is proposed to use ensemble machine learning methods, namely, the bagging procedure. The main idea of the ensemble of methods is that with the right combination of weak models, more accurate and robust models can be obtained. The purpose of bagging is to create an ensemble model that is more reliable than the individual models that make up it. One of the big advantages of bagging is its concurrency, since different ensemble models are trained independently of each other. The effectiveness of the proposed approach is shown by the example of predicting the technical state of an object by eight parameters of its functioning. To assess the effectiveness of the application of ensemble machine learning methods for predicting the technical state of an object, the quality criteria of binary classification are used: accuracy, completeness, and F-measure. It is shown that the use of ensemble machine learning methods can improve the accuracy of predicting the state of a technical object by 4% –9% in comparison with basic machine learning methods. This approach can be used by specialists to predict the technical condition of objects in many technical applications, in particular, in aviation.

Download Full-text