A Comparative Study of Machine Learning Algorithms in Predicting Severe Complications after Bariatric Surgery

Yang Cao; Xin Fang; Johan Ottosson; Erik Näslund; Erik Stenberg

doi:10.3390/jcm8050668

A Comparative Study of Machine Learning Algorithms in Predicting Severe Complications after Bariatric Surgery

Journal of Clinical Medicine ◽

10.3390/jcm8050668 ◽

2019 ◽

Vol 8 (5) ◽

pp. 668 ◽

Cited By ~ 17

Author(s):

Yang Cao ◽

Xin Fang ◽

Johan Ottosson ◽

Erik Näslund ◽

Erik Stenberg

Keyword(s):

Machine Learning ◽

Bariatric Surgery ◽

Test Data ◽

Imbalanced Data ◽

Preoperative Assessment ◽

High Accuracy ◽

Machine Learning Algorithms ◽

Global Public Health ◽

Bariatric Surgical Procedure ◽

A Minor

Background: Severe obesity is a global public health threat of growing proportions. Accurate models to predict severe postoperative complications could be of value in the preoperative assessment of potential candidates for bariatric surgery. So far, traditional statistical methods have failed to produce high accuracy. We aimed to find a useful machine learning (ML) algorithm to predict the risk for severe complication after bariatric surgery. Methods: We trained and compared 29 supervised ML algorithms using information from 37,811 patients that operated with a bariatric surgical procedure between 2010 and 2014 in Sweden. The algorithms were then tested on 6250 patients operated in 2015. We performed the synthetic minority oversampling technique tackling the issue that only 3% of patients experienced severe complications. Results: Most of the ML algorithms showed high accuracy (>90%) and specificity (>90%) in both the training and test data. However, none of the algorithms achieved an acceptable sensitivity in the test data. We also tried to tune the hyperparameters of the algorithms to maximize sensitivity, but did not yet identify one with a high enough sensitivity that can be used in clinical praxis in bariatric surgery. However, a minor, but perceptible, improvement in deep neural network (NN) ML was found. Conclusion: In predicting the severe postoperative complication among the bariatric surgery patients, ensemble algorithms outperform base algorithms. When compared to other ML algorithms, deep NN has the potential to improve the accuracy and it deserves further investigation. The oversampling technique should be considered in the context of imbalanced data where the number of the interested outcome is relatively small.

Download Full-text

A comparative study of machine learning algorithms in predicting severe complications after bariatric surgery

10.1101/376038 ◽

2018 ◽

Author(s):

Yang Cao ◽

Xin Fang ◽

Johan Ottosson ◽

Erik Näslund ◽

Erik Stenberg

Keyword(s):

Machine Learning ◽

Bariatric Surgery ◽

Test Data ◽

Preoperative Assessment ◽

High Accuracy ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Complications After Bariatric Surgery ◽

Bariatric Surgical Procedure ◽

Accuracy Of Prediction

AbstractAccurate models to predict severe postoperative complications could be of value in the preoperative assessment of potential candidates for bariatric surgery. Traditional statistical methods have so far failed to produce high accuracy. To find a useful algorithm to predict the risk for severe complication after bariatric surgery, we trained and compared 29 supervised machine learning (ML) algorithms using information from 37,811 patients operated with a bariatric surgical procedure between 2010 and 2014 in Sweden. The algorithms were then tested on 6,250 patients operated in 2015. Most ML algorithms showed high accuracy (>90%) and specificity (>0.9) in both the training and test data. However, none achieved an acceptable sensitivity in the test data. ML methods may improve accuracy of prediction but we did not yet identify one with a high enough sensitivity that can be used in clinical praxis in bariatric surgery. Further investigation on deeper neural network algorithms is needed.

Download Full-text

Machine learning algorithms for predicting undernutrition among under-five children in Ethiopia

Public Health Nutrition ◽

10.1017/s1368980021004262 ◽

2021 ◽

pp. 1-29

Author(s):

Fikrewold H. Bitew ◽

Corey S. Sparks ◽

Samuel H. Nyarko

Keyword(s):

Machine Learning ◽

Linear Models ◽

Learning Algorithms ◽

Public Health Problem ◽

Water Source ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

Global Public Health ◽

Prediction Ability ◽

Extreme Gradient Boosting

Abstract Objective: Child undernutrition is a global public health problem with serious implications. In this study, estimate predictive algorithms for the determinants of childhood stunting by using various machine learning (ML) algorithms. Design: This study draws on data from the Ethiopian Demographic and Health Survey of 2016. Five machine learning algorithms including eXtreme gradient boosting (xgbTree), k-nearest neighbors (K-NN), random forest (RF), neural network (NNet), and the generalized linear models (GLM) were considered to predict the socio-demographic risk factors for undernutrition in Ethiopia. Setting: Households in Ethiopia. Participants: A total of 9,471 children below five years of age. Results: The descriptive results show substantial regional variations in child stunting, wasting, and underweight in Ethiopia. Also, among the five ML algorithms, xgbTree algorithm shows a better prediction ability than the generalized linear mixed algorithm. The best predicting algorithm (xgbTree) shows diverse important predictors of undernutrition across the three outcomes which include time to water source, anemia history, child age greater than 30 months, small birth size, and maternal underweight, among others. Conclusions: The xgbTree algorithm was a reasonably superior ML algorithm for predicting childhood undernutrition in Ethiopia compared to other ML algorithms considered in this study. The findings support improvement in access to water supply, food security, and fertility regulation among others in the quest to considerably improve childhood nutrition in Ethiopia.

Download Full-text

Deep learning and taphonomy: high accuracy in the classification of cut marks made on fleshed and defleshed bones using convolutional neural networks

Scientific Reports ◽

10.1038/s41598-019-55439-6 ◽

2019 ◽

Vol 9 (1) ◽

Cited By ~ 6

Author(s):

Gabriel Cifuentes-Alcobendas ◽

Manuel Domínguez-Rodrigo

Keyword(s):

Machine Learning ◽

Accurate Method ◽

High Accuracy ◽

Machine Learning Algorithms ◽

Accurate Identification ◽

Cut Marks ◽

Eating Meat ◽

Microscopic Features ◽

Immense Potential

AbstractAccurate identification of bone surface modifications (BSM) is crucial for the taphonomic understanding of archaeological and paleontological sites. Critical interpretations of when humans started eating meat and animal fat or when they started using stone tools, or when they occupied new continents or interacted with predatory guilds impinge on accurate identifications of BSM. Until now, interpretations of Plio-Pleistocene BSM have been contentious because of the high uncertainty in discriminating among taphonomic agents. Recently, the use of machine learning algorithms has yielded high accuracy in the identification of BSM. A branch of machine learning methods based on imaging, computer vision (CV), has opened the door to a more objective and accurate method of BSM identification. The present work has selected two extremely similar types of BSM (cut marks made on fleshed an defleshed bones) to test the immense potential of artificial intelligence methods. This CV approach not only produced the highest accuracy in the classification of these types of BSM until present (95% on complete images of BSM and 88.89% of images of only internal mark features), but it also has enabled a method for determining which inconspicuous microscopic features determine successful BSM discrimination. The potential of this method in other areas of taphonomy and paleobiology is enormous.

Download Full-text

Test data reuse for evaluation of adaptive machine learning algorithms: over-fitting to a fixed 'test' dataset and a potential solution

Medical Imaging 2018: Image Perception, Observer Performance, and Technology Assessment ◽

10.1117/12.2293818 ◽

2018 ◽

Cited By ~ 2

Author(s):

Alexej Gossmann ◽

Aria Pezeshk ◽

Berkman Sahiner

Keyword(s):

Machine Learning ◽

Test Data ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Data Reuse ◽

Potential Solution ◽

Test Dataset

Download Full-text

Does Machine Learning Automate Moral Hazard and Error?

The American Economic Review ◽

10.1257/aer.p20171084 ◽

2017 ◽

Vol 107 (5) ◽

pp. 476-480 ◽

Cited By ~ 31

Author(s):

Sendhil Mullainathan ◽

Ziad Obermeyer

Keyword(s):

Machine Learning ◽

Health Care ◽

Characteristic Feature ◽

Health Data ◽

Machine Learning Algorithms ◽

Learning Tools ◽

Human Judgment ◽

Measurement Issues ◽

Applications Of Machine Learning ◽

A Minor

Machine learning tools are beginning to be deployed en masse in health care. While the statistical underpinnings of these techniques have been questioned with regard to causality and stability, we highlight a different concern here, relating to measurement issues. A characteristic feature of health data, unlike other applications of machine learning, is that neither y nor x is measured perfectly. Far from a minor nuance, this can undermine the power of machine learning algorithms to drive change in the health care system--and indeed, can cause them to reproduce and even magnify existing errors in human judgment.

Download Full-text

Effective Methodology for Co-Referential Aspect Based Sentiment Analysis of Tourist Reviews

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit2062149 ◽

2020 ◽

pp. 523-529

Author(s):

Kamlesh A. Waghmare ◽

Sheetal K. Bhala

Keyword(s):

Neural Network ◽

Machine Learning ◽

Sentiment Analysis ◽

High Accuracy ◽

Machine Learning Algorithms ◽

Data Sets ◽

Knowledge Processing ◽

The Neural Network ◽

The World ◽

Sentiment Score

Tourist reviews are the source of data that is going to be used for the travelers around the world to find the hotels for their stay according to their comfort. In this the hotels are ranked over the parameters or aspects considered keeping travelers in mind. This computation of data sets is done with the help of the machine learning algorithms and the neural network. The knowledge processing done over the reviews generates the sentiment score for each hotel with respect to the aspects defined. Here, the explicit , implicit and co-referential aspects are identified by suppressing the noise. This paper proposes the method that can be best used for the detection of the sentiments with the high accuracy.

Download Full-text

Reverse-engineering human olfactory perception from chemical features of odor molecules

10.1101/082495 ◽

2016 ◽

Cited By ~ 2

Author(s):

Andreas Keller ◽

Richard C. Gerkin ◽

Yuanfang Guan ◽

Amit Dhurandhar ◽

Gabor Turu ◽

...

Keyword(s):

Machine Learning ◽

Random Forest ◽

Molecular Mechanisms ◽

Linear Models ◽

Predictive Accuracy ◽

High Accuracy ◽

Machine Learning Algorithms ◽

Olfactory Perception ◽

Theoretical Limit ◽

Reverse Engineer

AbstractDespite 25 years of progress in understanding the molecular mechanisms of olfaction, it is still not possible to predict whether a given molecule will have a perceived odor, or what olfactory percept it will produce. To address this stimulus-percept problem for olfaction, we organized the crowd-sourced DREAM Olfaction Prediction Challenge. Working from a large olfactory psychophysical dataset, teams developed machine learning algorithms to predict sensory attributes of molecules based on their chemoinformatic features. The resulting models predicted odor intensity and pleasantness with high accuracy, and also successfully predicted eight semantic descriptors (“garlic”, “fish”, “sweet”, “fruit”, “burnt”, “spices”, “flower”, “sour”). Regularized linear models performed nearly as well as random-forest-based approaches, with a predictive accuracy that closely approaches a key theoretical limit. The models presented here make it possible to predict the perceptual qualities of virtually any molecule with an impressive degree of accuracy to reverse-engineer the smell of a molecule.One Sentence SummaryResults of a crowdsourcing competition show that it is possible to accurately predict and reverse-engineer the smell of a molecule.

Download Full-text

An Improved Random Forest Algorithm for Class-Imbalanced Data Classification and its Application in PAD Risk Factors Analysis

The Open Electrical & Electronic Engineering Journal ◽

10.2174/1874129001307010062 ◽

2013 ◽

Vol 7 (1) ◽

pp. 62-70 ◽

Cited By ~ 9

Author(s):

Dengju Yao ◽

Jing Yang ◽

Xiaojuan Zhan

Keyword(s):

Machine Learning ◽

Random Forest ◽

Imbalanced Data ◽

Machine Learning Algorithms ◽

Majority Voting ◽

Training Dataset ◽

Random Forest Algorithm ◽

Research Subjects ◽

Minority Class ◽

Imbalanced Data Classification

The classification problem is one of the important research subjects in the field of machine learning. However, most machine learning algorithms train a classifier based on the assumption that the number of training examples of classes is almost equal. When a classifier was trained on imbalanced data, the performance of the classifier declined clearly. For resolving the class-imbalanced problem, an improved random forest algorithm was proposed based on sampling with replacement. We extracted multiple example subsets randomly with replacement from majority class, and the example number of extracted example subsets is as the same with minority class example dataset. Then, multiple new training datasets were constructed by combining the each exacted majority example subset and minority class dataset respectively, and multiple random forest classifiers were training on these training dataset. For a prediction example, the class was determined by majority voting of multiple random forest classifiers. The experimental results on five groups UCI datasets and a real clinical dataset show that the proposed method could deal with the class-imbalanced data problem and the improved random forest algorithm outperformed original random forest and other methods in literatures.

Download Full-text

Imbalance class problems in data mining: a review

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v14.i3.pp1552-1563 ◽

2019 ◽

Vol 14 (3) ◽

pp. 1552 ◽

Cited By ~ 3

Author(s):

Haseeb Ali ◽

Mohd Najib Mohd Salleh ◽

Rohmat Saedudin ◽

Kashif Hussain ◽

Muhammad Faheem Mushtaq

Keyword(s):

Machine Learning ◽

Data Mining ◽

Imbalanced Data ◽

Machine Learning Algorithms ◽

The Other ◽

Minority Class ◽

Future Directions ◽

Significant Research ◽

Comprehensive Survey ◽

Imbalanced Class

<span>The imbalanced data problems in data mining are common nowadays, which occur due to skewed nature of data. These problems impact the classification process negatively in machine learning process. In such problems, classes have different ratios of specimens in which a large number of specimens belong to one class and the other class has fewer specimens that is usually an essential class, but unfortunately misclassified by many classifiers. So far, significant research is performed to address the imbalanced data problems by implementing different techniques and approaches. In this research, a comprehensive survey is performed to identify the challenges of handling imbalanced class problems during classification process using machine learning algorithms. We discuss the issues of classifiers which endorse bias for majority class and ignore the minority class. Furthermore, the viable solutions and potential future directions are provided to handle the problems<em>.</em></span>

Download Full-text

Improved practices in machine learning algorithms for NTL detection with imbalanced data

2017 IEEE Power & Energy Society General Meeting ◽

10.1109/pesgm.2017.8273852 ◽

2017 ◽

Cited By ~ 8

Author(s):

Gerardo Figueroa ◽

Yi-Shin Chen ◽

Nelson Avila ◽

Chia-Chi Chu

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Imbalanced Data ◽

Machine Learning Algorithms

Download Full-text