Machine learning improves the prediction of febrile neutropenia in Korean inpatients undergoing chemotherapy for breast cancer

2020, Vol 10 (1)
Author(s): Bum-Joo Cho, Kyoung Min Kim, Sanchir-Erdene Bilegsaikhan, Yong Joon Suh

Abstract Febrile neutropenia (FN) is one of the most concerning complications of chemotherapy, and its prediction remains difficult. This study aimed to reveal the risk factors for FN and to build prediction models using machine learning algorithms. Medical records of hospitalized patients who underwent chemotherapy after surgery for breast cancer between May 2002 and September 2018 were selectively reviewed for model development. Demographic, clinical, pathological, and therapeutic data were analyzed to identify risk factors for FN. Prediction models were developed with machine learning algorithms and evaluated for performance. Of 933 selected inpatients with a mean age of 51.8 ± 10.7 years, FN developed in 409 (43.8%). FN incidence differed significantly according to age, stage, taxane-based regimen, and blood count 5 days after chemotherapy. A logistic regression model built on these findings achieved an area under the curve (AUC) of 0.870, which machine learning improved to 0.908. Machine learning thus improves the prediction of FN in patients undergoing chemotherapy for breast cancer compared with the conventional statistical model. In these high-risk patients, primary prophylaxis with granulocyte colony-stimulating factor could be considered.
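A minimal sketch (not the authors' code) of the kind of comparison the abstract describes: a conventional logistic regression versus a machine learning model (here gradient boosting, as an illustrative stand-in) compared by AUC. The features, their distributions, and the labels are hypothetical placeholders.

```python
# Sketch only: compare logistic regression and gradient boosting by AUC
# on synthetic, placeholder data loosely mirroring the abstract's predictors.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
n = 933
# Hypothetical predictors: age, stage, taxane-based regimen (0/1),
# and blood count 5 days after chemotherapy.
X = np.column_stack([
    rng.normal(51.8, 10.7, n),   # age
    rng.integers(1, 4, n),       # stage
    rng.integers(0, 2, n),       # taxane-based regimen
    rng.normal(4.0, 1.5, n),     # day-5 blood count
])
# Placeholder outcome: FN more likely when the day-5 count is low.
y = (X[:, 3] + rng.normal(0, 1.5, n) < 4.0).astype(int)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

lr = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
gb = GradientBoostingClassifier(random_state=0).fit(X_tr, y_tr)

print("Logistic regression AUC:", roc_auc_score(y_te, lr.predict_proba(X_te)[:, 1]))
print("Gradient boosting AUC:  ", roc_auc_score(y_te, gb.predict_proba(X_te)[:, 1]))
```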

2020, Vol 20 (1)
Author(s): Matthijs Blankers, Louk F. M. van der Post, Jack J. M. Dekker

Abstract Background Accurate models for predicting whether patients on the verge of a psychiatric crisis need hospitalization are lacking, and machine learning methods may help improve their accuracy. In this paper we evaluate the accuracy of ten machine learning algorithms, including the generalized linear model (GLM/logistic regression), for predicting psychiatric hospitalization in the first 12 months after a psychiatric crisis care contact. We also evaluate an ensemble model to optimize accuracy and explore individual predictors of hospitalization. Methods Data from 2084 patients in the longitudinal Amsterdam Study of Acute Psychiatry with at least one reported psychiatric crisis care contact were included. The target variable for the prediction models was whether the patient was hospitalized in the 12 months following inclusion. The predictive power of 39 variables related to patients’ socio-demographics, clinical characteristics and previous mental health care contacts was evaluated. The accuracy and area under the receiver operating characteristic curve (AUC) of the machine learning algorithms were compared, and the relative importance of each predictor variable was estimated. The best- and worst-performing algorithms were compared with GLM/logistic regression using net reclassification improvement (NRI) analysis, and the five best-performing algorithms were combined in an ensemble model using stacking. Results All models performed above chance level. Gradient Boosting was the best-performing algorithm (AUC = 0.774) and K-Nearest Neighbors the worst (AUC = 0.702). The performance of GLM/logistic regression (AUC = 0.76) was slightly above average among the tested algorithms. In the NRI analysis, Gradient Boosting outperformed GLM/logistic regression by 2.9% and K-Nearest Neighbors by 11.3%, and GLM/logistic regression outperformed K-Nearest Neighbors by 8.7%. Nine of the top-10 most important predictor variables were related to previous mental health care use. Conclusions Gradient Boosting yielded the highest predictive accuracy and AUC, while GLM/logistic regression performed about average among the tested algorithms. Although statistically significant, the differences between the machine learning algorithms were in most cases modest. The results show that combining multiple algorithms in an ensemble model can achieve a predictive accuracy similar to that of the best-performing model.
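A minimal sketch (not the study's pipeline) of comparing several classifiers by AUC and combining them in a stacked ensemble, as described above. The synthetic data, the particular base learners, and the grid of models are illustrative assumptions only.

```python
# Sketch only: fit several classifiers, then stack them with a logistic
# regression meta-learner, and compare test-set AUCs.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.ensemble import (StackingClassifier, GradientBoostingClassifier,
                              RandomForestClassifier)
from sklearn.neighbors import KNeighborsClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

# Placeholder data sized like the study (2084 patients, 39 predictors).
X, y = make_classification(n_samples=2084, n_features=39, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

base = [
    ("gradient boosting", GradientBoostingClassifier(random_state=0)),
    ("random forest", RandomForestClassifier(random_state=0)),
    ("k-nearest neighbors", KNeighborsClassifier()),
    ("glm/logistic regression", LogisticRegression(max_iter=1000)),
]
ensemble = StackingClassifier(estimators=base,
                              final_estimator=LogisticRegression(max_iter=1000),
                              stack_method="predict_proba", cv=5)

for name, model in base + [("stacked ensemble", ensemble)]:
    model.fit(X_tr, y_tr)
    auc = roc_auc_score(y_te, model.predict_proba(X_te)[:, 1])
    print(f"{name}: AUC = {auc:.3f}")
```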


Information, 2022, Vol 13 (1), pp. 35
Author(s): Jibouni Ayoub, Dounia Lotfi, Ahmed Hammouch

The analysis of social networks has attracted a lot of attention during the last two decades. These networks are dynamic: new links appear and disappear over time. Link prediction is the problem of inferring which links will appear in the future from the current state of the network. We use information from nodes and edges to calculate the similarity between users: the more similar two users are, the higher the probability that they will be connected in the future. Similarity metrics play an important role in link prediction. Owing to their simplicity and flexibility, many metrics have been proposed, such as Jaccard, Adamic-Adar (AA), and Katz, and evaluated using the area under the curve (AUC). In this paper, we propose a new parameterized method to enhance the AUC value of link prediction metrics by combining them with the mean received resources (MRRs). Experiments show that the proposed method improves the performance of state-of-the-art metrics. Moreover, we used machine learning algorithms to classify links and confirm the effectiveness of the proposed combination.
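A minimal sketch of similarity-based link prediction evaluated by AUC, of the kind discussed above: existing metrics (Jaccard, Adamic-Adar) are scored on held-out edges, and a simple parameterized weighted sum stands in for the paper's MRR-based combination, whose exact formulation may differ. The toy graph, the hold-out scheme, and the parameter alpha are assumptions.

```python
# Sketch only: score held-out node pairs with classical similarity metrics
# and with a hypothetical parameterized combination, then compare AUCs.
import random
import networkx as nx
from sklearn.metrics import roc_auc_score

random.seed(0)
G = nx.karate_club_graph()                      # toy social network

# Hold out ~10% of existing edges as positive test examples.
edges = list(G.edges())
pos = random.sample(edges, k=len(edges) // 10)
G_train = G.copy()
G_train.remove_edges_from(pos)

# Sample an equal number of non-edges as negative test examples.
neg = random.sample(list(nx.non_edges(G)), k=len(pos))
pairs = pos + neg
labels = [1] * len(pos) + [0] * len(neg)

def scores(metric):
    """Similarity score of each candidate pair on the training graph."""
    return [s for _, _, s in metric(G_train, pairs)]

jaccard = scores(nx.jaccard_coefficient)
adamic = scores(nx.adamic_adar_index)

# Hypothetical parameterized combination of two metrics (alpha to be tuned).
alpha = 0.5
combined = [alpha * j + (1 - alpha) * a for j, a in zip(jaccard, adamic)]

for name, s in [("Jaccard", jaccard), ("Adamic-Adar", adamic), ("combined", combined)]:
    print(f"{name}: AUC = {roc_auc_score(labels, s):.3f}")
```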


2021, Vol 9
Author(s): Huanhuan Zhao, Xiaoyu Zhang, Yang Xu, Lisheng Gao, Zuchang Ma, et al.

Hypertension is a widespread chronic disease. Risk prediction is an intervention that contributes to the early prevention and management of hypertension, but it requires an effective and easy-to-implement prediction model. This study evaluated and compared the performance of four machine learning algorithms in predicting the risk of hypertension from easy-to-collect risk factors. A dataset of 29,700 samples collected through physical examinations was used for model training and testing. First, we identified easy-to-collect risk factors of hypertension through univariate logistic regression analysis. Then, based on the selected features, 10-fold cross-validation was used to find the best hyper-parameters on the training set for four models: random forest (RF), CatBoost, a multilayer perceptron (MLP) neural network, and logistic regression (LR). Finally, the performance of the models was evaluated by AUC, accuracy, sensitivity, and specificity on the test set. The experimental results showed that the RF model outperformed the other three, achieving an AUC of 0.92, an accuracy of 0.82, a sensitivity of 0.83, and a specificity of 0.81. In addition, body mass index (BMI), age, family history, and waist circumference (WC) are the four primary risk factors of hypertension. These findings indicate that it is feasible to use machine learning algorithms, especially RF, to predict hypertension risk without clinical or genetic data, providing a non-invasive and economical way to prevent and manage hypertension in a large population.
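A minimal sketch (not the study's code) of the workflow outlined above: tune a random forest with 10-fold cross-validation on a training set, then report AUC, accuracy, sensitivity, and specificity on a held-out test set. The synthetic data and the small hyper-parameter grid are illustrative assumptions.

```python
# Sketch only: 10-fold CV hyper-parameter search for a random forest,
# followed by test-set evaluation with the metrics used in the abstract.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.metrics import roc_auc_score, accuracy_score, confusion_matrix

X, y = make_classification(n_samples=5000, n_features=10, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

grid = {"n_estimators": [100, 300], "max_depth": [5, 10, None]}
search = GridSearchCV(RandomForestClassifier(random_state=0),
                      grid, cv=10, scoring="roc_auc", n_jobs=-1)
search.fit(X_tr, y_tr)

best = search.best_estimator_
pred = best.predict(X_te)
proba = best.predict_proba(X_te)[:, 1]
tn, fp, fn, tp = confusion_matrix(y_te, pred).ravel()

print("AUC:        ", round(roc_auc_score(y_te, proba), 3))
print("Accuracy:   ", round(accuracy_score(y_te, pred), 3))
print("Sensitivity:", round(tp / (tp + fn), 3))
print("Specificity:", round(tn / (tn + fp), 3))
```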


2013, Vol 99 (3), pp. S4
Author(s): Joseph Lee, Jennifer Cohen, Hrishikesh Karvir, Piraye Yurttas Beim, Jason Barritt, et al.
