optimal feature selection
Recently Published Documents


TOTAL DOCUMENTS

234
(FIVE YEARS 102)

H-INDEX

24
(FIVE YEARS 6)

2021 ◽  
Vol 14 (1) ◽  
pp. 108-113
Author(s):  
Zeina Rayan ◽  
Marco Alfonse ◽  
Abdel-Badeeh M. Salem

Background: As sepsis is one of the life-threatening diseases, predicting sepsis with high accuracy could help save lives. Methods: Efficiency and accuracy of predicting sepsis can be enhanced through optimal feature selection. In this work, a support vector machine model is proposed to automatically predict a patient’s risk of sepsis based on physiological data collected from the ICU. Results: The support vector machine algorithm that uses the extracted features has a great impact on sepsis prediction, which yields the accuracy of 0.73. Conclusion: Predicting sepsis can be accurately performed using the main vital signs and support vector machine.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Feng Lin ◽  
Jiarui Han ◽  
Teng Xue ◽  
Jilan Lin ◽  
Shenggen Chen ◽  
...  

AbstractMany studies report predictions for cognitive function but there are few predictions in epileptic patients; therefore, we established a workflow to efficiently predict outcomes of both the Mini-Mental State Examination (MMSE) and Montreal Cognitive Assessment (MoCA) in outpatients with epilepsy. Data from 441 outpatients with epilepsy were included; of these, 433 patients met the 12 clinical characteristic criteria and were divided into training (n = 304) and experimental (n = 129) groups. After descriptive statistics were analyzed, cross-validation was used to select the optimal model. The random forest (RF) algorithm was combined with the redundancy analysis (RDA) algorithm; then, optimal feature selection and resampling were carried out after removing linear redundancy information. The features that contributed more to multiple outcomes were selected. Finally, the external traceability of the model was evaluated using the follow-up data. The RF algorithm was the best prediction model for both MMSE and MoCA outcomes. Finally, seven markers were screened by overlapping the top ten important features for MMSE ranked by RF modeling, those ranked for MoCA ranked by RF modeling, and those for both assessments ranked by RDA. The optimal combination of features were namely, sex, age, age of onset, seizure frequency, brain MRI abnormalities, epileptiform discharge in EEG and usage of drugs. which was the most efficient in predicting outcomes of MMSE, MoCA, and both assessments.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Nageswara Rao Eluri ◽  
Gangadhara Rao Kancharla ◽  
Suresh Dara ◽  
Venkatesulu Dondeti

PurposeGene selection is considered as the fundamental process in the bioinformatics field. The existing methodologies pertain to cancer classification are mostly clinical basis, and its diagnosis capability is limited. Nowadays, the significant problems of cancer diagnosis are solved by the utilization of gene expression data. The researchers have been introducing many possibilities to diagnose cancer appropriately and effectively. This paper aims to develop the cancer data classification using gene expression data.Design/methodology/approachThe proposed classification model involves three main phases: “(1) Feature extraction, (2) Optimal Feature Selection and (3) Classification”. Initially, five benchmark gene expression datasets are collected. From the collected gene expression data, the feature extraction is performed. To diminish the length of the feature vectors, optimal feature selection is performed, for which a new meta-heuristic algorithm termed as quantum-inspired immune clone optimization algorithm (QICO) is used. Once the relevant features are selected, the classification is performed by a deep learning model called recurrent neural network (RNN). Finally, the experimental analysis reveals that the proposed QICO-based feature selection model outperforms the other heuristic-based feature selection and optimized RNN outperforms the other machine learning methods.FindingsThe proposed QICO-RNN is acquiring the best outcomes at any learning percentage. On considering the learning percentage 85, the accuracy of the proposed QICO-RNN was 3.2% excellent than RNN, 4.3% excellent than RF, 3.8% excellent than NB and 2.1% excellent than KNN for Dataset 1. For Dataset 2, at learning percentage 35, the accuracy of the proposed QICO-RNN was 13.3% exclusive than RNN, 8.9% exclusive than RF and 14.8% exclusive than NB and KNN. Hence, the developed QICO algorithm is performing well in classifying the cancer data using gene expression data accurately.Originality/valueThis paper introduces a new optimal feature selection model using QICO and QICO-based RNN for effective classification of cancer data using gene expression data. This is the first work that utilizes an optimal feature selection model using QICO and QICO-RNN for effective classification of cancer data using gene expression data.


Sign in / Sign up

Export Citation Format

Share Document