Robust ensemble feature selection for high dimensional data sets

Clustering high-dimensional data is especially difficult because a subset of features must be selected from all the features present in categorical data sources. Feature subset selection is an effective way to reduce feature dimensionality in data mining and pattern identification. Its main aims are to select an optimal set of features and to reduce redundancy. This paper describes a feature selection approach based on data granules for handling redundant and irrelevant features when exploring high-dimensional sample data. We propose a Novel Granular Feature Multi-variant Clustering based Genetic Algorithm (NGFMCGA) model and evaluate its performance. The model consists of two phases. In the first phase, a graph-theoretic grouping procedure divides the features into clusters. In the second phase, a strongly related, representative feature is selected from each cluster to form the feature subset. Because the selected features come from different clusters, they are largely independent of one another, so the clustering-based approach has a high probability of producing independent and useful features. Optimal feature subset selection improves the accuracy of clustering and feature classification. The proposed approach is applied to publicly available data sets and compared with traditional supervised evolutionary approaches, and it achieves better accuracy with the selected feature subset.
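The two-phase idea in the abstract (group features, then keep one representative per group) can be illustrated with a minimal Python sketch. This is not the NGFMCGA model: the abstract gives no algorithmic details, so the sketch assumes that features are linked when their absolute Pearson correlation reaches a hypothetical `threshold`, that clusters are the connected components of that graph, and that the representative of each cluster is the feature most correlated with a target vector. The genetic-algorithm component is omitted entirely, and all function names and data are illustrative.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def cluster_features(columns, threshold=0.8):
    """Phase 1 (assumed): connected components of the graph that joins
    features i and j whenever |corr(i, j)| >= threshold."""
    n = len(columns)
    parent = list(range(n))  # union-find forest over feature indices

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            if abs(pearson(columns[i], columns[j])) >= threshold:
                parent[find(i)] = find(j)

    clusters = {}
    for i in range(n):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


def select_representatives(columns, target, threshold=0.8):
    """Phase 2 (assumed): from each cluster keep the feature whose
    absolute correlation with the target is largest."""
    selected = []
    for cluster in cluster_features(columns, threshold):
        best = max(cluster, key=lambda i: abs(pearson(columns[i], target)))
        selected.append(best)
    return sorted(selected)


# Illustrative data: feature 1 is nearly a rescaled copy of feature 0,
# feature 2 is unrelated to both.
columns = [
    [1, 2, 3, 4, 5, 6],
    [2, 4, 6, 8, 10, 13],
    [5, 1, 4, 2, 6, 3],
]
target = [1.1, 2.0, 2.9, 4.2, 5.0, 6.1]
selected = select_representatives(columns, target)
```

On this toy data the two nearly collinear features fall into one cluster, so only one of them survives alongside the unrelated third feature. In a supervised setting the target would typically be the class label; an unsupervised variant would need a different representative criterion.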


An Information-theoretic Approach to Unsupervised Feature Selection for High-Dimensional Data

10.3390/ecea-5-06697 ◽

2019 ◽

Cited By ~ 1

Author(s):

Shao-Lun Huang

Keyword(s):

Feature Selection ◽

High Dimensional Data ◽

High Dimensional ◽

Theoretic Approach ◽

Unsupervised Feature Selection ◽

Information Theoretic ◽

Selection For ◽

Information Theoretic Approach


Feature Selection for Small Sample Sets with High Dimensional Data Using Heuristic Hybrid Approach

International Journal of Engineering ◽

10.5829/ije.2020.33.02b.05 ◽

2020 ◽

Vol 33 (2) ◽

Keyword(s):

Feature Selection ◽

High Dimensional Data ◽

Hybrid Approach ◽

Small Sample ◽

High Dimensional ◽

Selection For


A Novel Method of Nonlinear Rapid Feature Selection for High Dimensional Data and Its Application in Peptide QSAR Modeling Based on Support Vector Machine

Acta Physico-Chimica Sinica ◽

10.3866/pku.whxb20110735 ◽

2011 ◽

Vol 27 (07) ◽

pp. 1654-1660 ◽

Cited By ~ 3

Author(s):

DAI Zhi-Jun ◽

ZHOU Wei ◽

YUAN Zhe-Ming ◽

Keyword(s):

Support Vector Machine ◽

Feature Selection ◽

High Dimensional Data ◽

High Dimensional ◽

Support Vector ◽

QSAR Modeling ◽

Novel Method ◽

Selection For
