Feature selection for high-dimensional class-imbalanced data sets using Support Vector Machines

The paper deals with problems that imbalanced and overlapping datasets often en- counter. Performance indicators as accuracy, precision and recall of imbalanced data sets, both with and without overlapping, are discussed and compared with the same performance indicators of balanced datasets with overlapping. Three popular classification algorithms, namely, Decision Tree, KNN (k-Nearest Neighbors) and SVM (Support Vector Machines) classifiers are analyzed and compared.

Download Full-text

High dimensional data classification and feature selection using support vector machines

European Journal of Operational Research ◽

10.1016/j.ejor.2017.08.040 ◽

2018 ◽

Vol 265 (3) ◽

pp. 993-1004 ◽

Cited By ~ 63

Author(s):

Bissan Ghaddar ◽

Joe Naoum-Sawaya

Keyword(s):

Feature Selection ◽

Support Vector Machines ◽

High Dimensional Data ◽

Data Classification ◽

High Dimensional ◽

Support Vector ◽

Vector Machines

Download Full-text

Feature Selection for Support Vector Machines in Financial Time Series Forecasting

Intelligent Data Engineering and Automated Learning — IDEAL 2000. Data Mining, Financial Engineering, and Intelligent Agents - Lecture Notes in Computer Science ◽

10.1007/3-540-44491-2_38 ◽

2000 ◽

pp. 268-273 ◽

Cited By ~ 5

Author(s):

L. J. Cao ◽

Francis E. H. Tay

Keyword(s):

Time Series ◽

Feature Selection ◽

Support Vector Machines ◽

Financial Time Series ◽

Time Series Forecasting ◽

Support Vector ◽

Financial Time ◽

Vector Machines ◽

Selection For

Download Full-text

A multi-objective genetic algorithm for simultaneous model and feature selection for support vector machines

Artificial Intelligence Review ◽

10.1007/s10462-017-9543-9 ◽

2017 ◽

Vol 50 (2) ◽

pp. 261-281 ◽

Cited By ~ 10

Author(s):

Amal Bouraoui ◽

Salma Jamoussi ◽

Yassine BenAyed

Keyword(s):

Genetic Algorithm ◽

Feature Selection ◽

Support Vector Machines ◽

Support Vector ◽

Multi Objective ◽

Multi Objective Genetic Algorithm ◽

Vector Machines ◽

Selection For ◽

Simultaneous Model

Download Full-text

Feature selection for support vector machines

Proceedings 15th International Conference on Pattern Recognition. ICPR-2000 ◽

10.1109/icpr.2000.906174 ◽

2002 ◽

Cited By ~ 27

Author(s):

L. Hermes ◽

J.M. Buhmann

Keyword(s):

Feature Selection ◽

Support Vector Machines ◽

Support Vector ◽

Vector Machines ◽

Selection For

Download Full-text

Cost-based feature selection for Support Vector Machines: An application in credit scoring

European Journal of Operational Research ◽

10.1016/j.ejor.2017.02.037 ◽

2017 ◽

Vol 261 (2) ◽

pp. 656-665 ◽

Cited By ~ 46

Author(s):

Sebastián Maldonado ◽

Juan Pérez ◽

Cristián Bravo

Keyword(s):

Feature Selection ◽

Support Vector Machines ◽

Credit Scoring ◽

Support Vector ◽

Vector Machines ◽

Selection For

Download Full-text

FEATURE SELECTION FOR SUPPORT VECTOR MACHINES USING GENETIC ALGORITHMS

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213004001818 ◽

2004 ◽

Vol 13 (04) ◽

pp. 791-800 ◽

Cited By ~ 26

Author(s):

HOLGER FRÖHLICH ◽

OLIVIER CHAPELLE ◽

BERNHARD SCHÖLKOPF

Keyword(s):

Genetic Algorithms ◽

Feature Selection ◽

Support Vector Machines ◽

Cross Validation ◽

Support Vector ◽

Generalization Error ◽

New Approach ◽

Vector Machines ◽

Selection For ◽

Natural Way

The problem of feature selection is a difficult combinatorial task in Machine Learning and of high practical relevance, e.g. in bioinformatics. Genetic Algorithms (GAs) offer a natural way to solve this problem. In this paper we present a special Genetic Algorithm, which especially takes into account the existing bounds on the generalization error for Support Vector Machines (SVMs). This new approach is compared to the traditional method of performing cross-validation and to other existing algorithms for feature selection.

Download Full-text