missing value estimation
Recently Published Documents


TOTAL DOCUMENTS

91
(FIVE YEARS 8)

H-INDEX

13
(FIVE YEARS 0)

2021 ◽  
Vol 2 (4) ◽  
pp. 347-370
Author(s):  
Danh V. Nguyen ◽  
Naisyin Wang ◽  
Raymond J. Carroll

IEEE Access ◽  
2021 ◽  
Vol 9 ◽  
pp. 16899-16913
Author(s):  
Aiguo Wang ◽  
Jing Yang ◽  
Ning An

2020 ◽  
Vol 2020 ◽  
pp. 1-9
Author(s):  
Fei Yang ◽  
Jiazhi Du ◽  
Jiying Lang ◽  
Weigang Lu ◽  
Lei Liu ◽  
...  

Electrocardiogram (ECG) signal is critical to the classification of cardiac arrhythmia using some machine learning methods. In practice, the ECG datasets are usually with multiple missing values due to faults or distortion. Unfortunately, many established algorithms for classification require a fully complete matrix as input. Thus it is necessary to impute the missing data to increase the effectiveness of classification for datasets with a few missing values. In this paper, we compare the main methods for estimating the missing values in electrocardiogram data, e.g., the “Zero method”, “Mean method”, “PCA-based method”, and “RPCA-based method” and then propose a novel KNN-based classification algorithm, i.e., a modified kernel Difference-Weighted KNN classifier (MKDF-WKNN), which is fit for the classification of imbalance datasets. The experimental results on the UCI database indicate that the “RPCA-based method” can successfully handle missing values in arrhythmia dataset no matter how many values in it are missing and our proposed classification algorithm, MKDF-WKNN, is superior to other state-of-the-art algorithms like KNN, DS-WKNN, DF-WKNN, and KDF-WKNN for uneven datasets which impacts the accuracy of classification.


2020 ◽  
Vol 4 (4) ◽  
pp. 365-382
Author(s):  
Xiao Xu ◽  
Xiaoshuang Liu ◽  
Yanni Kang ◽  
Xian Xu ◽  
Junmei Wang ◽  
...  

Proceedings ◽  
2018 ◽  
Vol 2 (11) ◽  
pp. 698 ◽  
Author(s):  
Klemen Kenda ◽  
Filip Koprivec ◽  
Dunja Mladenić

In this study an algorithm for missing data imputation is presented. The algorithm uses measurements from neighboring sensors to estimate the missing values. Data-driven approach is used and methodology chooses the optimal available combination of modeling algorithm and available measurements to produce an estimate from the model with lowest error. The methodology was tested on Ljubljana polje aquifer data and has produced close to perfect results.


Sign in / Sign up

Export Citation Format

Share Document