Double-smoothing in Kernel Hazard Rate Estimation

A. Pfahlberg; O. Gefeller; R. Weißbach

doi:10.3414/me0447

Double-smoothing in Kernel Hazard Rate Estimation

Methods of Information in Medicine ◽

10.3414/me0447 ◽

2008 ◽

Vol 47 (02) ◽

pp. 167-173 ◽

Cited By ~ 5

Author(s):

A. Pfahlberg ◽

O. Gefeller ◽

R. Weißbach

Keyword(s):

Hazard Rate ◽

Nearest Neighbor ◽

Bandwidth Selection ◽

Study Data ◽

Computational Effort ◽

Finite Sample ◽

Adaptive Smoothing ◽

Rate Estimation ◽

Data Adaptive ◽

Double Smoothing

Summary

Objectives: In oncological studies, the hazard rate can be used to differentiate subgroups of the study population according to their patterns of survival risk over time. Nonparametric curve estimation has been suggested as an exploratory means of revealing such patterns. The decision about the type of smoothing parameter is critical for performance in practice. In this paper, we study data-adaptive smoothing.

Methods: A decade ago, the nearest-neighbor bandwidth was introduced for censored data in survival analysis. It is specified by one parameter, namely the number of nearest neighbors. Bandwidth selection in this setting has rarely been investigated, although the heuristic advantages over the frequently studied fixed bandwidth are quite obvious. The asymptotic relationship between the fixed and the nearest-neighbor bandwidth can be used to generate novel approaches.

Results: We develop a new selection algorithm, termed double-smoothing, for the nearest-neighbor bandwidth in hazard rate estimation. Our approach uses a finite-sample approximation of the asymptotic relationship between the fixed and the nearest-neighbor bandwidth. In doing so, we identify the nearest-neighbor bandwidth as an additional smoothing step and achieve further data adaptation after fixed-bandwidth smoothing. We illustrate the application of the new algorithm in a clinical study and compare the outcome to the traditional fixed-bandwidth result, thus demonstrating the practical performance of the technique.

Conclusion: The double-smoothing approach enlarges the methodological repertoire for selecting smoothing parameters in nonparametric hazard rate estimation. The slight increase in computational effort is rewarded with a substantial gain in estimation stability, thus demonstrating the benefit of the technique for biostatistical applications.
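To make the two bandwidth types in this abstract concrete, here is a minimal Python sketch. It is not the authors' double-smoothing algorithm, which the abstract does not spell out. It assumes a standard kernel hazard estimator (kernel-smoothed Nelson-Aalen increments) with an Epanechnikov kernel, no tied observation times, and a nearest-neighbor bandwidth defined as the distance to the k-th nearest uncensored observation. The function names `kernel_hazard` and `nn_bandwidth` are hypothetical.

```python
from typing import Sequence


def epanechnikov(u: float) -> float:
    """Epanechnikov kernel, supported on [-1, 1]."""
    return 0.75 * (1.0 - u * u) if abs(u) <= 1.0 else 0.0


def kernel_hazard(t: float, times: Sequence[float],
                  events: Sequence[int], bandwidth: float) -> float:
    """Kernel-smoothed Nelson-Aalen increments at time t.

    times  : observed times (event or censoring), assumed free of ties
    events : 1 if the time is an observed event, 0 if censored
    """
    n = len(times)
    order = sorted(range(n), key=lambda i: times[i])
    total = 0.0
    for rank, i in enumerate(order):
        if events[i]:
            at_risk = n - rank  # subjects still under observation at times[i]
            u = (t - times[i]) / bandwidth
            total += epanechnikov(u) / (bandwidth * at_risk)
    return total


def nn_bandwidth(t: float, times: Sequence[float],
                 events: Sequence[int], k: int) -> float:
    """Assumed definition: distance from t to the k-th nearest uncensored time."""
    dists = sorted(abs(t - x) for x, d in zip(times, events) if d)
    return dists[k - 1]
```

With a fixed bandwidth, the same value of `bandwidth` is passed for every `t`. With the nearest-neighbor variant, the bandwidth is recomputed at each evaluation point, for example `kernel_hazard(t, times, events, nn_bandwidth(t, times, events, k))`, so the window widens where uncensored observations are sparse and narrows where they are dense.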

Local linear hazard rate estimation and bandwidth selection

Annals of the Institute of Statistical Mathematics ◽

10.1007/s10463-010-0277-6 ◽

2010 ◽

Vol 63 (5) ◽

pp. 1019-1046 ◽

Cited By ~ 9

Author(s):

Dimitrios Bagkavos

Keyword(s):

Hazard Rate ◽

Bandwidth Selection ◽

Rate Estimation ◽

Local Linear

Smoothed bootstrap bandwidth selection for nonparametric hazard rate estimation

Journal of Statistical Computation and Simulation ◽

10.1080/00949655.2018.1532512 ◽

2018 ◽

Vol 89 (1) ◽

pp. 15-37 ◽

Cited By ~ 4

Author(s):

Inés Barbeito ◽

Ricardo Cao

Keyword(s):

Hazard Rate ◽

Bandwidth Selection ◽

Rate Estimation ◽

Smoothed Bootstrap ◽

Selection For

Computationally Efficient Bootstrap Expressions for Bandwidth Selection in Nonparametric Curve Estimation

Proceedings ◽

10.3390/proceedings2181164 ◽

2018 ◽

Vol 2 (18) ◽

pp. 1164

Author(s):

Inés Barbeito ◽

Ricardo Cao

Keyword(s):

Hazard Rate ◽

Bandwidth Selection ◽

Kernel Density ◽

Density Estimator ◽

Computationally Efficient ◽

Rate Estimation ◽

Smoothed Bootstrap ◽

Stationary Bootstrap ◽

Moving Blocks Bootstrap ◽

Nonparametric Kernel

Bootstrap methods are used for bandwidth selection in: (1) nonparametric kernel density estimation with dependent data (smoothed stationary bootstrap and smoothed moving blocks bootstrap), and (2) nonparametric kernel hazard rate estimation (smoothed bootstrap). In these contexts, four new bandwidth parameter selectors are proposed based on closed bootstrap expressions of the MISE of the kernel density estimator (case 1) and two approximations of the kernel hazard rate estimation (case 2). These expressions turn out to be very useful since Monte Carlo approximation is no longer needed. Finally, these smoothing parameter selectors are empirically compared with the already existing ones via a simulation study.
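For orientation, the following Python sketch shows the kind of Monte Carlo smoothed-bootstrap bandwidth search that closed-form bootstrap expressions are meant to make unnecessary. It is not the selectors proposed in the paper. It assumes independent data, a Gaussian kernel, a user-supplied pilot bandwidth `g`, and a simple Riemann sum for the integrated squared error. All function names and defaults are hypothetical.

```python
import math
import random


def gauss_kde(x, data, h):
    """Gaussian kernel density estimate at x with bandwidth h."""
    c = 1.0 / (len(data) * h * math.sqrt(2.0 * math.pi))
    return c * sum(math.exp(-0.5 * ((x - xi) / h) ** 2) for xi in data)


def bootstrap_mise(data, h, g, grid, n_boot, rng):
    """Monte Carlo estimate of the smoothed-bootstrap MISE for bandwidth h."""
    step = grid[1] - grid[0]
    # The pilot estimate with bandwidth g plays the role of the true density.
    pilot = [gauss_kde(x, data, g) for x in grid]
    total = 0.0
    for _ in range(n_boot):
        # Smoothed bootstrap: resample the data, then add kernel noise of scale g.
        star = [rng.choice(data) + g * rng.gauss(0.0, 1.0) for _ in data]
        ise = sum((gauss_kde(x, star, h) - p) ** 2 for x, p in zip(grid, pilot))
        total += ise * step  # Riemann sum for the integrated squared error
    return total / n_boot


def select_bandwidth(data, candidates, g, n_boot=20, seed=0):
    """Return the candidate with the smallest bootstrap MISE, plus all scores."""
    rng = random.Random(seed)
    lo, hi = min(data) - 3 * g, max(data) + 3 * g
    grid = [lo + (hi - lo) * j / 79 for j in range(80)]
    scores = {h: bootstrap_mise(data, h, g, grid, n_boot, rng) for h in candidates}
    return min(scores, key=scores.get), scores
```

The cost of this approach grows with the number of bootstrap resamples times the number of candidate bandwidths, which is the expense a closed expression for the bootstrap MISE would avoid.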

Nonparametric Methods for Hazard Rate Estimation from Right-Censored Samples.

10.21236/ada159131 ◽

1985 ◽

Author(s):

D. T. McNichols ◽

W. J. Padgett

Keyword(s):

Hazard Rate ◽

Nonparametric Methods ◽

Rate Estimation ◽

Censored Samples

Transformations In Hazard Rate Estimation For Heavy-Tailed Data

IFAC Proceedings Volumes ◽

10.3182/20130619-3-ru-3018.00262 ◽

2013 ◽

Vol 46 (9) ◽

pp. 928-932

Author(s):

Dimitrios Bagkavos

Keyword(s):

Hazard Rate ◽

Rate Estimation ◽

Heavy Tailed

Data Mining Approach to Analyze COVID-19 Clinical Dataset

10.53350/pjmhs211561812 ◽

2021 ◽

Vol 15 (6) ◽

pp. 1812-1819

Author(s):

Azita Yazdani ◽

Ramin Ravangard ◽

Roxana Sharifian

Keyword(s):

Artificial Intelligence ◽

Data Mining ◽

Support Vector Machine ◽

Nearest Neighbor ◽

Clinical Signs ◽

Study Data ◽

Mining Machine ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Mining Approach

The new coronavirus has been spreading since the beginning of 2020, and many efforts have been made to develop vaccines to help patients recover. It is now clear that the world needs a rapid solution to curb the spread of COVID-19 with non-clinical approaches such as data mining, enhanced intelligence, and other artificial intelligence techniques. These approaches can reduce the burden on the health care system by providing the best possible way to diagnose and predict the COVID-19 epidemic. In this study, data mining models for early detection of COVID-19 in patients were developed using the epidemiological dataset of patients and individuals suspected of having COVID-19 in Iran. C4.5, support vector machine, Naive Bayes, logistic regression, Random Forest, and k-nearest neighbor algorithms were applied directly to the dataset using RapidMiner to develop the models. Given a patient's clinical signs, the model diagnoses the risk of contracting the COVID-19 virus. Examination of the models in this study showed that the support vector machine, with 93.41% accuracy, was the best of the developed models for diagnosing patients with COVID-19.

Keywords: COVID-19, Data mining, Machine Learning, Artificial Intelligence, Classification

Minimax theory of nonparametric hazard rate estimation: efficiency and adaptation

Annals of the Institute of Statistical Mathematics ◽

10.1007/s10463-014-0487-4 ◽

2014 ◽

Vol 68 (1) ◽

pp. 25-75 ◽

Cited By ~ 4

Author(s):

Sam Efromovich

Keyword(s):

Hazard Rate ◽

Rate Estimation ◽

Minimax Theory ◽

Estimation Efficiency

Finite sample performance of kernel-based regression methods for non-parametric additive models under common bandwidth selection criterion

Journal of Nonparametric Statistics ◽

10.1080/10485250701297933 ◽

2007 ◽

Vol 19 (1) ◽

pp. 23-62 ◽

Cited By ~ 10

Author(s):

Carlos Martins-Filho ◽

Ke Yang

Keyword(s):

Bandwidth Selection ◽

Selection Criterion ◽

Additive Models ◽

Finite Sample ◽

Regression Methods ◽

Non Parametric

Bootstrap Selection of the Smoothing Parameter in Nonparametric Hazard Rate Estimation

Journal of the American Statistical Association ◽

10.2307/2291732 ◽

1996 ◽

Vol 91 (435) ◽

pp. 1130 ◽

Cited By ~ 19

Author(s):

W. Gonzalez-Manteiga ◽

R. Cao ◽

J. S. Marron

Keyword(s):

Hazard Rate ◽

Smoothing Parameter ◽

Rate Estimation ◽

Selection Of

Hazard rate estimation for call center customer patience time

IISE Transactions ◽

10.1080/24725854.2019.1692159 ◽

2020 ◽

Vol 52 (8) ◽

pp. 890-903

Author(s):

Han Ye ◽

Lawrence D. Brown ◽

Haipeng Shen

Keyword(s):

Call Center ◽

Hazard Rate ◽

Rate Estimation
