rule discovery
Recently Published Documents


TOTAL DOCUMENTS

309
(FIVE YEARS 28)

H-INDEX

25
(FIVE YEARS 2)

2021 ◽  
Vol 1 (2) ◽  
pp. 54-66
Author(s):  
M. Hamdani Santoso

Data mining can generally be defined as a technique for finding patterns (extraction) or interesting information in large amounts of data that have meaning for decision support. One of the well-known and commonly used association rule discovery data mining methods is the Apriori algorithm. The Association Rule and the Apriori Algorithm are two very prominent algorithms for finding a number of frequently occurring sets of items from transaction data stored in databases. The calculation is done to determine the minimum value of support and minimum confidence that will produce the association rule. The association rule is used to produce the percentage of purchasing activity for an itemset within a certain period of time using the RapidMiner software. The results of the test using the priori algorithm method show that the association rule, that customers often buy toothpaste and detergents that have met the minimum confidence value. By searching for patterns using this a priori algorithm, it is hoped that the resulting information can improve further sales strategies.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Ying Wu ◽  
Shuai Huang ◽  
Xiangyu Chang

Abstract Background Sepsis, defined as life-threatening organ dysfunction caused by a dysregulated host response to infection, has become one of the major causes of death in Intensive Care Units (ICUs). The heterogeneity and complexity of this syndrome lead to the absence of golden standards for its diagnosis, treatment, and prognosis. The early prediction of in-hospital mortality for sepsis patients is not only meaningful to medical decision making, but more importantly, relates to the well-being of patients. Methods In this paper, a rule discovery and analysis (rule-based) method is used to predict the in-hospital death events of 2021 ICU patients diagnosed with sepsis using the MIMIC-III database. The method mainly includes two phases: rule discovery phase and rule analysis phase. In the rule discovery phase, the RuleFit method is employed to mine multiple hidden rules which are capable to predict individual in-hospital death events. In the rule analysis phase, survival analysis and decomposition analysis are carried out to test and justify the risk prediction ability of these rules. Then by leveraging a subset of these rules, we establish a prediction model that is both more accurate at the in-hospital death prediction task and more interpretable than most comparable methods. Results In our experiment, RuleFit generates 77 risk prediction rules, and the average area under the curve (AUC) of the prediction model based on 62 of these rules reaches 0.781 ($$\pm 0.018$$ ± 0.018 ) which is comparable to or even better than the AUC of existing methods (i.e., commonly used medical scoring system and benchmark machine learning models). External validation of the prediction power of these 62 rules on another 1468 sepsis patients not included in MIMIC-III in ICU provides further supporting evidence for the superiority of the rule-based method. In addition, we discuss and explain in detail the rules with better risk prediction ability. Glasgow Coma Scale (GCS), serum potassium, and serum bilirubin are found to be the most important risk factors for predicting patient death. Conclusion Our study demonstrates that, with the rule-based method, we could not only make accurate prediction on in-hospital death events of sepsis patients, but also reveal the complex relationship between sepsis-related risk factors through the rules themselves, so as to improve our understanding of the complexity of sepsis as well as its population.


2021 ◽  
Vol 8 (1) ◽  
pp. 53
Author(s):  
Zhaohao Sun ◽  
Paul Pinjik ◽  
Francisca Pambel

Business case mining and business rule discovery are at the center for entity relationship (E-R) modeling and database design to obtain E-R models. How to transform business cases through business rules into E-R models is a fundamental issue for database design. This article addresses this issue by exploring business case mining and E-R modeling optimization. Business case mining is business rule discovery from a business case. This article reviews case-based reasoning, explores business case-based reasoning, and presents a unified approach to business case mining for business rule discovery. The approach includes people-centered entity/business rule discovery and function-centered entity/business rule discovery. E-R modeling optimization aims to improve the E-R modeling process to get a better E-R diagram that reflects the business case properly. This article proposes a unified optimal method for E-R modeling. The unified optimal method includes people-centered E-R modeling, function-centered E-R modeling, and hierarchical E-R modeling. The approach proposed in this research will facilitate the research and development of E-R modeling, database design, data science, and big data analytics.


Author(s):  
Vito Janko ◽  
Gašper Slapničar ◽  
Erik Dovgan ◽  
Nina Reščič ◽  
Tine Kolenik ◽  
...  

The COVID-19 pandemic affected the whole world, but not all countries were impacted equally. This opens the question of what factors can explain the initial faster spread in some countries compared to others. Many such factors are overshadowed by the effect of the countermeasures, so we studied the early phases of the infection when countermeasures had not yet taken place. We collected the most diverse dataset of potentially relevant factors and infection metrics to date for this task. Using it, we show the importance of different factors and factor categories as determined by both statistical methods and machine learning (ML) feature selection (FS) approaches. Factors related to culture (e.g., individualism, openness), development, and travel proved the most important. A more thorough factor analysis was then made using a novel rule discovery algorithm. We also show how interconnected these factors are and caution against relying on ML analysis in isolation. Importantly, we explore potential pitfalls found in the methodology of similar work and demonstrate their impact on COVID-19 data analysis. Our best models using the decision tree classifier can predict the infection class with roughly 80% accuracy.


2021 ◽  
Vol 155 ◽  
pp. 106119
Author(s):  
Alfonso Montella ◽  
Filomena Mauriello ◽  
Mariano Pernetti ◽  
Maria Rella Riccardi
Keyword(s):  
The Road ◽  

2021 ◽  
pp. 107320
Author(s):  
Yong Shi ◽  
Wei Li ◽  
Luyao Zhu ◽  
Kun Guo ◽  
Erik Cambria

2020 ◽  
Vol 24 ◽  
pp. 105-122
Author(s):  
González-Méndez Andy ◽  
Martín Diana ◽  
Morales Eduardo ◽  
García-Borroto Milton

Associative classification is a pattern recognition approach that integrates classification and association rule discovery to build accurate classification models. These models are formed by a collection of contrast patterns that fulfill some restrictions. In this paper, we introduce an experimental comparison of the impact of using different restrictions in the classification accuracy. To the best of our knowledge, this is the first time that such analysis is performed, deriving some interesting findings about how restrictions impact on the classification results. Contrasting these results with previously published papers, we found that their conclusions could be unintentionally biased by the restrictions they used. We found, for example, that the jumping restriction could severely damage the pattern quality in the presence of dataset noise. We also found that the minimal support restriction has a different effect in the accuracy of two associative classifiers, therefore deciding which one is the best depends on the support value. This paper opens some interesting lines of research, mainly in the creation of new restrictions and new pattern types by joining different restrictions.


2020 ◽  
Vol 46 (10) ◽  
pp. 1807-1827 ◽  
Author(s):  
Hilary J. Don ◽  
Micah B. Goldwater ◽  
Justine K. Greenaway ◽  
Rosalind Hutchings ◽  
Evan J. Livesey

Sign in / Sign up

Export Citation Format

Share Document