Association rules mining on concept lattice using domain knowledge

Author(s):  
De-Xing Wang ◽  
Xue-Gang Hu ◽  
Xiao-Ping Liu ◽  
Hao Wang
2011 ◽  
Vol 460-461 ◽  
pp. 363-368
Author(s):  
Lei Zhang ◽  
Zhi Chao Wang

Traditional multi-level association rules mining approaches are based only on database contents. The relations of items in itemset are considered rarely. It leads to generate a lot of meaningless itemsets. Aiming at the problem,multi-level association rules mining algorithm based on semantic relativity is proposed. Domain knowledge is described by Ontology. Every item is seen as a concept in Ontology. Semantic relativity is used to measure the semantic meaning of itemsets. Minimum support of itemset is set according to its length and semantic relativity. Semantic related minimum support with length-decrease is defined to filter meaningless itemsets. Experiments results showed that the method in the paper can improve the efficiency of multi-level association rules mining and generated meaningful rules.


2014 ◽  
Vol 2014 ◽  
pp. 1-9 ◽  
Author(s):  
Yang Xu ◽  
Mingming Zeng ◽  
Quanhui Liu ◽  
Xiaofeng Wang

Multilevel association rules mining is an important domain to discover interesting relations between data elements with multiple levels abstractions. Most of the existing algorithms toward this issue are based on exhausting search methods such as Apriori, and FP-growth. However, when they are applied in the big data applications, those methods will suffer for extreme computational cost in searching association rules. To expedite multilevel association rules searching and avoid the excessive computation, in this paper, we proposed a novel genetic-based method with three key innovations. First, we use the category tree to describe the multilevel application data sets as the domain knowledge. Then, we put forward a special tree encoding schema based on the category tree to build the heuristic multilevel association mining algorithm. As the last part of our design, we proposed the genetic algorithm based on the tree encoding schema that will greatly reduce the association rule search space. The method is especially useful in mining multilevel association rules in big data related applications. We test the proposed method with some big datasets, and the experimental results demonstrate the effectiveness and efficiency of the proposed method in processing big data. Moreover, our results also manifest that the algorithm is fast convergent with a limited termination threshold.


Sign in / Sign up

Export Citation Format

Share Document