maximal frequent pattern Latest Research Papers

WMFP-Outlier: An Efficient Maximal Frequent-Pattern-Based Outlier Detection Approach for Weighted Data Streams

Information Technology And Control ◽

10.5755/j01.itc.48.4.22176 ◽

2019 ◽

Vol 48 (4) ◽

pp. 505-521 ◽

Cited By ~ 1

Author(s):

Saihua Cai ◽

Qian Li ◽

Sicong Li ◽

Gang Yuan ◽

Ruizhi Sun

Keyword(s):

Outlier Detection ◽

Data Streams ◽

Data Stream ◽

Pattern Mining ◽

Frequent Pattern ◽

Frequent Patterns ◽

Time Cost ◽

Detection Approach ◽

Maximal Frequent Pattern ◽

Weighted Data

Since outliers are the major factors that affect accuracy in data science, many outlier detection approaches have been proposed for effectively identifying the implicit outliers from static datasets, thereby improving the reliability of the data. In recent years, data streams have been the main form of data, and the data elements in a data stream are not always of equal importance. However, the existing outlier detection approaches do not consider the weight conditions; hence, these methods are not suitable for processing weighted data streams. In addition, the traditional pattern-based outlier detection approaches incur a high time cost in the outlier detection phase. Aiming at overcoming these problems, this paper proposes a two-phase pattern-based outlier detection approach, namely, WMFP-Outlier, for effectively detecting the implicit outliers from a weighted data stream, in which the maximal frequent patterns are used instead of the frequent patterns to accelerate the process of outlier detection. In the process of maximal frequent-pattern mining, the anti-monotonicity property and MFP-array structure are used to accelerate the mining operation. In the process of outlier detection, three deviation indices are designed for measuring the degree of abnormality of each transaction, and the transactions with the highest degrees of abnormality are judged as outliers. Last, several experimental studies are conducted on a synthetic dataset to evaluate the performance of the proposed WMFP-Outlier approach. The results demonstrate that the accuracy of the WMFP-Outlier approach is higher compared to the existing pattern-based outlier detection approaches, and the time cost of the outlier detection phase of WMFP-Outlier is lower than those of the other four compared pattern-based outlier detection approaches.

Download Full-text

Performance and characteristic analysis of maximal frequent pattern mining methods using additional factors

Soft Computing ◽

10.1007/s00500-017-2820-3 ◽

2017 ◽

Vol 22 (13) ◽

pp. 4267-4273 ◽

Cited By ~ 3

Author(s):

Gangin Lee ◽

Unil Yun

Keyword(s):

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Characteristic Analysis ◽

Maximal Frequent Pattern ◽

Mining Methods

Download Full-text

Analysis of Recent Maximal Frequent Pattern Mining Approaches

Advances in Computer Science and Ubiquitous Computing - Lecture Notes in Electrical Engineering ◽

10.1007/978-981-10-3023-9_135 ◽

2016 ◽

pp. 873-877 ◽

Cited By ~ 1

Author(s):

Gangin Lee ◽

Unil Yun

Keyword(s):

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Maximal Frequent Pattern

Download Full-text

Approximate Maximal Frequent Pattern Mining with Weight Conditions and Error Tolerance

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001416500129 ◽

2016 ◽

Vol 30 (06) ◽

pp. 1650012 ◽

Cited By ~ 14

Author(s):

Gangin Lee ◽

Unil Yun ◽

Heungmo Ryang ◽

Donggyu Kim

Keyword(s):

Pattern Mining ◽

Frequent Pattern Mining ◽

Error Tolerance ◽

Frequent Pattern ◽

Frequent Patterns ◽

Large Databases ◽

Maximal Frequent Pattern ◽

Negative Effect ◽

Previous State ◽

Pattern Information

Since the concept of frequent pattern mining was proposed, there have been many efforts to obtain useful pattern information from large databases. As one of them, applying weight conditions allows us to mine weighted frequent patterns considering unique importance of each item composing databases, and the result of analysis for the patterns provides more useful information than that of considering only frequency or support information. However, although this approach gives us more meaningful pattern information, the number of patterns found from large databases is extremely large in general; therefore, analyzing all of them may become inefficient and hard work. Thus, it is essential to apply a method that can selectively extract representative patterns from the enormous ones. Moreover, in the real-world applications, unexpected errors such as noise may occur, which can have a negative effect on the values of databases. Although the changes by the error are quite small, the characteristics of generated patterns can be turned definitely. For this reason, we propose a novel algorithm that can solve the above problems, called AWMax (an algorithm for mining Approximate weighted maximal frequent patterns (AWMFPs) considering error tolerance). Through the algorithm, we can obtain useful AWMFPs regardless of noise because of the consideration of error tolerance. Comprehensive performance experiments present that the proposed algorithm has more outstanding performance than previous state-of-the-art ones.

Download Full-text

Discovering long maximal frequent pattern

2016 Eighth International Conference on Advanced Computational Intelligence (ICACI) ◽

10.1109/icaci.2016.7449817 ◽

2016 ◽

Cited By ~ 1

Author(s):

Shu-Jing Lin ◽

Yi-Chung Chen ◽

Don-Lin Yang ◽

Jungpin Wu

Keyword(s):

Frequent Pattern ◽

Maximal Frequent Pattern

Download Full-text

Mining Maximal Frequent Patterns over Data Stream Based on Time Decaying

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.602-605.3835 ◽

2014 ◽

Vol 602-605 ◽

pp. 3835-3838

Author(s):

Fen Fen Zhou ◽

Jun Rui Yang

Keyword(s):

Data Stream ◽

Search Strategy ◽

Sliding Window ◽

Search Space ◽

Frequent Pattern ◽

Frequent Patterns ◽

Depth First Search ◽

Maximal Frequent Pattern ◽

Frequent Pattern Tree ◽

Enumeration Tree

A new algorithm DSMFP-Miner was proposed. When the data stream reach continuously, a maximal frequent pattern tree called MFP-Tree is adopted to maintain the transactions in data screams dynamically. Transactions in the same Transaction-sensitive sliding window are set to own the same “importance”. Besides, the support of the transactions in old window is decayed to reduce their influence to mining results, and infrequent patterns and overdue patterns are deleted periodically. In the mining process, the algorithm put an enumeration tree with each node of MFP-Tree as root as the search space, and use the "depth-first" search strategy to mining the maximal frequent patterns with this node as a suffix.

Download Full-text

Sliding window based weighted maximal frequent pattern mining over data streams

Expert Systems with Applications ◽

10.1016/j.eswa.2013.07.094 ◽

2014 ◽

Vol 41 (2) ◽

pp. 694-708 ◽

Cited By ~ 64

Author(s):

Gangin Lee ◽

Unil Yun ◽

Keun Ho Ryu

Keyword(s):

Data Streams ◽

Pattern Mining ◽

Frequent Pattern Mining ◽

Sliding Window ◽

Frequent Pattern ◽

Maximal Frequent Pattern

Download Full-text

Diversifying Search Results through Pattern-Based Subtopic Modeling

International Journal on Semantic Web and Information Systems ◽

10.4018/jswis.2012100103 ◽

2012 ◽

Vol 8 (4) ◽

pp. 37-56 ◽

Cited By ~ 4

Author(s):

Wei Zheng ◽

Hui Fang ◽

Hong Cheng ◽

Xuanhui Wang

Keyword(s):

Pattern Mining ◽

Frequent Pattern Mining ◽

Frequent Pattern ◽

Context Information ◽

Redundant Information ◽

Clustering Method ◽

Retrieval Models ◽

Search Results ◽

Maximal Frequent Pattern ◽

Single Pattern

Traditional information retrieval models do not necessarily provide users with optimal search experience because the top ranked documents may contain excessively redundant information. Therefore, satisfying search results should be not only relevant to the query but also diversified to cover different subtopics of the query. In this paper, the authors propose a novel pattern-based framework to diversify search results, where each pattern is a set of semantically related terms covering the same subtopic. They first apply a maximal frequent pattern mining algorithm to extract the patterns from retrieval results of the query. The authors then propose to model a subtopic with either a single pattern or a group of similar patterns. A profile-based clustering method is adapted to group similar patterns based on their context information. The search results are then diversified using the extracted subtopics. Experimental results show that the proposed pattern-based methods are effective to diversify the search results.

Download Full-text

Improving Star Join Queries Performance: A Maximal Frequent Pattern Based Approach for Automatic Selection of Indexes in Relational Data Warehouses

2011 International Conference on Internet Computing and Information Services ◽

10.1109/icicis.2011.36 ◽

2011 ◽

Author(s):

B. Ziani ◽

Y. Ouinten

Keyword(s):

Frequent Pattern ◽

Relational Data ◽

Data Warehouses ◽

Automatic Selection ◽

Maximal Frequent Pattern ◽

Join Queries ◽

Selection Of

Download Full-text

Research on Maximal Frequent Pattern Outlier Factor for Online High-Dimensional Time-Series Outlier Detection

Journal of Convergence Information Technology ◽

10.4156/jcit.vol5.issue10.9 ◽

2010 ◽

Vol 5 (10) ◽

pp. 66-71 ◽

Cited By ~ 5

Author(s):

Feng Lin ◽

Wang Le ◽

Jin Bo

Keyword(s):

Time Series ◽

Outlier Detection ◽

Frequent Pattern ◽

High Dimensional ◽

Maximal Frequent Pattern

Download Full-text

maximal frequent pattern
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

WMFP-Outlier: An Efficient Maximal Frequent-Pattern-Based Outlier Detection Approach for Weighted Data Streams

Performance and characteristic analysis of maximal frequent pattern mining methods using additional factors

Analysis of Recent Maximal Frequent Pattern Mining Approaches

Approximate Maximal Frequent Pattern Mining with Weight Conditions and Error Tolerance

Discovering long maximal frequent pattern

Mining Maximal Frequent Patterns over Data Stream Based on Time Decaying

Sliding window based weighted maximal frequent pattern mining over data streams

Diversifying Search Results through Pattern-Based Subtopic Modeling

Improving Star Join Queries Performance: A Maximal Frequent Pattern Based Approach for Automatic Selection of Indexes in Relational Data Warehouses

Research on Maximal Frequent Pattern Outlier Factor for Online High-Dimensional Time-Series Outlier Detection

Export Citation Format

maximal frequent patternRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

WMFP-Outlier: An Efficient Maximal Frequent-Pattern-Based Outlier Detection Approach for Weighted Data Streams

Performance and characteristic analysis of maximal frequent pattern mining methods using additional factors

Analysis of Recent Maximal Frequent Pattern Mining Approaches

Approximate Maximal Frequent Pattern Mining with Weight Conditions and Error Tolerance

Discovering long maximal frequent pattern

Mining Maximal Frequent Patterns over Data Stream Based on Time Decaying

Sliding window based weighted maximal frequent pattern mining over data streams

Diversifying Search Results through Pattern-Based Subtopic Modeling

Improving Star Join Queries Performance: A Maximal Frequent Pattern Based Approach for Automatic Selection of Indexes in Relational Data Warehouses

Research on Maximal Frequent Pattern Outlier Factor for Online High-Dimensional Time-Series Outlier Detection

maximal frequent pattern
Recently Published Documents