Document Clustering Using K-Means, Heuristic K-Means and Fuzzy C-Means

Author(s):  
Vivek Kumar Singh ◽  
Nisha Tiwari ◽  
Shekhar Garg
2013 ◽  
Vol 3 (2) ◽  
Author(s):  
Stuti Karol ◽  
Veenu Mangat

AbstractClustering, an extremely important technique in Data Mining is an automatic learning technique aimed at grouping a set of objects into subsets or clusters. The goal is to create clusters that are coherent internally, but substantially different from each other. Text Document Clustering refers to the clustering of related text documents into groups based upon their content. It is a fundamental operation used in unsupervised document organization, text data mining, automatic topic extraction, and information retrieval. Fast and high-quality document clustering algorithms play an important role in effectively navigating, summarizing, and organizing information. The documents to be clustered can be web news articles, abstracts of research papers etc. This paper proposes two techniques for efficient document clustering involving the application of soft computing approach as an intelligent hybrid approach PSO algorithm. The proposed approach involves partitioning Fuzzy C-Means algorithm and K-Means algorithm each hybridized with Particle Swarm Optimization (PSO). The performance of these hybrid algorithms has been evaluated against traditional partitioning techniques (K-Means and Fuzzy C Means).


2010 ◽  
Vol 42 (12) ◽  
pp. 13-21
Author(s):  
Anatoliy F. Bulat ◽  
Elena M. Kiseleva ◽  
Sergey A. Pichugov ◽  
Oleg B. Blyuss

Sign in / Sign up

Export Citation Format

Share Document