clustering similarity
Recently Published Documents


TOTAL DOCUMENTS

13
(FIVE YEARS 5)

H-INDEX

3
(FIVE YEARS 1)

2021 ◽  
Vol 2021 ◽  
pp. 1-19
Author(s):  
Ziquan Jiao ◽  
Zhiqiang Feng ◽  
Na Lv ◽  
Wenjing Liu ◽  
Haijian Qin

A clustering similarity particle filter based on state trajectory consistency is presented for the mathematical modeling, performance estimation, and smart sensing of nonlinear systems. Starting from an information fusion model based on the consistency principle of the spatial state trajectory, the predicted observation information of the current particle filter (original trajectory) and future multistage Gaussian particle filter (modified trajectory) are selected as the state trajectories of the sampling particles. Clustering similarity methods are used to measure the state trajectories of the sampling particles and the actual system (reference trajectory). The importance weight of a first-order Markov model is updated with the measurement results. By integrating the targeted compensation scheme of the latest measurement information into the sequential importance sampling process, the adverse effects of the particle degradation phenomenon are effectively reduced. The convergence theorems of the improved particle filter are proposed and proved. The improved filter is applied to practical cases of nonlinear process estimation, economic statistical prediction, and battery health assessment, and the simulation results show that the improved particle filter is superior to traditional filters in estimation accuracy, efficiency, and robustness.


2021 ◽  
Vol 7 (3) ◽  
pp. 34-52
Author(s):  
Maria Trindade ◽  
Paulo Sousa ◽  
Maria Moreira

This paper is inspired by a manual picking retail company where shape and weight constraints affect the order-picking process. We proposed an alternative clustering similarity index that considers the similarity, the weight and the shape of products. This similarity index was further incorporated in a storage allocation heuristic procedure to set the location of the products. We test the procedure in a retail company that supplies over 191 stores, in Northern Portugal. When comparing the strategy currently used in the company with this procedure, we found out that our approach enabled a reduction of up to 40% on the picking distance; a percentage of improvement that is 32% higher than the one achieved by applying the Jaccard index, a similarity index commonly used in the literature. This allows warehouses to save time and work faster.


2019 ◽  
Author(s):  
Douglas E. U. Silva ◽  
Roberto A. Bittencourt ◽  
Rodrigo T. Calumby

Automated software architecture recovery of module views from source code is a challenging research issue. Different similarity measures are used to evaluate clustering algorithms in the software architecture recovery of module views. However, few studies seek to evaluate whether such measures accurately capture the similarities between two clusterings. This work presents an evaluation of six clustering similarity measures through the use of intrinsic quality and stability measures and the use of ground truth architectures proposed by developers. The results suggest that the MeCl metric is the most adequate to measure similarity in the context of comparison with ground truth models provided by developers. However, when the architectural models do not exist, the Purity metric shows the best results, as measured by the correlation with the intrinsic Silhouette coefficient.


2019 ◽  
Vol 4 (35) ◽  
pp. 1264 ◽  
Author(s):  
Alexander Gates ◽  
Yong-Yeol Ahn

2019 ◽  
Vol 3 (1) ◽  
pp. 62-79
Author(s):  
Ahmed N. Albatineh ◽  
Meredith L. Wilcox ◽  
Bashar Zogheib ◽  
Magdalena Niewiadomska-Bugaj

2018 ◽  
Author(s):  
Alexander J. Gates ◽  
Yong-Yeol Ahn

SummaryQuantifying the similarity of clusterings is a fundamental step in data analysis. Clustering similarity is the basis for method evaluation, consensus clustering, and tracking the temporal evolution of clusters, among many other tasks. Here we provide CluSim, a comprehensive Python package for the comparison of partitions, overlapping clusterings, and hierarchical clusterings (dendrograms) with more than 20 similarity measures. The CluSim package provides both analytic and empirical methods for assessing the similarity of clusterings in the context of a random model, and provides the novel element-centric approaches for clustering similarity measure that we introduced recently. We illustrate the use of the package through two examples: an evaluation of the clustering of Gene Expression data in the context of different random models, and detailed analysis of model incongruence using element-centric comparisons between a set of phylogentic trees (dendrograms).Availability and implementationThe CluSim Python package and accompanying jupyter notebook is available at https://github.com/Hoosier-Clusters/clusim with the MIT open source [email protected] [email protected]


2017 ◽  
Author(s):  
Alexander J. Gates ◽  
Yong-Yeol Ahn

AbstractClustering is a central approach for unsupervised learning. After clustering is applied, the most fundamental analysis is to quantitatively compare clusterings. Such comparisons are crucial for the evaluation of clustering methods as well as other tasks such as consensus clustering. It is often argued that, in order to establish a baseline, clustering similarity should be assessed in the context of a random ensemble of clusterings. The prevailing assumption for the random clustering ensemble is the permutation model in which the number and sizes of clusters are fixed. However, this assumption does not necessarily hold in practice; for example, multiple runs of K-means clustering returns clusterings with a fixed number of clusters, while the cluster size distribution varies greatly. Here, we derive corrected variants of two clustering similarity measures (the Rand index and Mutual Information) in the context of two random clustering ensembles in which the number and sizes of clusters vary. In addition, we study the impact of one-sided comparisons in the scenario with a reference clustering. The consequences of different random models are illustrated using synthetic examples, handwriting recognition, and gene expression data. We demonstrate that the choice of random model can have a drastic impact on the ranking of similar clustering pairs, and the evaluation of a clustering method with respect to a random baseline; thus, the choice of random clustering model should be carefully justified.


Author(s):  
Trong Nhan Phan ◽  
Markus Jäger ◽  
Stefan Nadschläger ◽  
Pablo Gómez-Pérez ◽  
Christian Huber ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document