CK-Modes Clustering Algorithm Based on Node Cohesion in Labeled Property Graph

Da-Wei Wang; Wan-Qiu Cui; Biao Qin

doi:10.1007/s11390-019-1966-0

Da-Wei Wang, Wan-Qiu Cui, Biao Qin. CK-Modes Clustering Algorithm Based on Node Cohesion in Labeled Property GraphJ. Journal of Computer Science and Technology, 2019, 34(5): 1152-1166. DOI: 10.1007/s11390-019-1966-0

Citation:

CK-Modes Clustering Algorithm Based on Node Cohesion in Labeled Property Graph

Abstract

Abstract

The designation of the cluster number K and the initial centroids is essential for K-modes clustering algorithm. However, most of the improved methods based on K-modes specify the K value manually and generate the initial centroids randomly, which makes the clustering algorithm significantly dependent on human-based decisions and unstable on the iteration time. To overcome this limitation, we propose a cohesive K-modes (CK-modes) algorithm to generate the cluster number K and the initial centroids automatically. Explicitly, we construct a labeled property graph based on index-free adjacency to capture both global and local cohesion of the node in the sample of the input datasets. The cohesive node calculated based on the property similarity is exploited to split the graph to a K-node tree that determines the K value, and then the initial centroids are selected from the split subtrees. Since the property graph construction and the cohesion calculation are only performed once, they account for a small amount of execution time of the clustering operation with multiple iterations, but significantly accelerate the clustering convergence. Experimental validation in both real-world and synthetic datasets shows that the CK-modes algorithm outperforms the state-of-the-art algorithms.

FullText(HTML)

References (30)

Relative Articles

Supplements (1)

Cited By

CK-Modes Clustering Algorithm Based on Node Cohesion in Labeled Property Graph

Abstract

Catalog

Export File

Citation

Format

Content