Journal of Computer Science and Technology
Quick Search in JCST
 Advanced Search 
      Home | PrePrint | SiteMap | Contact Us | FAQ
 
Indexed by   SCIE, EI ...
Bimonthly    Since 1986
Journal of Computer Science and Technology 2014, Vol. 29 Issue (1) :38-52    DOI: 10.1007/s11390-013-1410-9
Computer Networks and Distributed Computing Current Issue | Archive | Adv Search << Previous Articles | Next Articles >>
Improving Scalability of Cloud Monitoring Through PCA-Based Clustering of Virtual Machines
Claudia Canali, Member, IEEE, and Riccardo Lancellotti, Member, ACM, IEEE
Department of Information Engineering, University of Modena and Reggio Emilia, 41125 Modena, Italy

Abstract
Reference
Related Articles
Download: [PDF 703KB]     Export: BibTeX or EndNote (RIS)  
Abstract Cloud computing has recently emerged as a leading paradigm to allow customers to run their applications in virtualized large-scale data centers. Existing solutions for monitoring and management of these infrastructures consider virtual machines (VMs) as independent entities with their own characteristics. However, these approaches suffer from scalability issues due to the increasing number of VMs in modern cloud data centers. We claim that scalability issues can be addressed by leveraging the similarity among VMs behavior in terms of resource usage patterns. In this paper we propose an automated methodology to cluster VMs starting from the usage of multiple resources, assuming no knowledge of the services executed on them. The innovative contribution of the proposed methodology is the use of the statistical technique known as principal component analysis (PCA) to automatically select the most relevant information to cluster similar VMs. We apply the methodology to two case studies, a virtualized testbed and a real enterprise data center. In both case studies, the automatic data selection based on PCA allows us to achieve high performance, with a percentage of correctly clustered VMs between 80% and 100% even for short time series (1 day) of monitored data. Furthermore, we estimate the potential reduction in the amount of collected data to demonstrate how our proposal may address the scalability issues related to monitoring and management in cloud computing data centers.
Articles by authors
Claudia Canali
Riccardo Lancellotti
Keywordscloud computing   resource monitoring   principal component analysis   k-means clustering     
Received 2013-02-12;
Cite this article:   
Claudia Canali, Riccardo Lancellotti.Improving Scalability of Cloud Monitoring Through PCA-Based Clustering of Virtual Machines[J]  Journal of Computer Science and Technology, 2014,V29(1): 38-52
URL:  
http://jcst.ict.ac.cn:8080/jcst/EN/10.1007/s11390-013-1410-9
Copyright 2010 by Journal of Computer Science and Technology