We use cookies to improve your experience with our site.

Indexed in:

SCIE, EI, Scopus, INSPEC, DBLP, CSCD, etc.

Submission System
(Author / Reviewer / Editor)
Fei Wu, Ya-Hong Han, Yue-Ting Zhuang. Multiple Hypergraph Clustering ofWeb Images by MiningWord2Image Correlations[J]. Journal of Computer Science and Technology, 2010, 25(4): 750-760. DOI: 10.1007/s11390-010-1058-7
Citation: Fei Wu, Ya-Hong Han, Yue-Ting Zhuang. Multiple Hypergraph Clustering ofWeb Images by MiningWord2Image Correlations[J]. Journal of Computer Science and Technology, 2010, 25(4): 750-760. DOI: 10.1007/s11390-010-1058-7

Multiple Hypergraph Clustering ofWeb Images by MiningWord2Image Correlations

Funds: Supported by the National Natural Science Foundation of China under Grant Nos. 90920303, 60833006; the National Basic Research 973 Program of China under Grant No. 2010CB327905; the Program for Changjiang Scholars and Innovative Research Team in University of China under Grant Nos. IRT0652, PCSIRT.
More Information
  • Author Bio:

    Fei Wu is a senior member of CCF. He received the B.S. degree from Lanzhou University, China, the M.S. degree from Macao University, China, and the Ph.D. degree from Zhejiang University, Hangzhou, China. His research interest includes multimedia retrieval and statistic learning.

    Ya-Hong Han received the B.S. degree in 2000 from Zhengzhou University, China, the M.S. degree in 2003 from Hohai University, Nanjing, China. He is currently pursuing the Ph.D. degree at the College of Computer Science, Zhejiang University. His research interests are machine learning and multimedia retrieval.

    Yue-Ting Zhuang is a Member of IEEE. He received the B.S., M.S., and Ph.D. degrees from Zhejiang University, Hangzhou, China, in 1986, 1989, and 1998, respectively. Currently, he is a professor and Ph.D. supervisor at the College of Computer Science, Zhejiang University. His research interests include multimedia databases, artificial intelligence, and video-based animation.

  • Received Date: May 14, 2009
  • Revised Date: February 10, 2010
  • Published Date: July 08, 2010
  • In this paper, we consider the problem of clustering Web images by mining correlations between images and their corresponding words. Since Web images always come with associated text, the corresponding textual tags of Web images are used as a source to enhance the description of Web images. However, each word has different contribution for the interpretation of image semantics. Therefore, in order to evaluate the importance of each corresponding word of Web images, we propose a novel visibility model to compute the extent to which a word can be perceived visually in images, and then infer the correlation of word to image by the integration of visibility with tf-idf. Furthermore, Latent Dirichlet Allocation (LDA) is used to discover topic information inherent in surrounding text and topic correlations of images could be defined for image clustering. For integrating visibility and latent topic information into an image clustering framework, we first represent textual correlated and latent-topic correlated images by two hypergraph views, and then the proposed Spectral Multiple Hypergraph Clustering (SMHC) algorithm is used to cluster images into categories. The SMHC could be regarded as a new unsupervised learning process with two hypergraphs to classify Web images. Experimental results show that the SMHC algorithm has better clustering performance and the proposed SMHC-based image clustering framework is effective.
  • [1]
    Smeulders A W M et al. Content-based image retrieval: The end of the early years. IEEE Trans. PAMI, 2000, 22(12): 1349-1380.
    [2]
    Rege M, Dong M, Hua J. Graph theoretical framework for simultaneously integrating visual and textual features for efficient Web image clustering. In Proc. the 17th Int. Conf. World Wide Web 2008, Beijing, China, April 21-25, 2008, pp.317-326.
    [3]
    Jing F, Wang C et al. IGroup: A Web image search engine with semantic clustering of search results. In Proc. the 14th ACM Int. Conf. Multimedia, Singapore, Nov. 6-11, 2005, pp.377-384.
    [4]
    Cai D, He X et al. Hierarchical clustering of WWW image search results using visual, textual and link information. In Proc. the 13th ACM Int. Conf. Multimedia, New York, USA, Oct. 10-16, 2004, pp.952-959.
    [5]
    Saenko K, Darrell T. Unsupervised learning of visual sense models for polysemous words. In Proc. NIPS 2008, Vancouver, Canada, Dec. 8-11, 2008, pp.1393-1400.
    [6]
    Blei D, Ng A, Jordan M. Latent Dirichlet allocation. Journal of Machine Learning Research, Mar. 2003, 3: 993-1022.
    [7]
    Wu F, Liu Y, Zhuang Y. Tensor-based transductive learning for multi-modality video semantic concept detection. IEEE Transactions on Multimedia, 2009, 11(5): 868-878.
    [8]
    Yang Y, Zhuang Y, Wu F, Pan Y. Harmonizing hierarchical manifolds for multimedia document semantics understanding and cross-media retrieval. IEEE Transactions on Multimedia, 2008, 10(3): 437-446.
    [9]
    Wu F, Han Y, Zhuang Y, Shao J. Clustering web images by correlation mining of image-text. Journal of Software, http://www.jos.org.cn/1000-9825/3704.htm, 2010. (in Chinese)
    [10]
    Zhou D, Burges C J C. Spectral clustering and transductive learning with multiple views. In Proc. the 24th Int. Conf. Machine Learning, Corvallis, OR, 2007, pp.1159-1166.
    [11]
    Barnard K, Duygulu P et al. Matching words and pictures. Journal of Machine Learning Research, 2003, 3: 1107-1135.
    [12]
    Wang X J, Zhang L et al. Annotating images by mining image search results. IEEE Trans. PAMI, 2008, 30(11): 1919-1932.
    [13]
    Zhu X, Goldberg A et al. A text-to-picture synthesis system for augmenting communication. In Proc. the 22nd Conf. AAAI, Vancouver, Canada, July 22-26, 2007, pp.1590-1595.
    [14]
    Li H, Tang J et al. Word2Image: Towards visual interpretation of words. In Proc. the 16th ACM Int. Conf. Multimedia, Vancouver, Canada, Oct. 26-31, 2008, pp.813-816.
    [15]
    Xia D, Wu F, Zhuang Y. Search-based automatic Web image annotation using latent visual and semantic analysis. In Proc. the 9th Pacific Rim Conf. Multimedia, Tainan, China, Dec. 9-13, 2008.
    [16]
    Berg T, Forsyth D. Animals on the Web. In Proc. CVPR 2006, Washington, DC, USA, June 17-22, 2006, pp.1463-1470.
    [17]
    Zhou D, Huang J, Schõlkopf B. Learning with hypergraphs clustering, classification, and embedding. Advances in Neural Information Processing Systems, Vancouver/Whistler, Canada, Dec. 4-9, 2007, pp.1601-1608.
    [18]
    Golub G, Loan C. Matrix Computations. 3rd Edition, The Johns Hopkins University Press, 1996.
    [19]
    Griffiths T L, Steyvers M. Finding scientific topics. Proc. National Academy of Science, 2004, 101(Supp.1): 5228-5235.
    [20]
    Dhillon I S. Co-clustering documents and words using bipartite spectral graph partitioning. In Proc. KDD, San Francisco, USA, Aug. 26-29, 2001, 269-274.
    [21]
    LSCOM lexicon definitions and annotations version 1.0. In DTO Challenge Workshop on Large Scale Concept Ontology for Multimedia, Columbia University ADVENT Technical Report 117-2006-3, 2006.
    [22]
    Grubinger M, Clough P et al. The IAPR TC-12 Benchmark: A new evaluation resource for visual information systems. In Proc. Int. Workshop OntoImage'2006 Language Resources for Content-Based Image Retrieval, Genoa, Italy, May 22, 2006, pp.13-23.
    [23]
    Alexander S, Joydeep G. Cluster ensembles --- A knowledge reuse framework for combining multiple partitions. Journal Machine Learning Research, December 2002, 3: 583-617.
    [24]
    Long B, Zhang Z, Xu T. Clustering on complex graphs. In Proc. the 23rd Conf. AAAI 2008, Chicago, USA, July, 2008.
  • Related Articles

    [1]Wei Chen, Weiqing Wang, Hongzhi Yin, Jun-Hua Fang, Lei Zhao. User Account Linkage Across Multiple Platforms with Location Data[J]. Journal of Computer Science and Technology, 2020, 35(4): 751-768. DOI: 10.1007/s11390-020-0250-7
    [2]Xue-Hou Tan, Bo Jiang. Searching a Polygonal Region by Two Guards[J]. Journal of Computer Science and Technology, 2008, 23(5): 728-739.
    [3]HAO Zhongxiao, YAO Chunlong. The Existence Condition of r-Acyclic Database Schemes with MVDs Constraints[J]. Journal of Computer Science and Technology, 2002, 17(4).
    [4]ZHANG Li'ang, ZHANG Yin. Approximation for- Knapsack Problemswith Multiple Constraints[J]. Journal of Computer Science and Technology, 1999, 14(4): 289-297.
    [5]ZHUANG Yueting, RUI Yong, Thomas S.Huang. Video Key Frame Extraction by Unsupervised Clustering and Feedback Adjustment[J]. Journal of Computer Science and Technology, 1999, 14(3): 283-287.
    [6]Zhang Xiaolong, Masayuki Numao. An Efficient Multiple Predicate Learner[J]. Journal of Computer Science and Technology, 1998, 13(3): 268-278.
    [7]Zhou Yingquan, Min Yinghua. A Kind of Multistage Interconnection Networks with Multiple Paths[J]. Journal of Computer Science and Technology, 1996, 11(4): 395-404.
    [8]Liu Tieying, Ye Xinming. An Algorithm for Determining Minimal Reduced-Coverings of Acyclic Database Schemes[J]. Journal of Computer Science and Technology, 1996, 11(4): 347-355.
    [9]Ye Shiwei, Shi Zhongzhi. A Necessary Condition about the Optimum Partition on a Finite Set of Samples and Its Application to Clustering Analysis[J]. Journal of Computer Science and Technology, 1995, 10(6): 545-556.
    [10]Yao Rong, Kang Tai, Chen Tinghuai. Algorithms for the Determination of Cutsets in a Hypergraph[J]. Journal of Computer Science and Technology, 1990, 5(1): 41-46.

Catalog

    Article views (33) PDF downloads (3165) Cited by()
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return