1 Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China;
2 State Key Laboratory on Intelligent Technology and Systems, Tsinghua University, Beijing 100084, China;
3 National Laboratory for Information Science and Technology, Tsinghua University, Beijing 100084, China;
4 Jiangsu Collaborative Innovation Center for Language Ability, Jiangsu Normal University, Xuzhou 221009, China
Abstract Some microblog services encourage users to annotate themselves with multiple tags that indicate their attributes and interests. User tags play an important role in personalized recommendation and information retrieval. To better understand the semantics of user tags, we propose the Tag Correspondence Model (TCM), which identifies complex correspondences of tags from the rich context of microblog users. A correspondence of a tag is a unique element in the context that is semantically correlated with the tag. In TCM, we divide the context of a microblog user into several sources (such as short messages, user profile, and neighbors). Given a collection of users with annotated tags, TCM automatically learns the correspondences of user tags from these sources. With the learned correspondences, we can interpret the implicit semantics of tags. Moreover, for users who have not annotated any tags, TCM can suggest tags according to their context information. Extensive experiments on a real-world dataset demonstrate that our method can efficiently identify correspondences of tags, which may eventually represent the semantic meanings of tags.
This work is supported by the National Natural Science Foundation of China under Grant Nos. 61170196 and 61202140, and the Major Project of the National Social Science Foundation of China under Grant No. 13&ZD190.
Corresponding Author: Zhi-Yuan Liu
About author: Cun-Chao Tu is a Ph.D. student in the Department of Computer Science and Technology, Tsinghua University, Beijing. He received his B.E. degree in computer science from Tsinghua University in 2013. His research interests are user representation and social computation.
Cite this article:
Cun-Chao Tu, Zhi-Yuan Liu, Mao-Song Sun. Tag Correspondence Model for User Tag Suggestion[J]. Journal of Computer Science and Technology, 2015, 30(5): 1063-1072
Copyright 2010 by Journal of Computer Science and Technology