Journal of Computer Science and Technology
Indexed by   SCIE, EI ...
Bimonthly    Since 1986
Journal of Computer Science and Technology, 2018, Vol. 33, Issue 4: 711-726. DOI: 10.1007/s11390-018-1851-2
Special Issue on Software Engineering for High-Confidence Systems
Hashtag Recommendation Based on Multi-Features of Microblogs
Fei-Fei Kou, Jun-Ping Du*, Distinguished Member, CCF, Cong-Xian Yang, Yan-Song Shi, Wan-Qiu Cui, Mei-Yu Liang, Yue Geng
Beijing Key Laboratory of Intelligent Telecommunication Software and Multimedia, School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, China

Abstract: Hashtag recommendation for microblogs is an active research topic that benefits many microblog-related applications. However, microblog text is short and the hashtag utilization rate is low, which causes a data sparsity problem and makes it difficult for typical hashtag recommendation methods to recommend accurately. To address this, we propose HRMF, a hashtag recommendation method based on multi-features of microblogs. HRMF first expands short text into long text and then jointly models multiple features of microblogs (user, hashtag, and text) with a newly designed topic model. To further alleviate data sparsity, HRMF takes the hashtags of both similar users and similar microblogs as candidate hashtags. In particular, to find similar users, HRMF combines the designed topic model with a typical user-based collaborative filtering method. Finally, HRMF computes a recommendation score for each candidate hashtag from the generated topical representations of the multi-features and recommends hashtags accordingly. Experimental results on a real-world dataset crawled from Sina Weibo demonstrate the effectiveness of HRMF for hashtag recommendation.
Keywords: hashtag recommendation, topic model, collaborative filtering method, microblog
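The candidate-generation and scoring steps described in the abstract can be illustrated with a minimal Python sketch. It is not the HRMF implementation: it assumes topic vectors for users, microblogs, and hashtags are already available (in the paper these come from the designed topic model, which is not reproduced here), and it uses cosine similarity as a stand-in for the paper's similarity and scoring functions. The names `cosine`, `top_k_similar`, `recommend`, and the parameters `k` and `n` are hypothetical.

```python
import math


def cosine(p, q):
    """Cosine similarity between two equal-length topic vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return dot / norm if norm else 0.0


def top_k_similar(target, vectors, k):
    """Return the k keys of `vectors` whose vectors are most similar to `target`."""
    ranked = sorted(vectors, key=lambda key: cosine(target, vectors[key]), reverse=True)
    return ranked[:k]


def recommend(post_topics, user_id, user_topics, posts, hashtag_topics, k=2, n=3):
    """Rank candidate hashtags for a new microblog (illustrative only).

    posts is a list of (author_id, topic_vector, hashtags) tuples.
    All topic vectors are assumed inputs, not outputs of the paper's model.
    """
    # Similar users: user-based neighbourhood computed on topic vectors.
    others = {u: v for u, v in user_topics.items() if u != user_id}
    similar_users = set(top_k_similar(user_topics[user_id], others, k))

    # Similar microblogs: nearest historical posts in topic space.
    post_vectors = {i: vec for i, (_, vec, _) in enumerate(posts)}
    similar_posts = set(top_k_similar(post_topics, post_vectors, k))

    # Candidates: hashtags used by similar users or attached to similar posts.
    candidates = set()
    for i, (author, _, tags) in enumerate(posts):
        if author in similar_users or i in similar_posts:
            candidates.update(tags)

    # Score each candidate by topical closeness to the new microblog.
    ranked = sorted(candidates, key=lambda h: (-cosine(post_topics, hashtag_topics[h]), h))
    return ranked[:n]
```

Drawing candidates from two neighbourhoods (users and microblogs) is the part that targets sparsity: a post with no close textual neighbours can still receive candidates through its author's neighbours, and vice versa.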
Received: 2018-01-14
Funding:

This work was supported by the National Natural Science Foundation of China under Grant Nos. 61320106006, 61532006, 61772083, and 61502042, and the Fundamental Research Funds for the Central Universities of China under Grant No. 2017RC39.

Corresponding author: Jun-Ping Du, E-mail: junpingd@bupt.edu.cn
About the author: Fei-Fei Kou is currently a Ph.D. candidate in computer science and technology at Beijing University of Posts and Telecommunications, Beijing. She received her B.S. degree in electronic information engineering from Yantai University, Yantai, in 2010, and her M.S. degree in computer technology from Beijing Technology and Business University, Beijing, in 2013. Her major research interests include semantic learning and multimedia information retrieval and recommendation.
Cite this article:
Fei-Fei Kou, Jun-Ping Du, Cong-Xian Yang, Yan-Song Shi, Wan-Qiu Cui, Mei-Yu Liang, Yue Geng. Hashtag Recommendation Based on Multi-Features of Microblogs[J]. Journal of Computer Science and Technology, 2018, 33(4): 711-726
Link to this article:
http://jcst.ict.ac.cn:8080/jcst/CN/10.1007/s11390-018-1851-2
Copyright 2010 by Journal of Computer Science and Technology