›› 2015,Vol. 30 ›› Issue (5): 1054-1062.doi: 10.1007/s11390-015-1581-7

所属专题: Artificial Intelligence and Pattern Recognition Data Management and Data Mining

• Special Section on Selected Paper from NPC 2011 • 上一篇    下一篇

基于在线问答社区的意图相关产品挖掘

Jun-Wen Duan(段俊文), Yi-Heng Chen(陈毅恒), Member, CCF Ting Liu*(刘挺), Senior Member, CCF, ACM, Xiao Ding(丁效), Member, CCF   

  1. School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
  • 收稿日期:2014-11-17 修回日期:2015-07-09 出版日期:2015-09-05 发布日期:2015-09-05
  • 通讯作者: Ting Liu E-mail:tliu@ir.hit.edu.cn
  • 作者简介:Jun-Wen Duan received his B.E. degree in computer science and technology from Harbin Institute of Technology, Harbin, in 2013. Currently, he is a Ph.D. candidate in Harbin Institute of Technology. His current research interests include natural language processing, social computing, and text mining.
  • 基金资助:

    The research is supported by the National Basic Research 973 Program of China under Grant No. 2014CB340503 and the National Natural Science Foundation of China under Grant Nos. 61133012, 61202277, and 61472107.

Mining Intention-Related Products on Online Q&A Community

Jun-Wen Duan(段俊文), Yi-Heng Chen(陈毅恒), Member, CCF Ting Liu*(刘挺), Senior Member, CCF, ACM, Xiao Ding(丁效), Member, CCF   

  1. School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
  • Received:2014-11-17 Revised:2015-07-09 Online:2015-09-05 Published:2015-09-05
  • Contact: Ting Liu E-mail:tliu@ir.hit.edu.cn
  • About author:Jun-Wen Duan received his B.E. degree in computer science and technology from Harbin Institute of Technology, Harbin, in 2013. Currently, he is a Ph.D. candidate in Harbin Institute of Technology. His current research interests include natural language processing, social computing, and text mining.
  • Supported by:

    The research is supported by the National Basic Research 973 Program of China under Grant No. 2014CB340503 and the National Natural Science Foundation of China under Grant Nos. 61133012, 61202277, and 61472107.

在线社交媒体上的用户生成内容吸引了来自服务/产品提供商的注意, 因为其中包含大量潜在的商业机会。然而, 之前的工作主要集中在用户消费意图的识别, 挖掘意图相关产品很少有工作涉及。本文中, 我们提出了一种新的基于在线问答社区的意图相关产品的挖掘方法。将问答对作为数据源, 我们首先基于依存分析自动地抽取其中的候选产品。之后, 通过搭配抽取的模型, 我们从候选产品集合中识别出真正的意图相关产品。

Abstract: User generated content on social media has attracted much attention from service/product providers, as it contains plenty of potential commercial opportunities. However, previous work mainly focuses on user consumption intention (CI) identification, and little effort has been spent to mine intention-related products. In this paper, focusing on the Baby & Child Care domain, we propose a novel approach to mine intention-related products on online question and answer (Q&A) community. Making use of the question-answering pairs as data source, we first automatically extract candidate products based on dependency parser. And then by means of the collocation extraction model, we identify the real intention-related products from the candidate set. The experimental results on our carefully constructed evaluation dataset show that our approach achieves better performance than two natural baseline methods.

[1] Ashkan A, Clarke C L. Term-based commercial intent analysis. In Proc. the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2009, pp.800-801.

[2] Dai H K, Zhao L, Nie Z, Wen J R, Wang L, Li Y. Detecting online commercial intention (OCI). In Proc. the 15th International Conference on World Wide Web, May 2006, pp.829-837.

[3] Ding X, Liu T, Duan J, Nie J Y. Mining user consumption intention from social media using domain adaptive convolutional neural network. In Proc. the 29th AAAI Conference on Artificial Intelligence, January 2015, pp.2395-2389.

[4] Hollerit B, Kröll M, Strohmaier M. Towards linking buyers and sellers:Detecting commercial intent on Twitter. In Proc. the 22nd International Conference on World Wide Web Companion, May 2013, pp.629-632.

[5] Shah C, Pomerantz J. Evaluating and predicting answer quality in community QA. In Proc. the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2010, pp.411-418.

[6] Kaufmann M. Syntactic normalization of Twitter messages. Studies, 2010, 2:1-7.

[7] Che W, Li Z, Liu T. LTP:A Chinese language technology platform. In Proc. the 23rd International Conference on Computational Linguistics:Demonstrations, August 2010, pp.13-16.

[8] Liu Z, Wang H, Wu H, Li S. Collocation extraction using monolingual word alignment method. In Proc. the 2009 Conference on Empirical Methods in Natural Language Processing, August 2009, pp.487-495.

[9] Grinstead C M, Snell J L. Introduction to Probability (2nd edition). American Mathematical Society, 1997.

[10] Cohen J. Weighted kappa:Nominal scale agreement with provision for scaled disagreement or partial credit. Psychological Bulletin, 1968, 70(4):213-220.

[11] Robertson S E, Walker S. Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. In Proc. the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 1994, pp.232-241.

[12] Zhao W X, Jiang J, He J, Song Y, Achananuparp P, Lim E P, Li X. Topical keyphrase extraction from Twitter. In Proc. the 49th Annual Meeting of the Association for Computational Linguistics:Human Language Technologies, Volume 1, June 2011, pp.379-388.

[13] Ritter A, Clark S, Mausam, Etzioni O. Named entity recognition in tweets:An experimental study. In Proc. the Conference on Empirical Methods in Natural Language Processing, July 2011, pp.1524-1534.

[14] Pak A, Paroubek P. Twitter as a corpus for sentiment analysis and opinion mining. In Proc. LREC, May 2010, pp.1320- 1326.

[15] Sakaki T, Okazaki M, Matsuo Y. Earthquake shakes Twitter users:Real-time event detection by social sensors. In Proc. the 19th International Conference on World Wide Web, April 2010, pp.851-860.

[16] Wang J, Zhao W X, Wei H, Yan H, Li X. Mining new business opportunities:Identifying trend related products by leveraging commercial intents from microblogs. In Proc. the Conference on Empirical Methods in Natural Language Processing, October 2013, pp.1337-1347.
No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] Harald E. Otto;. UNDO, An Aid for Explorative Learning?[J]. , 1992, 7(3): 226 -236 .
[2] 徐美瑞; 刘小林;. A VLSI Algorithm for Calculating the Tree to Tree Distance[J]. , 1993, 8(1): 68 -76 .
[3] 王晖; 刘大有; 王亚飞;. Sequential Back-Propagation[J]. , 1994, 9(3): 252 -260 .
[4] 陆波; 蔡士杰;. A Skeleton-Based Approach of Automatically Generating Some Chinese Typefaces[J]. , 1996, 11(1): 30 -38 .
[5] 彭澄廉;. Combining Gprof and Event-Driven Monitoring for Analyzing Distributed Programs:A Rough View of NCSA Mosaic[J]. , 1996, 11(4): 427 -432 .
[6] 张成洪; 胡运发; 施伯乐;. A Reasoning Mechanism for DeductiveObject-Oriented Databases[J]. , 1997, 12(4): 337 -345 .
[7] 王箭; 张福炎;. Multicast Address Management and Connection Control Based on Hierarchical Autonomous Structure[J]. , 1999, 14(1): 64 -73 .
[8] 陈海明;. Function Definition Language FDL andIts Implementation[J]. , 1999, 14(4): 414 -421 .
[9] . 水稻基因组中预测基因的程序评估及测试数据集[J]. , 2005, 20(4): 446 -453 .
[10] . 一种从等参线网格中生成双三次、 G^1 连续的 B 样条船壳曲面的方法[J]. , 2006, 21(2): 265 -271 .
版权所有 © 《计算机科学技术学报》编辑部
本系统由北京玛格泰克科技发展有限公司设计开发 技术支持:support@magtech.com.cn
总访问量: