›› 2015, Vol. 30 ›› Issue (5): 1054-1062.doi: 10.1007/s11390-015-1581-7

Special Issue: Artificial Intelligence and Pattern Recognition; Data Management and Data Mining

• Special Section on Social Media Processing • Previous Articles     Next Articles

Mining Intention-Related Products on Online Q&A Community

Jun-Wen Duan(段俊文), Yi-Heng Chen(陈毅恒), Member, CCF Ting Liu*(刘挺), Senior Member, CCF, ACM, Xiao Ding(丁效), Member, CCF   

  1. School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
  • Received:2014-11-17 Revised:2015-07-09 Online:2015-09-05 Published:2015-09-05
  • Contact: Ting Liu E-mail:tliu@ir.hit.edu.cn
  • About author:Jun-Wen Duan received his B.E. degree in computer science and technology from Harbin Institute of Technology, Harbin, in 2013. Currently, he is a Ph.D. candidate in Harbin Institute of Technology. His current research interests include natural language processing, social computing, and text mining.
  • Supported by:

    The research is supported by the National Basic Research 973 Program of China under Grant No. 2014CB340503 and the National Natural Science Foundation of China under Grant Nos. 61133012, 61202277, and 61472107.

User generated content on social media has attracted much attention from service/product providers, as it contains plenty of potential commercial opportunities. However, previous work mainly focuses on user consumption intention (CI) identification, and little effort has been spent to mine intention-related products. In this paper, focusing on the Baby & Child Care domain, we propose a novel approach to mine intention-related products on online question and answer (Q&A) community. Making use of the question-answering pairs as data source, we first automatically extract candidate products based on dependency parser. And then by means of the collocation extraction model, we identify the real intention-related products from the candidate set. The experimental results on our carefully constructed evaluation dataset show that our approach achieves better performance than two natural baseline methods.

[1] Ashkan A, Clarke C L. Term-based commercial intent analysis. In Proc. the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2009, pp.800-801.

[2] Dai H K, Zhao L, Nie Z, Wen J R, Wang L, Li Y. Detecting online commercial intention (OCI). In Proc. the 15th International Conference on World Wide Web, May 2006, pp.829-837.

[3] Ding X, Liu T, Duan J, Nie J Y. Mining user consumption intention from social media using domain adaptive convolutional neural network. In Proc. the 29th AAAI Conference on Artificial Intelligence, January 2015, pp.2395-2389.

[4] Hollerit B, Kröll M, Strohmaier M. Towards linking buyers and sellers:Detecting commercial intent on Twitter. In Proc. the 22nd International Conference on World Wide Web Companion, May 2013, pp.629-632.

[5] Shah C, Pomerantz J. Evaluating and predicting answer quality in community QA. In Proc. the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2010, pp.411-418.

[6] Kaufmann M. Syntactic normalization of Twitter messages. Studies, 2010, 2:1-7.

[7] Che W, Li Z, Liu T. LTP:A Chinese language technology platform. In Proc. the 23rd International Conference on Computational Linguistics:Demonstrations, August 2010, pp.13-16.

[8] Liu Z, Wang H, Wu H, Li S. Collocation extraction using monolingual word alignment method. In Proc. the 2009 Conference on Empirical Methods in Natural Language Processing, August 2009, pp.487-495.

[9] Grinstead C M, Snell J L. Introduction to Probability (2nd edition). American Mathematical Society, 1997.

[10] Cohen J. Weighted kappa:Nominal scale agreement with provision for scaled disagreement or partial credit. Psychological Bulletin, 1968, 70(4):213-220.

[11] Robertson S E, Walker S. Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. In Proc. the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 1994, pp.232-241.

[12] Zhao W X, Jiang J, He J, Song Y, Achananuparp P, Lim E P, Li X. Topical keyphrase extraction from Twitter. In Proc. the 49th Annual Meeting of the Association for Computational Linguistics:Human Language Technologies, Volume 1, June 2011, pp.379-388.

[13] Ritter A, Clark S, Mausam, Etzioni O. Named entity recognition in tweets:An experimental study. In Proc. the Conference on Empirical Methods in Natural Language Processing, July 2011, pp.1524-1534.

[14] Pak A, Paroubek P. Twitter as a corpus for sentiment analysis and opinion mining. In Proc. LREC, May 2010, pp.1320- 1326.

[15] Sakaki T, Okazaki M, Matsuo Y. Earthquake shakes Twitter users:Real-time event detection by social sensors. In Proc. the 19th International Conference on World Wide Web, April 2010, pp.851-860.

[16] Wang J, Zhao W X, Wei H, Yan H, Li X. Mining new business opportunities:Identifying trend related products by leveraging commercial intents from microblogs. In Proc. the Conference on Empirical Methods in Natural Language Processing, October 2013, pp.1337-1347.
No related articles found!
Full text



[1] Harald E. Otto;. UNDO, An Aid for Explorative Learning?[J]. , 1992, 7(3): 226 -236 .
[2] Xu Meirui; Liu Xiaolin;. A VLSI Algorithm for Calculating the Tree to Tree Distance[J]. , 1993, 8(1): 68 -76 .
[3] Wang Hui; Liu Dayou; Wang Yafei;. Sequential Back-Propagation[J]. , 1994, 9(3): 252 -260 .
[4] Lu Bo; Cai Shijie;. A Skeleton-Based Approach of Automatically Generating Some Chinese Typefaces[J]. , 1996, 11(1): 30 -38 .
[5] Peng Chenglian;. Combining Gprof and Event-Driven Monitoring for Analyzing Distributed Programs:A Rough View of NCSA Mosaic[J]. , 1996, 11(4): 427 -432 .
[6] Zhang Chenghong; Hu Yunfa; Shi Baile;. A Reasoning Mechanism for DeductiveObject-Oriented Databases[J]. , 1997, 12(4): 337 -345 .
[7] WANG Jian; ZHANG Fuyan;. Multicast Address Management and Connection Control Based on Hierarchical Autonomous Structure[J]. , 1999, 14(1): 64 -73 .
[8] CHEN Haiming;. Function Definition Language FDL andIts Implementation[J]. , 1999, 14(4): 414 -421 .
[9] Heng Li, Jin-Song Liu, Zhao Xu et al.. Test Data Sets and Evaluation of Gene Prediction Programs on the Rice Genome[J]. , 2005, 20(4): 446 -453 .
[10] Joong-Hyun Rhim, Doo-Yeoun Cho, Kyu-Yeul Lee, and Tae-Wan Kim. Generation of Discrete Bicubic G^1 B-Spline Ship Hullform Surfaces from a Given Curve Network Using Virtual Iso-Parametric Curves[J]. , 2006, 21(2): 265 -271 .

ISSN 1000-9000(Print)

CN 11-2296/TP

Editorial Board
Author Guidelines
Journal of Computer Science and Technology
Institute of Computing Technology, Chinese Academy of Sciences
P.O. Box 2704, Beijing 100190 P.R. China
E-mail: jcst@ict.ac.cn
  Copyright ©2015 JCST, All Rights Reserved