›› 2017, Vol. 32 ›› Issue (4): 828-842.doi: 10.1007/s11390-017-1763-6

Special Issue: Artificial Intelligence and Pattern Recognition; Data Management and Data Mining

• Special Issue on Deep Learning • Previous Articles    

An Experimental Study of Text Representation Methods for Cross-Site Purchase Preference Prediction Using the Social Text Data

Ting Bai1,2, Student Member, CCF, Hong-Jian Dou1,2,Wayne Xin Zhao1,2,3,*, Member, CCF, ACM, IEEE, Ding-Yi Yang1, Ji-Rong Wen1,2, Member, CCF, ACM, IEEE   

  1. 1 School of Information, Renmin University of China, Beijing 100872, China;
    2 Beijing Key Laboratory of Big Data Management and Analysis Methods, Beijing 100872, China;
    3 Guangdong Key Laboratory of Big Data Analysis and Processing, Guangzhou 510006, China
  • Received:2016-12-21 Revised:2017-05-24 Online:2017-07-05 Published:2017-07-05
  • Contact: Wayne-Xin Zhao E-mail:batmanfly@gmail.com
  • Supported by:

    This work was partially supported by the National Natural Science Foundation of China under Grant Nos. 61502502 and 61502501, the National Basic Research 973 Program of China under Grant No. 2014CB340403 and the Beijing Natural Science Foundation under Grant No. 4162032. Ting Bai was supported by the Outstanding Innovative Talents Cultivation Funded Programs 2016 of Renmin Univertity of China. Wayne Xin Zhao was also supported by the Opening Project of Guangdong Province Key Laboratory of Big Data Analysis and Processing under Grant No. 2017001.

Nowadays, many e-commerce websites allow users to login with their existing social networking accounts. When a new user comes to an e-commerce website, it is interesting to study whether the information from external social media platforms can be utilized to alleviate the cold-start problem. In this paper, we focus on a specific task on cross-site information sharing, i.e., leveraging the text posted by a user on the social media platform (termed as social text) to infer his/her purchase preference of product categories on an e-commerce platform. To solve the task, a key problem is how to effectively represent the social text in a way that its information can be utilized on the e-commerce platform. We study two major kinds of text representation methods for predicting cross-site purchase preference, including shallow textual features and deep textual features learned by deep neural network models. We conduct extensive experiments on a large linked dataset, and our experimental results indicate that it is promising to utilize the social text for predicting purchase preference. Specially, the deep neural network approach has shown a more powerful predictive ability when the number of categories becomes large.

[1] Zhang Y, Pennacchiotti M. Recommending branded products from social media. In Proc. the 7th ACM Conference on Recommender Systems, Oct. 2013, pp.77-84.

[2] Ricci F, Rokach L, Shapira B. Introduction to Recommender Systems Handbook. Springer US, 2011, pp.1-35.

[3] Zhang Y, Pennacchiotti M. Predicting purchase behaviors from social media. In Proc. the 22nd International Conference on World Wide Web, May 2013, pp.1521-1532.

[4] Wang J, ZhaoWX, He Y, Li X. Leveraging product adopter information from online reviews for product recommendation. In Proc. the 9th International AAAI Conference on Web and Social Media, May 2015, pp.464-472.

[5] Wang J, Cong G, Zhao W X, Li X. Mining user intents in Twitter:A semi-supervised approach to inferring intent categories for tweets. In Proc. the 29th AAAI Conference on Artificial Intelligence, Jan. 2015, pp.318-324.

[6] Zhao W X, Guo Y, He Y, Jiang H, Wu Y, Li X. We know what you want to buy:A demographic-based system for product recommendation on microblogs. In Proc. the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2014, pp.1935-1944.

[7] Tang D, Wei F, Yang N, Zhou M, Liu T, Qin B. Learning sentiment-specific word embedding for Twitter sentiment classification. In Proc. the 52nd Annual Meeting of the Association for Computational Linguistics, June 2014, pp.1555-1565.

[8] Cao Z, Li W, Li S, Wei F, Li Y. AttSum:Joint learning of focusing and summarization with neural attention. In Proc. COLING, Dec. 2016, pp.547-556.

[9] Gui L, Xu R, He Y, Lu Q, Wei Z. Intersubjectivity and sentiment:From language to knowledge. In Proc. the 25th International Joint Conference on Artificial Intelligence, July 2016, pp.2789-2795.

[10] Lai S, Xu L, Liu K, Zhao J. Recurrent convolutional neural networks for text classification. In Proc. the 29th International AAAI Conference on Web and Social Media, Jan. 2015, pp.2267-2273.

[11] Yih W, He X, Meek C. Semantic parsing for single-relation question answering. In Proc. the 52nd Annual Meeting of the Association for Computational Linguistics, June 2014, pp.643-648.

[12] Kalchbrenner N, Grefenstette E, Blunsom P. A convolutional neural network for modelling sentences. In Proc. the 52nd Annual Meeting of the Association for Computational Linguistics, June 2014, pp.655-665.

[13] Basili R. Learning to classify text using support vector machines:Methods, theory, and algorithms by Thorsten Joachims. Computational Linguistics, 2003, 29(4):655-661.

[14] Salton G, Wong A, Yang C S. A vector space model for automatic indexing. Commun. ACM, 1975, 18(11):613-620.

[15] Robertson S. Understanding inverse document frequency:On theoretical arguments for IDF. Journal of Documentation, 2004, 60(5):503-520.

[16] Blei D M, Ng A Y, Jordan M I. Latent Dirichlet allocation. In Proc. NIPS, Dec. 2001, pp.601-608.

[17] Seroussi Y, Bohnert F, Zukerman I. Personalised rating prediction for new users using latent factor models. In Proc. the 22nd ACM Conference on Hypertext and Hypermedia, June 2011, pp.47-56.

[18] Mikolov T, Chen K, Corrado G, Dean J. Efficient estimation of word representations in vector space. In Proc. International Conference on Learning Representations, May 2013.

[19] Mikolov T, Sutskever I, Chen K, Corrado G S, Dean J. Distributed representations of words and phrases and their compositionality. In Proc. NIPS, Dec. 2013, pp.3111-3119.

[20] Le Q, Mikolov T. Distributed representations of sentences and documents. In Proc. the 31st International Conference on Machine Learning, June 2014, pp.1188-1196.

[21] Kim Y. Convolutional neural networks for sentence classification. In Proc. the Empirical Methods in Natural Language Processing, Oct. 2014, pp.1746-1751.

[22] Hinton G E, Salakhutdinov R R. Reducing the dimensionality of data with neural networks. Science, 2006, 313(5786):504-507.

[23] Erhan D, Bengio Y, Courville A, Manzagol P A, Vincent P, Bengio S. Why does unsupervised pretraining help deep learning? Journal of Machine Learning Research, 2010, 11:625-660.

[24] Collobert R, Weston J, Bottou L, Karlen M, Kavukcuoglu K, Kuksa P. Natural language processing (almost) from scratch. Journal of Machine Learning Research, 2011, 12:2493-2537.

[25] Kalchbrenner N, Blunsom P. Recurrent convolutional neural networks for discourse compositionality. In Proc. the Workshop on Continuous Vector Space Models and Their Compositionality, Aug. 2013, pp.119-126.

[26] Hochreiter S, Schmidhuber J. Long short-term memory. Neural Computation, 1997, 9(8):1735-1780.

[27] Gers F A, Schraudolph N N, Schmidhuber J. Learning precise timing with LSTM recurrent networks. Journal of Machine Learning Research, 2002, 3:115-143.

[28] Zhang S, Zheng D, Hu X, Yang M. Bidirectional long shortterm memory networks for relation classification. In Proc. the 29th Pacific Asia Conference on Language, Information and Computation, Oct.30-Nov.1, 2015.

[29] Jarvelin K, Kekalainen J. Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems, 2002, 20(4):422-446.

[30] Burges C, Shaked T, Renshaw E, Lazier A, Deeds M, Hamilton N, Hullender G N. Learning to rank using gradient descent. In Proc. the 22nd Annual International Conference on Machine Learning, Aug. 2005, pp.89-96.

[31] Zhao W X, Li S, He Y, Chang E Y, Wen J, Li X. Connecting social media to e-commerce:Cold-start product recommendation using microblogging information. IEEE Transactions on Knowledge and Data Engineering, 2016, 28(5):1147-1159.

[32] Guo S, Wang M, Leskovec J. The role of social networks in online shopping:Information passing, price of trust, and consumer choice. In Proc. the 12th ACM Conference on Electronic Commerce, June 2011, pp.157-166.

[33] Bhatt R, Chaoji V, Parekh R. Predicting product adoption in large-scale social networks. In Proc. the 19th ACM International Conference on Information and Knowledge Management, Oct. 2010, pp.1039-1048.

[34] Zhou F, Ji Y, Jiao R J. Predicting product adoption in large social networks for demand estimation. In Proc. ISPECE, Sept. 2014, pp.890-899.

[35] Hill S, Provost F, Volinsky C. Network-based marketing:Identifying likely adopters via consumer networks. Statistical Science, 2006, 21(2):256-276.

[36] Iyengar R, Han S, Gupta S. Do friends influence purchases in a social network? SSRN ELectronic Journal, 2009.

[37] Pan S J, Yang Q. A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering, 2010, 22(10):1345-1359.

[38] PanW, Xiang EW, Yang Q. Transfer learning in collaborative filtering with uncertain ratings. In Proc. the 26th AAAI Conference on Artificial Intelligence, July 2012, pp.662-668.

[39] Li B, Yang Q, Xue X. Can movies and books collaborate? Cross-domain collaborative filtering for sparsity reduction. In Proc. the 21st International Joint Conference on Artificial Intelligence, July 2009, pp.2052-2057.

[40] Singh A P, Gordon G J. Relational learning via collective matrix factorization. In Proc. the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2008, pp.650-658.

[41] Zhao L, Pan S J, Xiang E W, Zhong E, Lu Z, Yang Q. Active transfer learning for cross-system recommendation. In Proc. the 27th AAAI Conference on Artificial Intelligence, July 2013.

[42] Hu L, Cao J, Xu G, Cao L, Gu Z, Zhu C. Personalized recommendation via cross-domain triadic factorization. In Proc. the 22nd International Conference on World Wide Web, May 2013, pp.595-606.

[43] Cao Y, Xu R, Chen T. Combining convolutional neural network and support vector machine for sentiment classification. In Proc. the 4th National Conference on Social Media Processing, Nov. 2015, pp.144-155.

[44] Miwa M, Bansal M. End-to-end relation extraction using LSTMs on sequences and tree structures. In Proc. the 54th Annual Meeting of the Association for Computational Linguistics, Aug. 2016.

[45] Xu Y, Jia R, Mou L, Li G, Chen Y, Lu Y, Jin Z. Improved relation classification by deep recurrent neural networks with data augmentation. In Proc. the 26th International Conference on Computational Linguistics, Dec. 2016, pp.1461-1470.

[46] Chen W F, Ku L W. UTCNN:A deep learning model of stance classification on social media text. In Proc. the 26th International Conference on Computational Linguistics, Dec. 2016, pp.1635-1645.

[47] Dong L, Wei F, Zhou M, Xu K. Adaptive multicompositionality for recursive neural models with applications to sentiment analysis. In Proc. the 28th AAAI Conference on Artificial Intelligence, July 2014, pp.1537-1543.

[48] Hermann K M, Kocisky T, Grefenstette E, Espeholt L, Kay W, Suleyman M, Blunsom P. Teaching machines to read and comprehend. In Proc. NIPS, Dec. 2015, pp.1693-1701.

[49] Kobayashi S, Tian R, Okazaki N, Inui K. Dynamic entity representation with max-pooling improves machine reading. In Proc. the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics, June 2016, pp.850-855.

[50] Chen D, Bolton J, Manning C D. A thorough examination of the CNN/daily mail reading comprehension task. In Proc. the 54th Annual Meeting of the Association for Computational Linguistics, Aug. 2016.

[51] Wang L, Cao Z, de Melo G, Liu Z. Relation classification via multi-level attention CNNs. In Proc. the 54th Annual Meeting of the Association for Computational Linguistics, Aug. 2016.
No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] Harald E. Otto;. UNDO, An Aid for Explorative Learning?[J]. , 1992, 7(3): 226 -236 .
[2] Gu Junzhong;. Modelling Enterprises with Object-Oriented Paradigm[J]. , 1993, 8(3): 80 -89 .
[3] Zhao Zhaokeng; Dai Jun; Chen Wendan;. Automated Theorem Proving in Temporal Logic:T-Resolution[J]. , 1994, 9(1): 53 -62 .
[4] CAI Jiamei;. The Sequence Modeling Method Based on ECCin Developing Program Specifications[J]. , 1999, 14(4): 337 -348 .
[5] I-Shyan Hwang, I-Feng Huang, and Shin-Cheng Yu. Dynamic Fuzzy Controlled RWA Algorithm for IP/GMPLS over WDM Networks[J]. , 2005, 20(5): 717 -727 .
[6] Zhou-Wang Yang, Chun-Lin Wu, Jian-Song Deng,and Fa-Lai Chen. Specification of Initial Shapes for Dynamic Implicit Curve/Surface Reconstruction[J]. , 2006, 21(2): 249 -254 .
[7] Tie-Jun Huang and Yong-Liang Liu. Basic Considerations on AVS DRM Architecture[J]. , 2006, 21(3): 366 -369 .
[8] Xiao-Qing Zheng, Hua-Jun Chen, Zhao-Hui Wu, and Yu-Xin Mao. Dynamic Query Optimization Approach for Semantic Database Grid[J]. , 2006, 21(4): 597 -608 .
[9] Ian Foster. Globus Toolkit Version 4: Software for Service-Oriented Systems[J]. , 2006, 21(4): 513 -520 .
[10] Guan-Qun Gu and Jun-Zhou Luo. Some Issues on Computer Networks: Architecture and Key Technologies[J]. , 2006, 21(5): 708 -722 .

ISSN 1000-9000(Print)

         1860-4749(Online)
CN 11-2296/TP

Home
Editorial Board
Author Guidelines
Subscription
Journal of Computer Science and Technology
Institute of Computing Technology, Chinese Academy of Sciences
P.O. Box 2704, Beijing 100190 P.R. China
Tel.:86-10-62610746
E-mail: jcst@ict.ac.cn
 
  Copyright ©2015 JCST, All Rights Reserved