Journal of Computer Science and Technology, 2016, Vol. 31, Issue (3): 595-603. doi: 10.1007/s11390-016-1649-z

Special Issue: Artificial Intelligence and Pattern Recognition

• Data Management and Data Mining •

A Hybrid Method of Domain Lexicon Construction for Opinion Targets Extraction Using Syntax and Semantics

Chun Liao, Chong Feng*, Member, CCF, Sen Yang, and He-Yan Huang, Member, CCF   

  1 Department of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China;
  2 Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications, Beijing Institute of Technology, Beijing 100081, China
  • Received: 2014-11-17 Revised: 2016-02-29 Online: 2016-05-05 Published: 2016-05-05
  • Contact: Chong Feng E-mail: fengchong@bit.edu.cn
  • Supported by:

    The work was supported by the National Basic Research 973 Program of China under Grant Nos. 2013CB329605 and 2013CB329303, and the National Natural Science Foundation of China under Grant No. 61201351.

Opinion target extraction from Chinese microblogs plays an important role in opinion mining. Significant progress has been made in this area recently, especially by methods based on conditional random fields (CRF). However, these methods consider only lexicon-related features and do not exploit the implied syntactic and semantic knowledge. We propose a novel approach that combines a domain lexicon with groups of syntactic and semantic features. The approach builds the domain lexicon by exploring syntactic and semantic information through part-of-speech tags, dependency structure, phrase structure, semantic roles, and semantic similarity based on word embeddings. We then combine the domain lexicon with the opinion targets extracted by a CRF that uses these groups of features. Experimental results on the COAE2014 dataset show that the approach outperforms other well-known methods on the task of opinion target extraction.
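
As a rough illustration of the embedding-based similarity step mentioned in the abstract, the sketch below grows a small seed lexicon by adding any word whose cosine similarity to a seed exceeds a threshold. It is not the authors' implementation: the function names, the threshold value, and the three-dimensional toy vectors are all assumptions made for the example, and a real system would load trained word embeddings instead.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def expand_lexicon(seeds, embeddings, threshold=0.9):
    """Return the seeds plus every word similar enough to at least one seed.

    `seeds`, `embeddings`, and `threshold` are hypothetical names for this
    sketch; the paper does not specify an interface or a cutoff.
    """
    known_seeds = [s for s in seeds if s in embeddings]
    lexicon = set(seeds)
    for word, vec in embeddings.items():
        if word in lexicon:
            continue
        if any(cosine(vec, embeddings[s]) >= threshold for s in known_seeds):
            lexicon.add(word)
    return lexicon


# Toy vectors invented for illustration only (not trained embeddings).
TOY_EMBEDDINGS = {
    "screen": [0.9, 0.1, 0.0],
    "display": [0.85, 0.15, 0.05],
    "battery": [0.1, 0.9, 0.1],
    "weather": [0.0, 0.1, 0.95],
}

expanded = expand_lexicon({"screen"}, TOY_EMBEDDINGS, threshold=0.9)
print(sorted(expanded))
```

With these toy vectors only "display" lies close enough to the seed "screen" to be added. In practice the threshold trades lexicon coverage against noise and would need tuning on held-out data.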

ISSN 1000-9000(Print)

CN 11-2296/TP

Journal of Computer Science and Technology
Institute of Computing Technology, Chinese Academy of Sciences
P.O. Box 2704, Beijing 100190 P.R. China
E-mail: jcst@ict.ac.cn
  Copyright ©2015 JCST, All Rights Reserved