We use cookies to improve your experience with our site.
陈坚永, 罗镇川, 邓燕玲, 张圭煜. 一种运用规则类型依赖与复杂短语分析的词语情感极性分类方法[J]. 计算机科学技术学报, 2012, 27(3): 650-666. DOI: 10.1007/s11390-012-1251-y
引用本文: 陈坚永, 罗镇川, 邓燕玲, 张圭煜. 一种运用规则类型依赖与复杂短语分析的词语情感极性分类方法[J]. 计算机科学技术学报, 2012, 27(3): 650-666. DOI: 10.1007/s11390-012-1251-y
Luke Kien-Weng Tan, Jin-Cheon Na, Yin-Leng Theng, Kuiyu Chang. Phrase-Level Sentiment Polarity Classification Using Rule-Based Typed Dependencies and Additional Complex Phrases Consideration[J]. Journal of Computer Science and Technology, 2012, 27(3): 650-666. DOI: 10.1007/s11390-012-1251-y
Citation: Luke Kien-Weng Tan, Jin-Cheon Na, Yin-Leng Theng, Kuiyu Chang. Phrase-Level Sentiment Polarity Classification Using Rule-Based Typed Dependencies and Additional Complex Phrases Consideration[J]. Journal of Computer Science and Technology, 2012, 27(3): 650-666. DOI: 10.1007/s11390-012-1251-y

一种运用规则类型依赖与复杂短语分析的词语情感极性分类方法

Phrase-Level Sentiment Polarity Classification Using Rule-Based Typed Dependencies and Additional Complex Phrases Consideration

  • 摘要: 随着Web 2.0的兴起,互联网用户发布的内容量越来越多.这些丰富的自由格式文本具有主观色彩的观点词语并足以影响人们的决策与行动.信息发布者由此对广大用户施加了一定的影响,例如,博客内容不仅可以影响人们的购买行为,也可以左右其对政治的看法以及理财规划等.一般来说,我们可以通过识别文本所体现的观点倾向来获取人们对于关注主题的态度及其对普通用户产生的影响.本文提出了一种通过挖掘极性规则以及分析词语间复杂关系来识别观点词语情感极性的自动方法.当前情感分析研究,在准确分析文本内容的语法和语义的条件下,着眼于通过使用类型依赖语法解析来处理词语间的功能性关系.典型做法是采用启发式方式来识别类型依赖极性模式,其不足之处在于无法检测到所有规则.我们在本文中提出使用分类序列规则CSR(Class Sequential Rules)来自动识别各种类型依赖模式,并将CSR与另一种启发式方法进行对比验证.实验结果显示,CSR的情感极性分类性能有显著的提高,在测试实例上得到80%的F1值.此外,研究还发现词语间的复杂关系足以影响情感极性分类性能,我们也进一步地探讨了处理词语间这类依赖关系的一些解决方案.

     

    Abstract: The advent of Web 2.0 has led to an increase in user-generated content on the Web. This has provided an extensive collection of free-style texts with opinion expressions that could influence the decisions and actions of their readers. Providers of such content exert a certain level of influence on the receivers and this is evident from blog sites having effect on their readers' purchase decisions, political view points, financial planning, and others. By detecting the opinion expressed, we can identify the sentiments on the topics discussed and the influence exerted on the readers. In this paper, we introduce an automatic approach in deriving polarity pattern rules to detect sentiment polarity at the phrase level, and in addition consider the effects of the more complex relationships found between words in sentiment polarity classification. Recent sentiment analysis research has focused on the functional relations of words using typed dependency parsing, providing a refined analysis on the grammar and semantics of textual data. Heuristics are typically used to determine the typed dependency polarity patterns, which may not comprehensively identify all possible rules. We study the use of class sequential rules (CSRs) to automatically learn the typed dependency patterns, and benchmark the performance of CSR against a heuristic method. Preliminary results show CSR leads to further improvements in classification performance achieving over 80% F1 scores in the test cases. In addition, we observe more complex relationships between words that could influence phrase sentiment polarity, and further discuss on possible approaches to handle the effects of these complex relationships.

     

/

返回文章
返回