Review Authorship Attribution in a Similarity Space

Tie-Yun Qian; Bing Liu; Qing Li; Jianfeng Si

doi:10.1007/s11390-015-1513-6

Tie-Yun Qian, Bing Liu, Qing Li, Jianfeng Si. Review Authorship Attribution in a Similarity SpaceJ. Journal of Computer Science and Technology, 2015, 30(1): 200-213. DOI: 10.1007/s11390-015-1513-6

Citation:

Tie-Yun Qian, Bing Liu, Qing Li, Jianfeng Si. Review Authorship Attribution in a Similarity SpaceJ. Journal of Computer Science and Technology, 2015, 30(1): 200-213. DOI: 10.1007/s11390-015-1513-6

Citation:

Tie-Yun Qian, Bing Liu, Qing Li, Jianfeng Si. Review Authorship Attribution in a Similarity SpaceJ. Journal of Computer Science and Technology, 2015, 30(1): 200-213. DOI: 10.1007/s11390-015-1513-6

Review Authorship Attribution in a Similarity Space

Abstract

Abstract

Authorship attribution, also known as authorship classification, is the problem of identifying the authors (reviewers) of a set of documents (reviews). The common approach is to build a classifier using supervised learning. This approach has several issues which hurts its applicability. First, supervised learning needs a large set of documents from each author to serve as the training data. This can be difficult in practice. For example, in the online review domain, most reviewers (authors) only write a few reviews, which are not enough to serve as the training data. Second, the learned classifier cannot be applied to authors whose documents have not been used in training. In this article, we propose a novel solution to deal with the two problems. The core idea is that instead of learning in the original document space, we transform it to a similarity space. In the similarity space, the learning is able to naturally tackle the issues. Our experiment results based on online reviews and reviewers show that the proposed method outperforms the state-of-the-art supervised and unsupervised baseline methods significantly.

FullText(HTML)

References (47)

Relative Articles

Supplements (0)

Cited By

Review Authorship Attribution in a Similarity Space

Abstract

Catalog

Export File

Citation

Format

Content