Zhi-Jing Wu, Yi-Qun Liu, Jia-Xin Mao, Min Zhang, Shao-Ping Ma. Leveraging Document-Level and Query-Level Passage Cumulative Gain for Document Ranking[J]. Journal of Computer Science and Technology, 2022, 37(4): 814-838. DOI: 10.1007/s11390-022-2031-y

Leveraging Document-Level and Query-Level Passage Cumulative Gain for Document Ranking

Document ranking is one of the most studied yet most challenging problems in information retrieval (IR). A growing number of studies address it through fine-grained document modeling. However, most of them focus on context-independent passage-level relevance signals and ignore context information. In this paper, we investigate how information gain accumulates across passages and propose the context-aware Passage Cumulative Gain (PCG). The fine-grained PCG avoids the need to split documents into independent passages. We investigate PCG patterns at the document level (DPCG) and the query level (QPCG). Based on these patterns, we propose a BERT-based sequential model called the Passage-level Cumulative Gain Model (PCGM) and show that PCGM can effectively predict PCG sequences. Finally, we apply PCGM to the document ranking task using two approaches. The first leverages DPCG sequences to estimate the gain of an individual document; experimental results on two public ad hoc retrieval datasets show that PCGM outperforms most existing ranking models. The second considers cross-document effects and leverages QPCG sequences to estimate marginal relevance; experimental results show that the predicted results are highly consistent with users' preferences. We believe that this work contributes to improving ranking performance and providing more explainability for document ranking.