We use cookies to improve your experience with our site.

基于深度学习的文本摘要研究综述

A Survey of Text Summarization Approaches Based on Deep Learning

  • 摘要: 随着深度学习技术的进步以及大规模语料库的存在,自动文本摘要得到了快速发展,并取得了显著的效果。近几年来,一系列基于深度学习的自动文本摘要方法被提出,来解决自动文本摘要任务中的两大关键点:(1)篇章中文本的重要性评估;(2)生成连贯性的结果。已有文献大都是对方法的详细描述,目前尚缺少相关的综述文献。本文的目的是对已有的基于深度学习的自动文本摘要方法进行全面综述。本文首先是对自动文本摘要和深度学习相关概念的概述,然后是对用于深度学习模型训练、验证和测试的常用数据集的详细介绍。随之,本文对经典和最新的基于深度学习的单文本摘要方法和多文本摘要方法进行了详细总结综述,并给出了经典方法在常用数据集上的性能对比。最后,综合上述内容,本文尝试性给出了本领域的未来研究方向。

     

    Abstract: Automatic text summarization (ATS) has achieved impressive performance thanks to recent advances in deep learning (DL) and the availability of large-scale corpora. The key points in ATS are to estimate the salience of information and to generate coherent results. Recently, a variety of DL-based approaches have been developed for better considering these two aspects. However, there is still a lack of comprehensive literature review for DL-based ATS approaches. The aim of this paper is to comprehensively review significant DL-based approaches that have been proposed in the literature with respect to the notion of generic ATS tasks and provide a walk-through of their evolution. We first give an overview of ATS and DL. The comparisons of the datasets are also given, which are commonly used for model training, validation, and evaluation. Then we summarize single-document summarization approaches. After that, an overview of multi-document summarization approaches is given. We further analyze the performance of the popular ATS models on common datasets. Various popular approaches can be employed for different ATS tasks. Finally, we propose potential research directions in this fast-growing field. We hope this exploration can provide new insights into future research of DL-based ATS.

     

/

返回文章
返回