Journal of Computer Science and Technology ›› 2021, Vol. 36 ›› Issue (3): 633-663. DOI: 10.1007/s11390-020-0207-x

Special Issue: Artificial Intelligence and Pattern Recognition

• Regular Paper •

A Survey of Text Summarization Approaches Based on Deep Learning

Sheng-Luan Hou1,2, Xi-Kun Huang2,3,4, Chao-Qun Fei1,2, Shu-Han Zhang1,2, Yang-Yang Li3,4, Qi-Lin Sun2,3,4, and Chuan-Qing Wang2,3,4        

  1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China;
    2 University of Chinese Academy of Sciences, Beijing 100049, China;
    3 Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China;
    4 Key Laboratory of Management, Decision and Information System, Chinese Academy of Sciences, Beijing 100190, China
  • Received: 2019-12-06  Revised: 2020-12-24  Online: 2021-05-05  Published: 2021-05-31
  • About author: Sheng-Luan Hou received his M.S. degree in mathematics from Beijing University of Technology, Beijing, in 2014. He is now a Ph.D. candidate at the Institute of Computing Technology (ICT), Chinese Academy of Sciences, Beijing. His research interests include deep learning, automatic text summarization, and artificial intelligence.
  • Supported by:
    This work was supported by the National Key Research and Development Program of China under Grant No. 2016YFB1000902 and the National Natural Science Foundation of China under Grant Nos. 61232015, 61472412, and 61621003.

Automatic text summarization (ATS) has achieved impressive performance thanks to recent advances in deep learning (DL) and the availability of large-scale corpora. The two key challenges in ATS are estimating the salience of information and generating coherent results, and a variety of DL-based approaches have recently been developed to better address both. However, the literature still lacks a comprehensive review of DL-based ATS. This paper comprehensively reviews the significant DL-based approaches proposed for generic ATS tasks and walks through their evolution. We first give an overview of ATS and DL, and compare the datasets commonly used for model training, validation, and evaluation. We then summarize single-document summarization approaches, followed by an overview of multi-document summarization approaches, and we analyze the performance of popular ATS models on common datasets, noting which approaches suit which ATS tasks. Finally, we propose potential research directions in this fast-growing field. We hope this survey provides new insights into future research on DL-based ATS.
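
To make the attentional encoder-decoder framework named in the abstract and keywords concrete, the following is a minimal, self-contained PyTorch sketch of a sequence-to-sequence abstractive summarizer with additive (Bahdanau-style) attention. It is an illustration only, not the survey's or any reviewed paper's implementation: all module names, dimensions, and hyper-parameters below are assumptions chosen for brevity.

import torch
import torch.nn as nn
import torch.nn.functional as F

class AttentionalSeq2Seq(nn.Module):
    def __init__(self, vocab_size, emb_dim=128, hid_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        # Bidirectional GRU encoder reads the source document.
        self.encoder = nn.GRU(emb_dim, hid_dim, batch_first=True,
                              bidirectional=True)
        # Unidirectional GRU decoder generates the summary token by token,
        # conditioned on the previous token and the attention context.
        self.decoder = nn.GRUCell(emb_dim + 2 * hid_dim, hid_dim)
        self.enc_to_dec = nn.Linear(2 * hid_dim, hid_dim)
        # Additive attention parameters: score_i = v^T tanh(W_e h_i + W_d s_t).
        self.attn_enc = nn.Linear(2 * hid_dim, hid_dim, bias=False)
        self.attn_dec = nn.Linear(hid_dim, hid_dim, bias=False)
        self.attn_v = nn.Linear(hid_dim, 1, bias=False)
        self.out = nn.Linear(hid_dim + 2 * hid_dim, vocab_size)

    def attend(self, dec_state, enc_outputs):
        # One scalar score per source position, normalized to a distribution;
        # this is where salience over the input is (softly) estimated.
        scores = self.attn_v(torch.tanh(
            self.attn_enc(enc_outputs) +
            self.attn_dec(dec_state).unsqueeze(1))).squeeze(-1)
        weights = F.softmax(scores, dim=-1)
        # Context vector: attention-weighted sum of encoder states.
        return torch.bmm(weights.unsqueeze(1), enc_outputs).squeeze(1)

    def forward(self, src_ids, tgt_ids):
        enc_outputs, enc_hidden = self.encoder(self.embed(src_ids))
        # Combine the two directions' final states to initialize the decoder.
        state = torch.tanh(self.enc_to_dec(
            torch.cat([enc_hidden[0], enc_hidden[1]], dim=-1)))
        logits = []
        for t in range(tgt_ids.size(1)):
            context = self.attend(state, enc_outputs)
            # Teacher forcing: feed the gold previous summary token.
            step_in = torch.cat([self.embed(tgt_ids[:, t]), context], dim=-1)
            state = self.decoder(step_in, state)
            logits.append(self.out(torch.cat([state, context], dim=-1)))
        return torch.stack(logits, dim=1)

# Toy usage: a batch of 2 "documents" of 12 tokens, summaries of 5 tokens.
model = AttentionalSeq2Seq(vocab_size=1000)
src = torch.randint(0, 1000, (2, 12))
tgt = torch.randint(0, 1000, (2, 5))
print(model(src, tgt).shape)  # torch.Size([2, 5, 1000])

Many of the refinements the survey covers (pointer/copy mechanisms, coverage, reinforcement learning objectives, Transformer encoders) can be read as modifications of this basic template.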

Key words: automatic text summarization; artificial intelligence; deep learning; attentional encoder-decoder; natural language processing
