Summarizing Software Artifacts: A Literature Review

Najam Nazar; Yan Hu; He Jiang

doi:10.1007/s11390-016-1671-1

Najam Nazar, Yan Hu, He Jiang. Summarizing Software Artifacts: A Literature Review[J]. Journal of Computer Science and Technology, 2016, 31(5): 883-909. DOI: 10.1007/s11390-016-1671-1

Citation:

Najam Nazar, Yan Hu, He Jiang. Summarizing Software Artifacts: A Literature Review[J]. Journal of Computer Science and Technology, 2016, 31(5): 883-909. DOI: 10.1007/s11390-016-1671-1

Citation:

Najam Nazar, Yan Hu, He Jiang. Summarizing Software Artifacts: A Literature Review[J]. Journal of Computer Science and Technology, 2016, 31(5): 883-909. DOI: 10.1007/s11390-016-1671-1

Summarizing Software Artifacts: A Literature Review

Abstract

Abstract

This paper presents a literature review in the field of summarizing software artifacts, focusing on bug reports, source code, mailing lists and developer discussions artifacts. From Jan. 2010 to Apr. 2016, numerous summarization techniques, approaches, and tools have been proposed to satisfy the ongoing demand of improving software performance and quality and facilitating developers in understanding the problems at hand. Since aforementioned artifacts contain both structured and unstructured data at the same time, researchers have applied different machine learning and data mining techniques to generate summaries. Therefore, this paper first intends to provide a general perspective on the state of the art, describing the type of artifacts, approaches for summarization, as well as the common portions of experimental procedures shared among these artifacts. Moreover, we discuss the applications of summarization, i.e., what tasks at hand have been achieved through summarization. Next, this paper presents tools that are generated for summarization tasks or employed during summarization tasks. In addition, we present different summarization evaluation methods employed in selected studies as well as other important factors that are used for the evaluation of generated summaries such as adequacy and quality. Moreover, we briefly present modern communication channels and complementarities with commonalities among different software artifacts. Finally, some thoughts about the challenges applicable to the existing studies in general as well as future research directions are also discussed. The survey of existing studies will allow future researchers to have a wide and useful background knowledge on the main and important aspects of this research field.

FullText(HTML)

References (87)

Relative Articles

Supplements (0)

Cited By

Summarizing Software Artifacts: A Literature Review

Abstract

Catalog

Export File

Citation

Format

Content