We use cookies to improve your experience with our site.

软件工件摘要方法综述

Summarizing Software Artifacts: A Literature Review

  • 摘要: 本文是针对软件工件摘要技术的综述,其中重点关注bug报告、源代码、邮件列表以及开发者论坛讨论等典型的软件工件。在过去的7年(2010-2016)里,为了帮助改进软件性能与软件质量、方便开发人员快速理解软件架构与细节,研究者们提出了多种软件工件摘要方法。软件工件中既包含具有明显结构的数据,也包含非结构化数据。为此,研究者们提出了多种不同的基于机器学习和数据挖掘的算法来进行软件工件摘要的生成。本综述旨在对现有最新的软件工件摘要技术进行总结,描述了软件工件的种类、软件摘要方法以及软件摘要实验的典型步骤。我们还对软件工件摘要的应用范围进行了讨论。同时,我们对软件工件摘要相关的工具进行了汇总介绍,并对已有研究中使用的软件工件摘要方法以及对于生成的软件工件摘要质量的评估方法进行了讨论和说明。此外,本文简要介绍了产生软件工件数据的现代通信渠道,以及不同软件工件之间的差异与共性。最后,对软件工件摘要研究所面临的挑战,以及未来研究方向进行了讨论。本综述能够为软件工件摘要方向新的研究者们提供广泛而全面的背景知识。

     

    Abstract: This paper presents a literature review in the field of summarizing software artifacts, focusing on bug reports, source code, mailing lists and developer discussions artifacts. From Jan. 2010 to Apr. 2016, numerous summarization techniques, approaches, and tools have been proposed to satisfy the ongoing demand of improving software performance and quality and facilitating developers in understanding the problems at hand. Since aforementioned artifacts contain both structured and unstructured data at the same time, researchers have applied different machine learning and data mining techniques to generate summaries. Therefore, this paper first intends to provide a general perspective on the state of the art, describing the type of artifacts, approaches for summarization, as well as the common portions of experimental procedures shared among these artifacts. Moreover, we discuss the applications of summarization, i.e., what tasks at hand have been achieved through summarization. Next, this paper presents tools that are generated for summarization tasks or employed during summarization tasks. In addition, we present different summarization evaluation methods employed in selected studies as well as other important factors that are used for the evaluation of generated summaries such as adequacy and quality. Moreover, we briefly present modern communication channels and complementarities with commonalities among different software artifacts. Finally, some thoughts about the challenges applicable to the existing studies in general as well as future research directions are also discussed. The survey of existing studies will allow future researchers to have a wide and useful background knowledge on the main and important aspects of this research field.

     

/

返回文章
返回