We use cookies to improve your experience with our site.

基于主题建模的软件资源库变更集预警优先级排序

Topic Modeling Based Warning Prioritization from Change Sets of Software Repository

  • 摘要: 许多现有的预警优先技术对静态分析预警进行重新排序,以便最先提供真正例TP。然而,因为一些警告事实上并不是真正例或者与代码环境和主题无相关,所以在此需要大量的时间调查并修复警告的优先级。本文提出了警告优先级排序技术,它能从缺陷相关代码块中找出各种潜在的主题。本文工作的目的是建立一个优先级模型,该模型包含基于变更集主题的单独的警告优先级,从而发现真正例的警告的数目。为对该模型进行性能评价,我们使用了在诸多警告优先级排序研究中被广泛使用的性能指标:警告检测率,并对此模型与其它有竞争力的技术作了比较。此外,我们将该技术运用于一实体跨国公司的8个产业项目,从而验证其有效性。

     

    Abstract: Many existing warning prioritization techniques seek to reorder the static analysis warnings such that true positives are provided first. However, excessive amount of time is required therein to investigate and fix prioritized warnings because some are not actually true positives or are irrelevant to the code context and topic. In this paper, we propose a warning prioritization technique that reflects various latent topics from bug-related code blocks. Our main aim is to build a prioritization model that comprises separate warning priorities depending on the topic of the change sets to identify the number of true positive warnings. For the performance evaluation of the proposed model, we employ a performance metric called warning detection rate, widely used in many warning prioritization studies, and compare the proposed model with other competitive techniques. Additionally, the effectiveness of our model is verified via the application of our technique to eight industrial projects of a real global company.

     

/

返回文章
返回