What Are They Talking About? Analyzing Code Reviews in Pull-Based Development Model

Zhi-Xing Li; Yue Yu; Gang Yin; Tao Wang; Huai-Min Wang

doi:10.1007/s11390-017-1783-2

Zhi-Xing Li, Yue Yu, Gang Yin, Tao Wang, Huai-Min Wang. What Are They Talking About? Analyzing Code Reviews in Pull-Based Development Model[J]. Journal of Computer Science and Technology, 2017, 32(6): 1060-1075. DOI: 10.1007/s11390-017-1783-2

Citation:

What Are They Talking About? Analyzing Code Reviews in Pull-Based Development Model

Abstract

Abstract

Code reviews in pull-based model are open to community users on GitHub. Various participants are taking part in the review discussions and the review topics are not only about the improvement of code contributions but also about project evolution and social interaction. A comprehensive understanding of the review topics in pull-based model would be useful to better organize the code review process and optimize review tasks such as reviewer recommendation and pull-request prioritization. In this paper, we first conduct a qualitative study on three popular open-source software projects hosted on GitHub and construct a fine-grained two-level taxonomy covering four level-1 categories (code correctness, pullrequest decision-making, project management, and social interaction) and 11 level-2 subcategories (e.g., defect detecting, reviewer assigning, contribution encouraging). Second, we conduct preliminary quantitative analysis on a large set of review comments that were labeled by TSHC (a two-stage hybrid classification algorithm), which is able to automatically classify review comments by combining rule-based and machine-learning techniques. Through the quantitative study, we explore the typical review patterns. We find that the three projects present similar comments distribution on each subcategory. Pull-requests submitted by inexperienced contributors tend to contain potential issues even though they have passed the tests. Furthermore, external contributors are more likely to break project conventions in their early contributions.

FullText(HTML)

References (29)

Relative Articles

Supplements (0)

Cited By

What Are They Talking About? Analyzing Code Reviews in Pull-Based Development Model

Abstract

Catalog

Export File

Citation

Format

Content