What Are They Talking About? Analyzing Code Reviews in Pull-Based Development Model
Journal of Computer Science and Technology
Journal of Computer Science and Technology, 2017, Vol. 32, Issue 6: 1060-1075. DOI: 10.1007/s11390-017-1783-2
Special Section on Software Systems
Zhi-Xing Li, Yue Yu*, Member, CCF, ACM, Gang Yin, Member, CCF, ACM, Tao Wang, Member, CCF, ACM, Huai-Min Wang, Fellow, CCF, Member, ACM
College of Computer, National University of Defense Technology, Changsha 410073, China

Abstract: Code reviews in the pull-based model are open to community users on GitHub. Various participants take part in the review discussions, and the review topics concern not only the improvement of code contributions but also project evolution and social interaction. A comprehensive understanding of the review topics in the pull-based model would help to better organize the code review process and optimize review tasks such as reviewer recommendation and pull-request prioritization. In this paper, we first conduct a qualitative study on three popular open-source software projects hosted on GitHub and construct a fine-grained two-level taxonomy covering four level-1 categories (code correctness, pull-request decision-making, project management, and social interaction) and 11 level-2 subcategories (e.g., defect detecting, reviewer assigning, contribution encouraging). Second, we conduct a preliminary quantitative analysis on a large set of review comments labeled by TSHC, a two-stage hybrid classification algorithm that automatically classifies review comments by combining rule-based and machine-learning techniques. Through the quantitative study, we explore the typical review patterns. We find that the three projects present similar distributions of comments across the subcategories. Pull-requests submitted by inexperienced contributors tend to contain potential issues even though they have passed the tests. Furthermore, external contributors are more likely to break project conventions in their early contributions.
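The idea of combining rule-based and machine-learning classification can be illustrated with a minimal sketch. This is not the authors' TSHC algorithm, whose details the abstract does not give. It assumes one simple arrangement: hand-written rules are tried first, and a small Naive Bayes model handles comments that no rule matches. The rules, the training comments, and all function and class names are hypothetical; only the three label names are taken from the abstract.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical keyword rules (assumption, not from the paper).
RULES = [
    (re.compile(r"\b(bug|crash|fails?|broken)\b", re.I), "defect detecting"),
    (re.compile(r"\b(thanks|thank you|great work)\b", re.I), "contribution encouraging"),
    (re.compile(r"@\w+\b.*\b(review|take a look)\b", re.I), "reviewer assigning"),
]

# Hypothetical labeled comments used to train the fallback model.
TRAIN = [
    ("this will throw a null pointer exception when the list is empty", "defect detecting"),
    ("the loop never terminates if the index is negative", "defect detecting"),
    ("awesome contribution, really appreciate the effort", "contribution encouraging"),
    ("really appreciate this, awesome patch", "contribution encouraging"),
    ("could someone from the core team look at this", "reviewer assigning"),
    ("assigning this to the core team maintainers", "reviewer assigning"),
]


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())


class NaiveBayes:
    """Tiny multinomial Naive Bayes with add-one smoothing."""

    def fit(self, texts, labels):
        self.word_counts = defaultdict(Counter)
        self.label_counts = Counter(labels)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, text):
        total = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.label_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n / total)
            for word in tokenize(text):
                if word in self.vocab:  # ignore words never seen in training
                    score += math.log((counts[word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def classify(comment, model):
    """Return (label, source): rules first, learned model as fallback."""
    for pattern, label in RULES:
        if pattern.search(comment):
            return label, "rule"
    return model.predict(comment), "model"


MODEL = NaiveBayes().fit([t for t, _ in TRAIN], [l for _, l in TRAIN])
print(classify("Thanks for the patch!", MODEL))
print(classify("appreciate the awesome effort", MODEL))
```

In this sketch a comment such as "Thanks for the patch!" is labeled by a rule, while a comment that matches no rule is passed to the learned model. Other ways of combining the two kinds of evidence are equally possible, for example feeding rule matches to the model as features.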
Keywords: pull-request, code review, review comment
Received 2017-04-21

This work was supported by the National Key Research and Development Program of China under Grant No. 2016YFB1000805 and the National Natural Science Foundation of China under Grant Nos. 61432020, 61303064, 61472430 and 61502512.

Corresponding Author: Yue Yu (Email: yuyue@nudt.edu.cn)
About the author: Zhi-Xing Li is a Master's student in the College of Computer, National University of Defense Technology, Changsha. His research interests include open source software engineering, data mining, and knowledge discovery in open source software.
Cite this article:   
Zhi-Xing Li, Yue Yu, Gang Yin, Tao Wang, Huai-Min Wang. What Are They Talking About? Analyzing Code Reviews in Pull-Based Development Model[J]. Journal of Computer Science and Technology, 2017, 32(6): 1060-1075