Automated, Highly-Accurate, Bug Assignment Using Machine Learning and Tossing Graphs (2012)

by Pamela Bhattacharya, Iulian Neamtiu, and Christian R. Shelton


Abstract: Empirical studies indicate that automating the bug assignment process has the potential to significantly reduce software evolution effort and costs. Prior work has used machine band of tools which can be ineffective in large, long-lived software projects. To redress this situation, in this paper we employ a comprehensive set of machine learning tools and a probabilistic graph-based model (bug tossing graphs) that lead to highly-accurate predictions, and lay the foundation for the next generation of machine learning-based bug assignment. Our work is the first to examine the impact of multiple machine learning dimensions (classifiers, attributes, and training history) along with bug tossing graphs on prediction accuracy in bug assignment. We validate our approach on Mozilla and Eclipse, covering 856,259 bug reports and 21 cumulative years of development. We demonstrate that our techniques can achieve up to 86.09% prediction accuracy in bug assignment and significantly reduce tossing path lengths. We show that for our data sets the Na\"{i}ve Bayes classifier coupled with product-component features, tossing graphs and incremental learning performs best. Next, we perform an ablative analysis by unilaterally varying classifiers, features, and learning model to show their relative importance of on bug assignment accuracy. Finally, we propose optimization techniques that achieve high prediction accuracy while reducing training and prediction time.


Download Information

Pamela Bhattacharya, Iulian Neamtiu, and Christian R. Shelton (2012). "Automated, Highly-Accurate, Bug Assignment Using Machine Learning and Tossing Graphs." Journal of Systems and Software, 85(10), 2275-2292. pdf         ext

Bibtex citation

@article{BhaNeaShe12,
   author = "Pamela Bhattacharya and Iulian Neamtiu and Christian R. Shelton",
   title = "Automated, Highly-Accurate, Bug Assignment Using Machine Learning and Tossing Graphs",
   journal = "Journal of Systems and Software",
   journalabbr = "JSS",
   year = 2012,
   volume = 85,
   number = 10,
   month = oct,
   pages = "2275--2292",
}