Abstract
Context: Just-in-time software defect prediction (JIT SDP) helps to prioritize fault-prone commits for efficient software quality assurance. As each commit can be attributed to each developer, JIT SDP can also be personalized to each developer as a personalized defect prediction. A question is whether the commit data of other developers, namely, cross-personalized data, are still valuable for prediction. Cross-project defect prediction (CPDP) approaches are a promising answer. Objective: To clarify the effectiveness of cross-personalized defect prediction with CPDP approaches. Method: An experiment with 23 CPDP approaches was conducted on 9 project datasets. Results: Some CPDP approaches using cross-personalized data were often better than the personalized defect prediction using one’s data. Conclusion: It is recommended to use the CPDP approach to achieve better predictions. Turhan09 is our recommendation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Amasaki, S., Aman, H., Yokogawa, T.: A preliminary evaluation of CPDP approaches on just-in-time software defect prediction. In: Proceedings of Euromicro Conference on Software Engineering and Advanced Applications (SEAA), pp. 279–286 (2021)
Amasaki, S., Aman, H., Yokogawa, T.: Searching for bellwether developers for cross-personalized defect prediction. In: Proceedings of Product-Focused Software Process Improvement, pp. 183–198 (2021)
Amasaki, S., Kawata, K., Yokogawa, T.: Improving cross-project defect prediction methods with data simplification. In: Proceedings of SEAA 2015, pp. 96–103. IEEE (2015)
Briand, L.C., Melo, W.L., Wust, J.: Assessing the applicability of fault-proneness models across object-oriented software projects. IEEE Trans. Softw. Eng. 28(7), 706–720 (2002)
Cabral, G.G., Minku, L.L., Shihab, E., Mujahid, S.: Class imbalance evolution and verification latency in just-in-time software defect prediction. In: 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE), pp. 666–676 (2019). https://doi.org/10.1109/ICSE.2019.00076
Catolino, G., Di Nucci, D., Ferrucci, F.: Cross-project just-in-time bug prediction for mobile apps: an empirical assessment. In: 2019 IEEE/ACM 6th International Conference on Mobile Software Engineering and Systems (MOBILESoft), pp. 99–110 (2019). https://doi.org/10.1109/MOBILESoft.2019.00023
Eken, B., Tosun, A.: Investigating the performance of personalized models for software defect prediction. J. Syst. Softw. 181, 111038 (2021)
Erika, C.C.A., Ochimizu, K.: Towards logistic regression models for predicting fault-prone code across software projects. In: Proceedings of ESEM 2009, pp. 460–463. IEEE (2009)
He, P., Li, B., Liu, X., Chen, J., Ma, Y.: An empirical study on software defect prediction with a simplified metric set. Inf. Softw. Technol. 59, 170–190 (2015)
He, Z., Peters, F., Menzies, T., Yang, Y.: Learning from open-source projects: an empirical study on defect prediction. In: Proceedings of ESEM 2013, pp. 45–54. IEEE (2013)
Herbold, S.: Training data selection for cross-project defect prediction. In: Proceedings of PROMISE 2013, pp. 6:1–6:10. ACM (2013)
Herbold, S.: CrossPare: a tool for benchmarking cross-project defect predictions. In: Proceedings of ASEW 2015, pp. 90–96. IEEE (2015)
Hosseini, S., Turhan, B., Gunarathna, D.: A systematic literature review and meta-analysis on cross project defect prediction. IEEE Trans. Softw. Eng. 45(2), 111–147 (2019)
Huang, Q., **a, X., Lo, D.: Revisiting supervised and unsupervised models for effort-aware just-in-time defect prediction. Empir. Softw. Eng. 24(5), 2823–2862 (2018). https://doi.org/10.1007/s10664-018-9661-2
Jiang, T., Tan, L., Kim, S.: Personalized defect prediction. In: Proceedings of International Conference on Automated Software Engineering, pp. 279–289 (2013)
Kamei, Y., Fukushima, T., McIntosh, S., Yamashita, K., Ubayashi, N., Hassan, A.E.: Studying just-in-time defect prediction using cross-project models. Empir. Softw. Eng. 21(5), 2072–2106 (2015). https://doi.org/10.1007/s10664-015-9400-x
Kamei, Y., et al.: A large-scale empirical study of just-in-time quality assurance. IEEE Trans. Softw. Eng. 39(6), 757–773 (2013). https://doi.org/10.1109/TSE.2012.70
Kamei, Y., Shihab, E.: Defect prediction: accomplishments and future challenges. In: Proceedings of International Conference on Software Analysis, Evolution and Reengineering, pp. 33–45 (2016). https://doi.org/10.1109/SANER.2016.56
Khoshgoftaar, T.M., Rebours, P., Seliya, N.: Software quality analysis by combining multiple projects and learners. Softw. Qual. J. 17(1), 25–49 (2009)
Krishna, R., Menzies, T., Fu, W.: Too much automation? The bellwether effect and its implications for transfer learning. In: Proceedings of International Conference on Automated Software Engineering, pp. 122–131 (2016)
Li, W., Zhang, W., Jia, X., Huang, Z.: Effort-aware semi-supervised just-in-time defect prediction. Inf. Softw. Technol. 126, 106364 (2020). https://doi.org/10.1016/j.infsof.2020.106364
Ma, Y., Luo, G., Zeng, X., Chen, A.: Transfer learning for cross-company software defect prediction. Inf. Softw. Technol. 54(3), 248–256 (2012)
McIntosh, S., Kamei, Y.: Are fix-inducing changes a moving target? A longitudinal case study of just-in-time defect prediction. IEEE Trans. Softw. Eng. 44(5), 412–428 (2018). https://doi.org/10.1109/TSE.2017.2693980
Menzies, T., Greenwald, J., Frank, A.: Data mining static code attributes to learn defect predictors. IEEE Trans. Softw. Eng. 33(1), 2–13 (2007)
Menzies, T., Butcher, A., Marcus, A., Zimmermann, T., Cok, D.: Local versus global models for effort estimation and defect prediction. In: Proceedings of ASE 2011, pp. 343–351. IEEE (2011)
Nam, J., Kim, S.: CLAMI: defect prediction on unlabeled datasets. In: Proceedings of ASE 2015, pp. 452–463. IEEE (2015)
Nam, J., Pan, S.J., Kim, S.: Transfer defect learning. In: Proceedings of ICSE 2013, pp. 382–391. IEEE (2013)
Panichella, A., Oliveto, R., De Lucia, A.: Cross-project defect prediction models: L’Union fait la force. In: Proceedings of CSMR-WCRE 2014, pp. 164–173. IEEE (2014)
Peters, F., Menzies, T.: Privacy and utility for defect prediction: experiments with MORPH. In: Proceedings of ICSE 2012, pp. 189–199. IEEE (2012)
Peters, F., Menzies, T., Gong, L., Zhang, H.: Balancing privacy and utility in cross-company defect prediction. IEEE Trans. Softw. Eng. 39(8), 1054–1068 (2013)
Peters, F., Menzies, T., Layman, L.: LACE2: better privacy-preserving data sharing for cross project defect prediction. In: Proceedings of ICSE 2015, pp. 801–811. IEEE (2015)
Rahman, F., Devanbu, P.: Ownership, experience and defects: a fine-grained study of authorship. In: Proceedings of International Conference on Software Engineering, pp. 491–500 (2011)
Ryu, D., Choi, O., Baik, J.: Value-cognitive boosting with a support vector machine for cross-project defect prediction. Empir. Softw. Eng. 21(1), 43–71 (2014). https://doi.org/10.1007/s10664-014-9346-4
Ryu, D., Jang, J.I., Baik, J.: A hybrid instance selection using nearest-neighbor for cross-project defect prediction. J. Comput. Sci. Technol. 30(5), 969–980 (2015)
Schröter, A., Zimmermann, T., Premraj, R., Zeller, A.: Where do bugs come from? SIGSOFT Softw. Eng. Notes 31(6) (2006)
Tabassum, S., Minku, L.L., Feng, D., Cabral, G.G., Song, L.: An investigation of cross-project learning in online just-in-time software defect prediction. In: Proceedings of International Conference on Software Engineering, New York, NY, USA, pp. 554–565 (2020). https://doi.org/10.1145/3377811.3380403
Tantithamthavorn, C., McIntosh, S., Hassan, A.E., Matsumoto, K.: An empirical comparison of model validation techniques for defect prediction models. IEEE Trans. Softw. Eng. 43(1), 1–18 (2017)
Turhan, B., Menzies, T., Bener, A.B., Di Stefano, J.: On the relative value of cross-company and within-company data for defect prediction. Empir. Softw. Eng. 14(5), 540–578 (2009)
Wan, Z., **a, X., Hassan, A.E., Lo, D., Yin, J., Yang, X.: Perceptions, expectations, and challenges in defect prediction. IEEE Trans. Softw. Eng. 46(11), 1241–1266 (2020). https://doi.org/10.1109/TSE.2018.2877678
Watanabe, S., Kaiya, H., Kaijiri, K.: Adapting a fault prediction model to allow inter language reuse. In: Proceedings of PROMISE 2008, pp. 19–24. ACM (2008)
**a, X., Lo, D., Wang, X., Yang, X.: Collective personalized change classification with multiobjective search. IEEE Trans. Reliab. 65(4), 1810–1829 (2016)
Yang, X., Lo, D., **a, X., Zhang, Y., Sun, J.: Deep learning for just-in-time defect prediction. In: 2015 IEEE International Conference on Software Quality, Reliability and Security, pp. 17–26 (2015). https://doi.org/10.1109/QRS.2015.14
Yang, X., Lo, D., **a, X., Sun, J.: TLEL: a two-layer ensemble learning approach for just-in-time defect prediction. Inf. Softw. Technol. 87, 206–220 (2017)
Zhang, Y., Lo, D., **a, X., Sun, J.: An empirical study of classifier combination for cross-project defect prediction. In: Proceedings of COMPSAC 2015, pp. 264–269. IEEE (2015)
Zhang, Y., Lo, D., **a, X., Sun, J.: Combined classifier for cross-project defect prediction: an extended empirical study. Front. Comp. Sci. 12(2), 280–296 (2018). https://doi.org/10.1007/s11704-017-6015-y
Zhou, Y., et al.: How far we have progressed in the journey? An examination of cross-project defect prediction. ACM Trans. Softw. Eng. Methodol. 27(1), 1–51 (2018)
Zhu, K., Zhang, N., Ying, S., Zhu, D.: Within-project and cross-project just-in-time defect prediction based on denoising autoencoder and convolutional neural network. IET Softw. 14(3), 185–195 (2020). https://doi.org/10.1049/iet-sen.2019.0278
Zimmermann, T., Nagappan, N., Gall, H., Giger, E., Murphy, B.: Cross-project defect prediction: a large scale experiment on data vs. domain vs. process. In: Proceedings of ESEC/FSE 2009, pp. 91–100. ACM (2009)
Acknowledgment
This work was partially supported by JSPS KAKENHI Grant #21K11831, #21K11833, and Wesco Scientific Promotion Foundation.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Amasaki, S., Aman, H., Yokogawa, T. (2022). An Evaluation of Cross-Project Defect Prediction Approaches on Cross-Personalized Defect Prediction. In: Taibi, D., Kuhrmann, M., Mikkonen, T., Klünder, J., Abrahamsson, P. (eds) Product-Focused Software Process Improvement. PROFES 2022. Lecture Notes in Computer Science, vol 13709. Springer, Cham. https://doi.org/10.1007/978-3-031-21388-5_30
Download citation
DOI: https://doi.org/10.1007/978-3-031-21388-5_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-21387-8
Online ISBN: 978-3-031-21388-5
eBook Packages: Computer ScienceComputer Science (R0)