An Evaluation of Cross-Project Defect Prediction Approaches on Cross-Personalized Defect Prediction

  • Conference paper
  • First Online:
Product-Focused Software Process Improvement (PROFES 2022)

Abstract

Context: Just-in-time software defect prediction (JIT SDP) helps to prioritize fault-prone commits for efficient software quality assurance. As each commit can be attributed to each developer, JIT SDP can also be personalized to each developer as a personalized defect prediction. A question is whether the commit data of other developers, namely, cross-personalized data, are still valuable for prediction. Cross-project defect prediction (CPDP) approaches are a promising answer. Objective: To clarify the effectiveness of cross-personalized defect prediction with CPDP approaches. Method: An experiment with 23 CPDP approaches was conducted on 9 project datasets. Results: Some CPDP approaches using cross-personalized data were often better than the personalized defect prediction using one’s data. Conclusion: It is recommended to use the CPDP approach to achieve better predictions. Turhan09 is our recommendation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (France)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 85.59
Price includes VAT (France)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 105.49
Price includes VAT (France)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    http://doi.org/10.5281/zenodo.2594681.

References

  1. Amasaki, S., Aman, H., Yokogawa, T.: A preliminary evaluation of CPDP approaches on just-in-time software defect prediction. In: Proceedings of Euromicro Conference on Software Engineering and Advanced Applications (SEAA), pp. 279–286 (2021)

    Google Scholar 

  2. Amasaki, S., Aman, H., Yokogawa, T.: Searching for bellwether developers for cross-personalized defect prediction. In: Proceedings of Product-Focused Software Process Improvement, pp. 183–198 (2021)

    Google Scholar 

  3. Amasaki, S., Kawata, K., Yokogawa, T.: Improving cross-project defect prediction methods with data simplification. In: Proceedings of SEAA 2015, pp. 96–103. IEEE (2015)

    Google Scholar 

  4. Briand, L.C., Melo, W.L., Wust, J.: Assessing the applicability of fault-proneness models across object-oriented software projects. IEEE Trans. Softw. Eng. 28(7), 706–720 (2002)

    Article  Google Scholar 

  5. Cabral, G.G., Minku, L.L., Shihab, E., Mujahid, S.: Class imbalance evolution and verification latency in just-in-time software defect prediction. In: 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE), pp. 666–676 (2019). https://doi.org/10.1109/ICSE.2019.00076

  6. Catolino, G., Di Nucci, D., Ferrucci, F.: Cross-project just-in-time bug prediction for mobile apps: an empirical assessment. In: 2019 IEEE/ACM 6th International Conference on Mobile Software Engineering and Systems (MOBILESoft), pp. 99–110 (2019). https://doi.org/10.1109/MOBILESoft.2019.00023

  7. Eken, B., Tosun, A.: Investigating the performance of personalized models for software defect prediction. J. Syst. Softw. 181, 111038 (2021)

    Article  Google Scholar 

  8. Erika, C.C.A., Ochimizu, K.: Towards logistic regression models for predicting fault-prone code across software projects. In: Proceedings of ESEM 2009, pp. 460–463. IEEE (2009)

    Google Scholar 

  9. He, P., Li, B., Liu, X., Chen, J., Ma, Y.: An empirical study on software defect prediction with a simplified metric set. Inf. Softw. Technol. 59, 170–190 (2015)

    Article  Google Scholar 

  10. He, Z., Peters, F., Menzies, T., Yang, Y.: Learning from open-source projects: an empirical study on defect prediction. In: Proceedings of ESEM 2013, pp. 45–54. IEEE (2013)

    Google Scholar 

  11. Herbold, S.: Training data selection for cross-project defect prediction. In: Proceedings of PROMISE 2013, pp. 6:1–6:10. ACM (2013)

    Google Scholar 

  12. Herbold, S.: CrossPare: a tool for benchmarking cross-project defect predictions. In: Proceedings of ASEW 2015, pp. 90–96. IEEE (2015)

    Google Scholar 

  13. Hosseini, S., Turhan, B., Gunarathna, D.: A systematic literature review and meta-analysis on cross project defect prediction. IEEE Trans. Softw. Eng. 45(2), 111–147 (2019)

    Article  Google Scholar 

  14. Huang, Q., **a, X., Lo, D.: Revisiting supervised and unsupervised models for effort-aware just-in-time defect prediction. Empir. Softw. Eng. 24(5), 2823–2862 (2018). https://doi.org/10.1007/s10664-018-9661-2

    Article  Google Scholar 

  15. Jiang, T., Tan, L., Kim, S.: Personalized defect prediction. In: Proceedings of International Conference on Automated Software Engineering, pp. 279–289 (2013)

    Google Scholar 

  16. Kamei, Y., Fukushima, T., McIntosh, S., Yamashita, K., Ubayashi, N., Hassan, A.E.: Studying just-in-time defect prediction using cross-project models. Empir. Softw. Eng. 21(5), 2072–2106 (2015). https://doi.org/10.1007/s10664-015-9400-x

    Article  Google Scholar 

  17. Kamei, Y., et al.: A large-scale empirical study of just-in-time quality assurance. IEEE Trans. Softw. Eng. 39(6), 757–773 (2013). https://doi.org/10.1109/TSE.2012.70

    Article  Google Scholar 

  18. Kamei, Y., Shihab, E.: Defect prediction: accomplishments and future challenges. In: Proceedings of International Conference on Software Analysis, Evolution and Reengineering, pp. 33–45 (2016). https://doi.org/10.1109/SANER.2016.56

  19. Khoshgoftaar, T.M., Rebours, P., Seliya, N.: Software quality analysis by combining multiple projects and learners. Softw. Qual. J. 17(1), 25–49 (2009)

    Article  Google Scholar 

  20. Krishna, R., Menzies, T., Fu, W.: Too much automation? The bellwether effect and its implications for transfer learning. In: Proceedings of International Conference on Automated Software Engineering, pp. 122–131 (2016)

    Google Scholar 

  21. Li, W., Zhang, W., Jia, X., Huang, Z.: Effort-aware semi-supervised just-in-time defect prediction. Inf. Softw. Technol. 126, 106364 (2020). https://doi.org/10.1016/j.infsof.2020.106364

    Article  Google Scholar 

  22. Ma, Y., Luo, G., Zeng, X., Chen, A.: Transfer learning for cross-company software defect prediction. Inf. Softw. Technol. 54(3), 248–256 (2012)

    Article  Google Scholar 

  23. McIntosh, S., Kamei, Y.: Are fix-inducing changes a moving target? A longitudinal case study of just-in-time defect prediction. IEEE Trans. Softw. Eng. 44(5), 412–428 (2018). https://doi.org/10.1109/TSE.2017.2693980

    Article  Google Scholar 

  24. Menzies, T., Greenwald, J., Frank, A.: Data mining static code attributes to learn defect predictors. IEEE Trans. Softw. Eng. 33(1), 2–13 (2007)

    Article  Google Scholar 

  25. Menzies, T., Butcher, A., Marcus, A., Zimmermann, T., Cok, D.: Local versus global models for effort estimation and defect prediction. In: Proceedings of ASE 2011, pp. 343–351. IEEE (2011)

    Google Scholar 

  26. Nam, J., Kim, S.: CLAMI: defect prediction on unlabeled datasets. In: Proceedings of ASE 2015, pp. 452–463. IEEE (2015)

    Google Scholar 

  27. Nam, J., Pan, S.J., Kim, S.: Transfer defect learning. In: Proceedings of ICSE 2013, pp. 382–391. IEEE (2013)

    Google Scholar 

  28. Panichella, A., Oliveto, R., De Lucia, A.: Cross-project defect prediction models: L’Union fait la force. In: Proceedings of CSMR-WCRE 2014, pp. 164–173. IEEE (2014)

    Google Scholar 

  29. Peters, F., Menzies, T.: Privacy and utility for defect prediction: experiments with MORPH. In: Proceedings of ICSE 2012, pp. 189–199. IEEE (2012)

    Google Scholar 

  30. Peters, F., Menzies, T., Gong, L., Zhang, H.: Balancing privacy and utility in cross-company defect prediction. IEEE Trans. Softw. Eng. 39(8), 1054–1068 (2013)

    Article  Google Scholar 

  31. Peters, F., Menzies, T., Layman, L.: LACE2: better privacy-preserving data sharing for cross project defect prediction. In: Proceedings of ICSE 2015, pp. 801–811. IEEE (2015)

    Google Scholar 

  32. Rahman, F., Devanbu, P.: Ownership, experience and defects: a fine-grained study of authorship. In: Proceedings of International Conference on Software Engineering, pp. 491–500 (2011)

    Google Scholar 

  33. Ryu, D., Choi, O., Baik, J.: Value-cognitive boosting with a support vector machine for cross-project defect prediction. Empir. Softw. Eng. 21(1), 43–71 (2014). https://doi.org/10.1007/s10664-014-9346-4

    Article  Google Scholar 

  34. Ryu, D., Jang, J.I., Baik, J.: A hybrid instance selection using nearest-neighbor for cross-project defect prediction. J. Comput. Sci. Technol. 30(5), 969–980 (2015)

    Article  Google Scholar 

  35. Schröter, A., Zimmermann, T., Premraj, R., Zeller, A.: Where do bugs come from? SIGSOFT Softw. Eng. Notes 31(6) (2006)

    Google Scholar 

  36. Tabassum, S., Minku, L.L., Feng, D., Cabral, G.G., Song, L.: An investigation of cross-project learning in online just-in-time software defect prediction. In: Proceedings of International Conference on Software Engineering, New York, NY, USA, pp. 554–565 (2020). https://doi.org/10.1145/3377811.3380403

  37. Tantithamthavorn, C., McIntosh, S., Hassan, A.E., Matsumoto, K.: An empirical comparison of model validation techniques for defect prediction models. IEEE Trans. Softw. Eng. 43(1), 1–18 (2017)

    Article  Google Scholar 

  38. Turhan, B., Menzies, T., Bener, A.B., Di Stefano, J.: On the relative value of cross-company and within-company data for defect prediction. Empir. Softw. Eng. 14(5), 540–578 (2009)

    Article  Google Scholar 

  39. Wan, Z., **a, X., Hassan, A.E., Lo, D., Yin, J., Yang, X.: Perceptions, expectations, and challenges in defect prediction. IEEE Trans. Softw. Eng. 46(11), 1241–1266 (2020). https://doi.org/10.1109/TSE.2018.2877678

    Article  Google Scholar 

  40. Watanabe, S., Kaiya, H., Kaijiri, K.: Adapting a fault prediction model to allow inter language reuse. In: Proceedings of PROMISE 2008, pp. 19–24. ACM (2008)

    Google Scholar 

  41. **a, X., Lo, D., Wang, X., Yang, X.: Collective personalized change classification with multiobjective search. IEEE Trans. Reliab. 65(4), 1810–1829 (2016)

    Article  Google Scholar 

  42. Yang, X., Lo, D., **a, X., Zhang, Y., Sun, J.: Deep learning for just-in-time defect prediction. In: 2015 IEEE International Conference on Software Quality, Reliability and Security, pp. 17–26 (2015). https://doi.org/10.1109/QRS.2015.14

  43. Yang, X., Lo, D., **a, X., Sun, J.: TLEL: a two-layer ensemble learning approach for just-in-time defect prediction. Inf. Softw. Technol. 87, 206–220 (2017)

    Article  Google Scholar 

  44. Zhang, Y., Lo, D., **a, X., Sun, J.: An empirical study of classifier combination for cross-project defect prediction. In: Proceedings of COMPSAC 2015, pp. 264–269. IEEE (2015)

    Google Scholar 

  45. Zhang, Y., Lo, D., **a, X., Sun, J.: Combined classifier for cross-project defect prediction: an extended empirical study. Front. Comp. Sci. 12(2), 280–296 (2018). https://doi.org/10.1007/s11704-017-6015-y

    Article  Google Scholar 

  46. Zhou, Y., et al.: How far we have progressed in the journey? An examination of cross-project defect prediction. ACM Trans. Softw. Eng. Methodol. 27(1), 1–51 (2018)

    Article  Google Scholar 

  47. Zhu, K., Zhang, N., Ying, S., Zhu, D.: Within-project and cross-project just-in-time defect prediction based on denoising autoencoder and convolutional neural network. IET Softw. 14(3), 185–195 (2020). https://doi.org/10.1049/iet-sen.2019.0278

    Article  Google Scholar 

  48. Zimmermann, T., Nagappan, N., Gall, H., Giger, E., Murphy, B.: Cross-project defect prediction: a large scale experiment on data vs. domain vs. process. In: Proceedings of ESEC/FSE 2009, pp. 91–100. ACM (2009)

    Google Scholar 

Download references

Acknowledgment

This work was partially supported by JSPS KAKENHI Grant #21K11831, #21K11833, and Wesco Scientific Promotion Foundation.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sousuke Amasaki .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Amasaki, S., Aman, H., Yokogawa, T. (2022). An Evaluation of Cross-Project Defect Prediction Approaches on Cross-Personalized Defect Prediction. In: Taibi, D., Kuhrmann, M., Mikkonen, T., Klünder, J., Abrahamsson, P. (eds) Product-Focused Software Process Improvement. PROFES 2022. Lecture Notes in Computer Science, vol 13709. Springer, Cham. https://doi.org/10.1007/978-3-031-21388-5_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-21388-5_30

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-21387-8

  • Online ISBN: 978-3-031-21388-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation