An overview of structural coverage metrics for testing neural networks

  • Special Section: Explanation Paradigms Leveraging Analytic Intuition

International Journal on Software Tools for Technology Transfer

Abstract

Deep neural network (DNN) models, including those used in safety-critical domains, need to be thoroughly tested to ensure that they perform reliably across different scenarios. In this article, we provide an overview of structural coverage metrics for testing DNN models, including neuron coverage, k-multisection neuron coverage, top-k neuron coverage, neuron boundary coverage, strong neuron activation coverage, and modified condition/decision coverage. We evaluate the metrics on realistic DNN models used for perception tasks (LeNet-1, LeNet-4, LeNet-5, ResNet20), including networks used in autonomy (TaxiNet). We also provide a tool, DNNCov, which can measure the testing coverage for all these metrics. DNNCov outputs an informative coverage report to enable researchers and practitioners to assess the adequacy of DNN testing, to compare different coverage measures, and to more conveniently inspect the model’s internals during testing.
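
To make the metrics concrete, the following is a minimal sketch in Python/NumPy (not the DNNCov implementation) of how two of them, neuron coverage and k-multisection neuron coverage, can be computed once layer activations have been extracted into a matrix of shape (num_inputs, num_neurons). The activation threshold, the bucket count k, and the per-neuron training-time ranges are illustrative assumptions, not values taken from the article.

# Illustrative sketch only: neuron coverage and k-multisection neuron
# coverage over a pre-extracted activation matrix. Threshold, k, and the
# per-neuron ranges [low, high] are assumed values for demonstration.
import numpy as np


def neuron_coverage(acts, threshold=0.5):
    # A neuron is covered if at least one input drives it above the threshold.
    covered = (acts > threshold).any(axis=0)
    return covered.mean()


def k_multisection_coverage(acts, low, high, k=10):
    # Each neuron's training-time range [low, high] is split into k equal
    # buckets; the metric is the fraction of (neuron, bucket) pairs hit.
    num_neurons = acts.shape[1]
    hit = np.zeros((num_neurons, k), dtype=bool)
    span = np.where(high > low, high - low, 1.0)
    buckets = np.clip(np.floor((acts - low) / span * k).astype(int), 0, k - 1)
    in_range = (acts >= low) & (acts <= high)
    for n in range(num_neurons):
        hit[n, buckets[in_range[:, n], n]] = True
    return hit.mean()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    acts = rng.random((100, 32))            # 100 test inputs, 32 neurons
    low, high = np.zeros(32), np.ones(32)   # assumed training-time ranges
    print("Neuron coverage:        ", neuron_coverage(acts))
    print("k-multisection coverage:", k_multisection_coverage(acts, low, high))

The remaining metrics discussed in the article are computed from similar per-neuron activation statistics, so the same bookkeeping style carries over.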

Notes

  1. https://github.com/DNNCov/DNNCov

Author information

Correspondence to Corina S. Păsăreanu.

Cite this article

Usman, M., Sun, Y., Gopinath, D. et al. An overview of structural coverage metrics for testing neural networks. Int J Softw Tools Technol Transfer 25, 393–405 (2023). https://doi.org/10.1007/s10009-022-00683-x
