Compact Convolutional Transformers on Edge TPUs

Sun, Yipeng; Kist, Andreas M.

doi:10.1007/978-3-658-41657-7_32

Yipeng Sun⁸ &
Andreas M. Kist⁸

Part of the book series: Informatik aktuell ((INFORMAT))

Included in the following conference series:

BVM Workshop

685 Accesses

Abstract

Medical image processing on edge devices is the key to local and efficient data processing. In the last decade, convolutional neural networks (CNNs) have dominated and achieved top performance in various medical imaging applications. However, CNNs are limited in their performance due to their inability to understand long-distance spatial relationships. The recently proposed vision transformer (ViT) learns long-distance spatial relationships of images based on self-attention, but these require large datasets for training. Hence, ViT-based architectures can be combined with CNNs to solve this problem. Yet, their use of edge devices has been barely explored. In this work, we investigate compact convolutional transformers (CCTs) and their ability to be deployed to edge devices. Using strategic design decisions, we were able to deploy CCT to Google Edge TPUs. In comparison to a reference CNN (ResNet50) that was also deployed to the Edge TPU, we reduce the model parameters by a factor of 35 and obtain a 7× inference time speed-up while obtaining competitive accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: EUR 29.95; Price includes VAT (Germany)

eBook: EUR 86.99; Price includes VAT (Germany)

Softcover Book: EUR 109.99; Price includes VAT (Germany)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications

EdgeViT: Efficient Visual Modeling for Edge Computing

Energy-Efficient 3D Convolution Using Interposed Memory Accelerator eXtension 2 for Medical Image Processing

References

Cao K, Liu Y, Meng G, Sun Q. An overview on edge computing research. IEEE access. 2020;8:85714–28.
Google Scholar
Dong P, Ning Z, Obaidat MS, Jiang X, Guo Y, Hu X et al. Edge computing based healthcare systems: enabling decentralized health monitoring in internet of medical things. IEEE Network. 2020;34(5):254–61.
Google Scholar
Sun Y, Kist AM. Deep learning on edge TPUs. 2021.
Google Scholar
Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T et al. An image is worth 16x16 words: transformers for image recognition at scale. ar** the big data paradigm with compact transformers. ar**v preprint ar**v:2104.05704. 2021.
Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T et al. Mobilenets: efficient convolutional neural networks for mobile vision applications. ar**v preprint ar**v:1704.04861. 2017.
Valanarasu JMJ, Patel VM. UNeXt: MLP-based rapid medical image segmentation network. ar**v preprint ar**v:2203.04967. 2022.
Zagoruyko S, Komodakis N. Paying more attention to attention: improving the performance of convolutional neural networks via attention transfer. ar**v preprint ar**v:1612.03928. 2016.
apolanco3225. Medical MNIST classification. https://github.com/apolanco3225/Medical-MNIST-Classification. 2017.
Kermany DS, Goldbaum M, Cai W, Valentim CC, Liang H, Baxter SL et al. Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell. 2018;172(5):1122–31.
Google Scholar
Al-Dhabyani W, Gomaa M, Khaled H, Fahmy A. Dataset of breast ultrasound images. Data Brief. 2020;28:104863.
Google Scholar
Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D. Grad-cam: visual explanations from deep networks via gradient-based localization. Proceedings of the IEEE international conference on computer vision. 2017:618–26.
Google Scholar

Download references

Author information

Authors and Affiliations

Department Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
Yipeng Sun & Andreas M. Kist

Authors

Yipeng Sun
View author publications
You can also search for this author in PubMed Google Scholar
Andreas M. Kist
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andreas M. Kist .

Editor information

Editors and Affiliations

Peter L. Reichertz Institut für Medizinische, Informatik der TU Braunschweig und der Medizinischen Hochschule Hannover, Braunschweig, Niedersachsen, Deutschland
Thomas M. Deserno
Institut für Medizinische Informatik, Universität zu Lübeck, Lübeck, Schleswig-Holstein, Deutschland
Heinz Handels
Lehrstuhl für Mustererkennung, Friedrich-Alexander-Universität, Erlangen, Bayern, Deutschland
Andreas Maier
Medical Image Computing, E230, Deutsches Krebsforschungszentrum (DKFZ), Heidelberg, Baden-Württemberg, Deutschland
Klaus Maier-Hein
Fakultät für Informatik und Mathematik, Ostbayerische Technische Hochschule Regensburg, Regensburg, Deutschland
Christoph Palm
Institut für Medizinische Informatik, Charité – Universitätsmedizin Berlin, Berlin, Berlin, Deutschland
Thomas Tolxdorff

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sun, Y., Kist, A.M. (2023). Compact Convolutional Transformers on Edge TPUs. In: Deserno, T.M., Handels, H., Maier, A., Maier-Hein, K., Palm, C., Tolxdorff, T. (eds) Bildverarbeitung für die Medizin 2023. BVM 2023. Informatik aktuell. Springer Vieweg, Wiesbaden. https://doi.org/10.1007/978-3-658-41657-7_32

Download citation

DOI: https://doi.org/10.1007/978-3-658-41657-7_32
Published: 02 June 2023
Publisher Name: Springer Vieweg, Wiesbaden
Print ISBN: 978-3-658-41656-0
Online ISBN: 978-3-658-41657-7
eBook Packages: Computer Science and Engineering (German Language)

Publish with us

Policies and ethics

Compact Convolutional Transformers on Edge TPUs

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications

EdgeViT: Efficient Visual Modeling for Edge Computing

Energy-Efficient 3D Convolution Using Interposed Memory Accelerator eXtension 2 for Medical Image Processing

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Compact Convolutional Transformers on Edge TPUs

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications

EdgeViT: Efficient Visual Modeling for Edge Computing

Energy-Efficient 3D Convolution Using Interposed Memory Accelerator eXtension 2 for Medical Image Processing

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation