Medical Image Segmentation Using Transformer

Wang, Qian; Li, Longyan; Ni, Bo; Li, Yu; Kong, De**; Wang, Chen; Li, Zan

doi:10.1007/978-981-16-9423-3_12

Qian Wang⁴²,
Longyan Li⁴²,
Bo Ni⁴³,
Yu Li⁴²,
De** Kong⁴²,
Chen Wang⁴² &
…
Zan Li⁴⁴

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 854))

1639 Accesses

Abstract

For the past few years, the U-Net structure shows strong performance in the field of medical image segmentation. However, due to the inherent locality of convolution operations, U-shaped structures are often limited in modeling long-range dependencies. Transformer, a global self-attention mechanism designed for sequence-to-sequence prediction, has been successfully used in the field of computer vision. In this paper, we propose a novel network, named TransHarDNet. HarDNet, which is a low memory traffic CNN. We combine it as backbone with Transformer. Our network enables the global semantic context information and low-level spatial details of the input image to be captured more effectively. We evaluate the effectiveness of the proposed network on five medical image datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Softcover Book: USD 249.99; Price excludes VAT (USA)

Hardcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

ConTrans: Improving Transformer with Convolutional Attention for Medical Image Segmentation

UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation

SegNetr: Rethinking the Local-Global Interactions and Skip Connections in U-Shaped Networks

References

Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W., Frangi, A. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Zhou, Z., Siddiquee, M.M.R., Tajbakhsh, N., Liang, J.: UNet++: a nested U-Net architecture for medical image segmentation. In: IEEE TMI, pp. 3–11 (2019)
Google Scholar
Jha, D., et al.: ResUNet++: an advanced architecture for medical image segmentation. In: IEEE ISM, pp. 225–230 (2019)
Google Scholar
Oktay, O., et al.: Attention U-Net: Learning Where to Look for the Pancreas. Ar**v Preprint Ar**v:1804.03999 (2018)
Dosovitskiy, A., et al.: An image is worth 16\(\,\times \,\)16 words: transformers for image recognition at scale. In: ICLR (2021)
Google Scholar
ImageNet. http://image-net.org
Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., Zagoruyko, S.: End-to-end object detection with transformers. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.M. (eds.) ECCV 2020. LNCS, vol. 12346, pp. 213–229. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8_13
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: IEEE TPAMI, pp. 1137–1149 (2017)
Google Scholar
Zheng, S., et al.: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers. Ar**v Preprint Ar**v:2012.15840 (2020)
Chen, J., et al.: TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation. Ar**v Preprint Ar**v:2102.04306 (2021)
Fan, D.P., et al.: PraNet: parallel reverse attention network for polyp segmentation. In: Martel, A.L. et al. (eds.) MICCAI 2020. LNCS, vol. 12266, pp. 263–273 . Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59725-2_26
Huang, C.H., Wu, H.Y., Lin, Y.L.: HarDNet-MSEG: a simple encoder-decoder polyp segmentation neural network that achieves over 0.9 mean dice and 86 FPS. Ar**v Preprint Ar**v:2101.07172 (2021)
Chao, P., Kao, C.-Y., Ruan, Y., Huang, C.-H., Lin, Y.-L.: HarDNet: a low memory traffic network. In: IEEE/CVF ICCV, pp. 3552–3561 (2019)
Google Scholar
Huang, G., Liu, Z., van der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: IEEE CVPR, pp. 2261–2269 (2017)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: IEEE CVPR, pp. 770–778 (2016)
Google Scholar
Liu, S., Huang, D., Wang, Y.: Receptive field block net for accurate and fast object detection. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11215, pp. 385–400. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01252-6_24
Wu, Z., Su, L., Huang, Q.: Cascaded partial decoder for fast and accurate salient object detection. In: IEEE CVPR, pp. 3907–3916 (2019)
Google Scholar
Jha, D., et al.: Kvasir-SEG: a segmented polyp dataset. In: Ro, Y., et al. (eds.) MMM 2020. LNCS, vol. 11962, pp. 451–462. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-37734-2_37
Tajbakhsh, N., Gurudu, S.R., Liang, J.: Automated polyp detection in colonoscopy videos using shape and context information. IEEE TMI 35(2), 630–644 (2015)
Google Scholar
Silva, J., Histace, A., Romain, O., Dray, X., Granado, B.: Toward embedded detection of polyps in WCE images for early diagnosis of colorectal cancer. Int. J. Comput. Assist. Radiol. Surg. 9(2), 283–293 (2013). https://doi.org/10.1007/s11548-013-0926-3
Article Google Scholar
V\(\acute{a}\)zquez, D., et al.: A benchmark for endoluminal scene segmentation of colonoscopy images. J. Healthc. Eng. 2017, 1–10 (2017)
Google Scholar
Bernal, J., S\(\acute{a}\)nchez, F.J., Fern\(\acute{a}\)ndez-Esparrach, G., Gil, D., Rodr\(\acute{i}\)guez, C., Vilari\(\tilde{n}\)o, F.: WM-DOVA maps for accurate polyp highlighting in colonoscopy: validation vs. saliency maps from physicians. In: CMIG, vol. 43, pp. 99–111 (2015)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Electronic and Electrical Engineering, Wuhan Textile University, Wuhan, 430074, China
Qian Wang, Longyan Li, Yu Li, De** Kong & Chen Wang
Computer School of Hubei Polytechnic University, Huangshi, 435003, China
Bo Ni
International College Bei**g, China Agricultural University, Bei**g, 100089, China
Zan Li

Authors

Qian Wang
View author publications
You can also search for this author in PubMed Google Scholar
Longyan Li
View author publications
You can also search for this author in PubMed Google Scholar
Bo Ni
View author publications
You can also search for this author in PubMed Google Scholar
Yu Li
View author publications
You can also search for this author in PubMed Google Scholar
De** Kong
View author publications
You can also search for this author in PubMed Google Scholar
Chen Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zan Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qian Wang .

Editor information

Editors and Affiliations

Department of Electrical Engineering, University of Texas at Arlington, Arlington, TX, USA
Qilian Liang
Tian** Normal University, Tian**, China
Wei Wang
Tian** Normal University, Tian**, China
Jiasong Mu
Dalian University of Technology, Dalian, China
**n Liu
School of Information Science and Technology, Dalian Maritime University, Dalian, China
Zhenyu Na

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, Q. et al. (2022). Medical Image Segmentation Using Transformer. In: Liang, Q., Wang, W., Mu, J., Liu, X., Na, Z. (eds) Artificial Intelligence in China. Lecture Notes in Electrical Engineering, vol 854. Springer, Singapore. https://doi.org/10.1007/978-981-16-9423-3_12

Download citation

DOI: https://doi.org/10.1007/978-981-16-9423-3_12
Published: 22 March 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-9422-6
Online ISBN: 978-981-16-9423-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Medical Image Segmentation Using Transformer

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

ConTrans: Improving Transformer with Convolutional Attention for Medical Image Segmentation

UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation

SegNetr: Rethinking the Local-Global Interactions and Skip Connections in U-Shaped Networks

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Medical Image Segmentation Using Transformer

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

ConTrans: Improving Transformer with Convolutional Attention for Medical Image Segmentation

UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation

SegNetr: Rethinking the Local-Global Interactions and Skip Connections in U-Shaped Networks

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation