Intelligent Masking: Deep Q-Learning for Context Encoding in Medical Image Analysis

Bahrami, Mojtaba; Ghorbani, Mahsa; Yeganeh, Yousef; Navab, Nassir

doi:10.1007/978-3-031-21014-3_40

Mojtaba Bahrami^12,13,
Mahsa Ghorbani^12,13,
Yousef Yeganeh¹² &
…
Nassir Navab^12,14

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13583))

Included in the following conference series:

International Workshop on Machine Learning in Medical Imaging

1241 Accesses

Abstract

The need for a large amount of labeled data in the supervised setting has led recent studies to utilize self-supervised learning to pre-train deep neural networks using unlabeled data. Many self-supervised training strategies have been investigated especially for medical datasets to leverage the information available in the much fewer unlabeled data. One of the fundamental strategies in image-based self-supervision is context prediction. In this approach, a model is trained to reconstruct the contents of an arbitrary missing region of an image based on its surroundings. However, the existing methods adopt a random and blind masking approach by focusing uniformly on all regions of the images. This approach results in a lot of unnecessary network updates that cause the model to forget the rich extracted features. In this work, we develop a novel self-supervised approach that occludes targeted regions to improve the pre-training procedure. To this end, we propose a reinforcement learning-based agent which learns to intelligently mask input images through deep Q-learning. We show that training the agent against the prediction model can significantly improve the semantic features extracted for downstream classification tasks. We perform our experiments on two public datasets for diagnosing breast cancer in the ultrasound images and detecting lower-grade glioma with MR images. In our experiments, we show that our novel masking strategy advances the learned features according to the performance on the classification task in terms of accuracy, macro F1, and AUROC.

M. Bahrami and M. Ghorbani—Equal contribution.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Improving Medical Image Classification in Noisy Labels Using only Self-supervised Pretraining

Region-of-Interest Guided Supervoxel Inpainting for Self-supervision

Semi-supervised medical image classification via increasing prediction diversity

Article 15 August 2022

References

Al-Dhabyani, W., Gomaa, M., Khaled, H., Fahmy, A.: Dataset of breast ultrasound images. Data Brief 28, 104863 (2020)
Article Google Scholar
Bai, W., et al.: Self-supervised learning for cardiac MR image segmentation by anatomical position prediction. In: Shen, D., et al. (eds.) MICCAI 2019. LNCS, vol. 11765, pp. 541–549. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-32245-8_60
Bengio, Y., Lamblin, P., Popovici, D., Larochelle, H.: Greedy layer-wise training of deep networks. Adv. Neural Inform. Process. Syst. 19 (2006)
Google Scholar
Buda, M., Saha, A., Mazurowski, M.A.: Association of genomic subtypes of lower-grade gliomas with shape features automatically extracted by a deep learning algorithm. Comput. Biol. Med. 109, 218–225 (2019)
Article Google Scholar
Chen, L., Bentley, P., Mori, K., Misawa, K., Fujiwara, M., Rueckert, D.: Self-supervised learning for medical image analysis using image context restoration. Med. image Anal. 58, 101539 (2019)
Article Google Scholar
Da, K.: A method for stochastic optimization. ar**v preprint ar**v:1412.6980 (2014)
Doersch, C., Gupta, A., Efros, A.A.: Unsupervised visual representation learning by context prediction. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1422–1430 (2015)
Google Scholar
Gidaris, S., Singh, P., Komodakis, N.: Unsupervised representation learning by predicting image rotations. ar**v preprint ar**v:1803.07728 (2018)
Hester, T., et al.: Deep q-learning from demonstrations. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32 (2018)
Google Scholar
Jamaludin, A., Kadir, T., Zisserman, A.: Self-supervised learning for spinal MRIs. In: Cardoso, M.J., et al. (eds.) DLMIA/ML-CDS -2017. LNCS, vol. 10553, pp. 294–302. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-67558-9_34
Li, C.L., Sohn, K., Yoon, J., Pfister, T.: Cutpaste: self-supervised learning for anomaly detection and localization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9664–9674 (2021)
Google Scholar
Liao, R., et al.: An artificial agent for robust image registration. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 31 (2017)
Google Scholar
Liao, X., et al.: Iteratively-refined interactive 3d medical image segmentation with multi-agent reinforcement learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9394–9402 (2020)
Google Scholar
Mazurowski, M.A., Clark, K., Czarnek, N.M., Shamsesfandabadi, P., Peters, K.B., Saha, A.: Radiogenomics of lower-grade glioma: algorithmically-assessed tumor shape is associated with tumor genomic subtypes and patient outcomes in a multi-institutional study with the cancer genome atlas data. J. Neuro-oncol. 133(1), 27–35 (2017)
Article Google Scholar
Noroozi, M., Favaro, P.: Unsupervised learning of visual representations by solving jigsaw puzzles. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9910, pp. 69–84. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46466-4_5
Pathak, D., Krahenbuhl, P., Donahue, J., Darrell, T., Efros, A.A.: Context encoders: feature learning by inpainting. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2536–2544 (2016)
Google Scholar
Tajbakhsh, N., et al.: Surrogate supervision for medical image analysis: Effective deep learning from limited quantities of labeled data. In: 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), pp. 1251–1255. IEEE (2019)
Google Scholar
Vlontzos, A., Alansary, A., Kamnitsas, K., Rueckert, D., Kainz, B.: Multiple landmark detection using multi-agent reinforcement learning. In: Shen, D., et al. (eds.) MICCAI 2019. LNCS, vol. 11767, pp. 262–270. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-32251-9_29
Watkins, C.J., Dayan, P.: Q-learning. Mach. Learn. 8(3), 279–292 (1992)
Google Scholar
Watkins, C.J.C.H.: Learning from delayed rewards (1989)
Google Scholar
**ong, J., et al.: Edge-sensitive left ventricle segmentation using deep reinforcement learning. Sensors 21(7), 2375 (2021)
Google Scholar
Zhang, M., Xu, J., Abaci Turk, E., Grant, P.E., Golland, P., Adalsteinsson, E.: Enhanced detection of fetal pose in 3D MRI by deep reinforcement learning with physical structure priors on anatomy. In: Martel, A.L., et al. (eds.) MICCAI 2020. LNCS, vol. 12266, pp. 396–405. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59725-2_38
Zhang, P., Wang, F., Zheng, Y.: Self supervised deep representation learning for fine-grained body part recognition. In: 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017), pp. 578–582. IEEE (2017)
Google Scholar
Zhou, S.K., Le, H.N., Luu, K., Nguyen, H.V., Ayache, N.: Deep reinforcement learning in medical imaging: a literature review. Med. Image Anal. 73, 102193 (2021)
Article Google Scholar
Zhuang, X., Li, Y., Hu, Y., Ma, K., Yang, Y., Zheng, Y.: Self-supervised feature learning for 3d medical images by playing a rubik’s cube. In: Shen, D., et al. (eds.) MICCAI 2019. LNCS, vol. 11767, pp. 420–428. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-32251-9_46

Download references

Author information

Authors and Affiliations

Computer Aided Medical Procedures, Technical University of Munich, Munich, Germany
Mojtaba Bahrami, Mahsa Ghorbani, Yousef Yeganeh & Nassir Navab
Sharif University of Technology, Tehran, Iran
Mojtaba Bahrami & Mahsa Ghorbani
Whiting School of Engineering, Johns Hopkins University, Baltimore, USA
Nassir Navab

Authors

Mojtaba Bahrami
View author publications
You can also search for this author in PubMed Google Scholar
Mahsa Ghorbani
View author publications
You can also search for this author in PubMed Google Scholar
Yousef Yeganeh
View author publications
You can also search for this author in PubMed Google Scholar
Nassir Navab
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mahsa Ghorbani .

Editor information

Editors and Affiliations

**'an Jiaotong University, **'an, China
Chunfeng Lian
Shanghai United Imaging Intelligence Co., Ltd., Shanghai, China
**aohuan Cao
Istanbul Technical University, Istanbul, Turkey
Islem Rekik
Rensselaer Polytechnic Institute, Troy, NY, USA
Xuanang Xu
ShanghaiTech University, Pudong, China
Zhiming Cui

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 1133 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bahrami, M., Ghorbani, M., Yeganeh, Y., Navab, N. (2022). Intelligent Masking: Deep Q-Learning for Context Encoding in Medical Image Analysis. In: Lian, C., Cao, X., Rekik, I., Xu, X., Cui, Z. (eds) Machine Learning in Medical Imaging. MLMI 2022. Lecture Notes in Computer Science, vol 13583. Springer, Cham. https://doi.org/10.1007/978-3-031-21014-3_40

Download citation

DOI: https://doi.org/10.1007/978-3-031-21014-3_40
Published: 16 December 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-21013-6
Online ISBN: 978-3-031-21014-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Intelligent Masking: Deep Q-Learning for Context Encoding in Medical Image Analysis

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Improving Medical Image Classification in Noisy Labels Using only Self-supervised Pretraining

Region-of-Interest Guided Supervoxel Inpainting for Self-supervision

Semi-supervised medical image classification via increasing prediction diversity

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (pdf 1133 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Intelligent Masking: Deep Q-Learning for Context Encoding in Medical Image Analysis

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Improving Medical Image Classification in Noisy Labels Using only Self-supervised Pretraining

Region-of-Interest Guided Supervoxel Inpainting for Self-supervision

Semi-supervised medical image classification via increasing prediction diversity

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (pdf 1133 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation