Iterative learning for maxillary sinus segmentation based on bounding box annotations

Xu, **nli; Wang, Kaidong; Wang, Chengze; Chen, Ruihao; Zhu, Fudong; Long, Haixia; Guan, Qiu

doi:10.1007/s11042-023-16544-x

Iterative learning for maxillary sinus segmentation based on bounding box annotations

Published: 16 September 2023

Volume 83, pages 33263–33293, (2024)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

**nli Xu¹,
Kaidong Wang¹,
Chengze Wang²,
Ruihao Chen¹,
Fudong Zhu²,
Haixia Long¹ &
…
Qiu Guan¹

263 Accesses
Explore all metrics

Abstract

An accurate segmentation of the maxillary sinus (MS) is helpful for preoperative planning of dental implantation, diagnosis and evaluation of sinusitis, and validation of radiotherapy for sinus cancer. Many medical image segmentation models based on convolutional neural networks have achieved excellent performance, however, relied heavily on manual accurate labeling of training data. We propose an iterative learning method for MS segmentation with only bounding box supervision. First, a cone-beam computed tomography (CBCT) image is over-segmented into a set of superpixels and a feature extraction network is optimized to better extract multi-scale features of each small-size superpixel. Second, an improved graph convolutional network (IGCN) is developed to merge superpixel regions and improve the feature transformation ability of each node on a superpixel-wise graph. Finally, the iterative learning combined with the superpixel-conditional random field and IGCN makes pseudo labels gradually refine and close to fully supervised information. On a practical MS dataset, the proposed method achieves 90.5% in Dice similarity coefficient. Extending to the public dataset Promise12 for prostate MR image segmentation, it also performs well. The results show that our proposed method has good comprehensive weakly supervised segmentation performance and can narrow a gap between the bounding box and full supervision.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price includes VAT (Germany)

Instant access to the full article PDF.

Institutional subscriptions

Convolutional neural network for automatic maxillary sinus segmentation on cone-beam computed tomographic images

Article Open access 07 May 2022

Deep learning-based fully automatic segmentation of the maxillary sinus on cone-beam computed tomographic images

Article Open access 17 August 2022

Comparison of 2D, 2.5D, and 3D segmentation networks for maxillary sinuses and lesions in CBCT images

Article Open access 15 November 2023

Data availability

The data from stomatology hospital that support the findings of this study are available from the corresponding author (Fudong Zhu) upon reasonable request and with permission of stomatology hospital. The data from stomatology hospital are not publicly available, because they contain protected patient privacy information. Another data set is publicly available at the MICCAI2012 prostate MR segmentation challenge (https://promise12.grand-challenge.org/)

Notes

References

Achanta R, Shaji A, Smith K, Lucchi A, Fua P, Süsstrunk S (2012) SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Trans Pattern Anal Mach Intell 34(11):2274–2282. https://doi.org/10.1109/TPAMI.2012.120
Article PubMed Google Scholar
Aksoy EA, Özden SU, Karaarslan E, Ünal ÖF, Tanyeri H (2014) Reliability of high-pitch ultra-low-dose paranasal sinus computed tomography for evaluating paranasal sinus anatomy and sinus disease. J Craniofac Surg 25(5):1801–1804. https://doi.org/10.1097/SCS.0000000000000966
Article PubMed Google Scholar
Amanda FG, Thiago D, Cristina YM et al (2019) Development and validation of a formula based on maxillary sinus measurements as a tool for sex estimation: a cone beam computed tomography study. Int J Legal Med 133(4):1241–1249. https://doi.org/10.1007/s00414-018-1869-6
Article Google Scholar
Amirkhani D, Bastanfard A (2019) Inpainted image quality evaluation based on saliency map features, in: 5th Iranian conference on signal processing and intelligent systems (ICSPIS), Shahrood, Iran, December 18-19, 2019: 1–6 https://doi.org/10.1109/ICSPIS48872.2019.9066140.
Andersen TN, Darvann TA, Murakami S et al (2018) Accuracy and precision of manual segmentation of the maxillary sinus in MR images - a method study. Br J Radiol 91(1085):1085. https://doi.org/10.1259/bjr.20170663
Article Google Scholar
Bastanfard A, Amirkhani D, Mohammadi M (2022) Toward image super-resolution based on local regression and nonlocal means. Multimed Tools Appl 81:23473–23492. https://doi.org/10.1007/s11042-022-12584-x
Article Google Scholar
Berberi A, Bouserhal L, Nader N, Assaf RB, Nassif NB, Bouserhal J, Salameh Z (2015) Evaluation of three-dimensional volumetric changes after sinus floor augmentation with mineralized cortical bone allograft. J Maxillofac Oral Surg 14(3):624–629. https://doi.org/10.1007/s12663-014-0736-3
Article PubMed Google Scholar
Bui NL, Ong SH, Foong K (2015) Automatic segmentation of the nasal cavity and paranasal sinuses from cone-beam CT images. Int J Comput Assist Radiol Surg 10(8):1269–1277. https://doi.org/10.1007/s11548-014-1134-5
Article PubMed Google Scholar
Cha JY, Yoon HI, Yeo IS et al (2021) Panoptic segmentation on panoramic radiographs: deep learning-based segmentation of various structures including maxillary sinus and mandibular canal. J Clin Med 10(12):2577. https://doi.org/10.3390/jcm10122577
Article PubMed PubMed Central Google Scholar
Chen L C, Papandreou G, Kokkinos I, Murphy K, Yuille A L (2015) Semantic image segmentation with deep convolutional nets and fully connected CRFs. In: 3rd international conference on learning representations (ICLR), San Diego, CA, May 7-9, 2015
Choi H, Jeon KJ, Kim YH et al (2022) Deep learning-based fully automatic segmentation of the maxillary sinus on cone-beam computed tomographic images. Sci Rep 12:14009. https://doi.org/10.1038/s41598-022-18436-w
Article ADS CAS PubMed PubMed Central Google Scholar
Cui F, Feng R, Wang L, Wei L (2021) Joint superpixel segmentation and graph convolutional network road extraction for high-resolution remote sensing imagery. In: IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Brussels, Belgium, July 11–16, 2021: 2178–2181. https://doi.org/10.1109/IGARSS47720.2021.9554635.
Demir UL, Akca ME, Ozpar R et al (2015) Anatomical correlation between existence of concha bullosa and maxillary sinus volume. Surg Radiol Anat 37(9):1093–1098. https://doi.org/10.1007/s00276-015-1459-y
Article PubMed Google Scholar
Di S, Zhao Y, Liao M, Yang Z, Zeng Y (2022) Automatic liver tumor segmentation from CT images using hierarchical iterative superpixels and local statistical features. Expert Syst Appl 203:117347. https://doi.org/10.1016/j.eswa.2022.117347
Article Google Scholar
Dosovitskiy A, Beyer L, Kolesnikov A et al (2021) An image is worth 16×16 words: transformers for image recognition at scale. In: 9th international conference on learning representations (ICRL), virtual event, Austria, May 3-7, 2021
Elineide S, Rodrigo MSV, Kelson RTA et al (2022) Semi-automatic segmentation of skin lesions based on superpixels and hybrid texture information. Med Image Anal 77:102363. https://doi.org/10.1016/j.media.2022.102363
Article Google Scholar
Gugulothu V K, Balaji S (2023) An early prediction and classification of lung nodule diagnosis on CT images based on hybrid deep learning techniques. Multimed Tools Appl https://doi.org/10.1007/s11042-023-15802-2
Guilherme G, Menegatti P, Carrasco A et al (2018) Computed tomography-based volumetric tool for standardized measurement of the maxillary sinus. PLoS One 13(1):e0190770. https://doi.org/10.1371/journal.pone.0190770
Article CAS Google Scholar
Han K, Wang Y, Guo J, Tang Y, Wu E (2022) Vision GNN: an image is worth graph of nodes. In: Advances in Neural Information Processing Systems (NeurIPS), New Orleans, November 28–December 9, 2022, vol, 35, pp 8291–8303
He K, Zhang X, Ren S, Sun J (2016). Deep residual learning for image recognition. In: conference on computer vision and pattern recognition (CVPR), Las Vegas, NV, USA, June 27-30, 2016: 770-778. https://doi.org/10.1109/CVPR.2016.90
Hong Y, Zhang G, Wei B et al (2022) Weakly supervised semantic segmentation for skin cancer via CNN superpixel region response. Multimed Tools Appl 81:38409–38427. https://doi.org/10.1007/s11042-022-13606-4
Article Google Scholar
Hu H, Ji D, Gan W et al (2020) Class-wise dynamic graph convolution for semantic segmentation. In: Computer Vision - ECCV 2020, 16th European Conference, Glasgow, UK, August 23–28, 2020, 12362: 1–17. https://doi.org/10.1007/978-3-030-58520-4_1
Huang Q, Huang Y, Luo Y, Yuan F, Li X (2020) Segmentation of breast ultrasound image with semantic classification of superpixels. Med Image Anal 61:101657. https://doi.org/10.1016/j.media.2020.101657
Article PubMed Google Scholar
Hung KF, Ai QYH, King AD et al (2022) Automatic detection and segmentation of morphological changes of the maxillary sinus mucosa on cone-beam computed tomography images using a three-dimensional convolutional neural network 26: 3987–3998. https://doi.org/10.1007/s00784-021-04365-x
Imtiaz T, Rifat S, Fattah SA, Wahid KA (2020) Automated brain tumor segmentation based on multi-planar superpixel level features extracted from 3D MR images. IEEE Access 8:25335–25349. https://doi.org/10.1109/ACCESS.2019.2961630
Article Google Scholar
Iqbal MJ, Bajwa UI, Gilanie G et al (2022) Automatic brain tumor segmentation from magnetic resonance images using superpixel-based approach. Multimed Tools Appl 81:38409–38427. https://doi.org/10.1007/s11042-022-13166-7
Article Google Scholar
Jampani V, Sun D, Liu M-Y, Yang M-H, Kautz J (2018) Superpixel sampling networks. In: proceedings of the European conference on computer vision (ECCV), Munich, Germany, September 8-14, 2018: 363-380. https://doi.org/10.1007/978-3-030-01234-2\_22
**da-Apiraksa A, Ongt S H, Hiew L T et al (2010) A segmentation technique for maxillary sinus using the 3-D level set method. In: TENCON 2009–2009 IEEE region 10 conference, Singapore, January 23-26, 2009: 1–6. https://doi.org/10.1109/TENCON.2009.5396044.
Joachims T, Finley T, Yu C-NJ (2009) Cutting-plane training of structural SVMs. Mach Learn 77(1):27–59. https://doi.org/10.1007/s10994-009-5108-8
Article Google Scholar
Jung SK, Lim HK, Lee S et al (2021) Deep active learning for automatic segmentation of maxillary sinus lesions using a convolutional neural network. Diagnostics 11(4):688. https://doi.org/10.3390/diagnostics11040688
Article PubMed PubMed Central Google Scholar
Kervadec H, Dolz J, Wang S, Granger E, Ayed IB (2020) Bounding boxes for weakly supervised segmentation: global constraints get close to full supervision. In: Proceedings of the Third Conference on Medical Imaging with Deep Learning (PMLR), July 6–8, 2020, Montreal, QC, Canada, vol 121, pp 365–381
Keustermans W, Huysmans T, Schmelzer B et al (2019) Matlab toolbox for semi-automatic segmentation of the human nasal cavity based on active shape modeling. Comput Biol Med 105:27–38. https://doi.org/10.1016/j.compbiomed.2018.12.008
Article PubMed Google Scholar
Khosravanian A, Rahmanimanesh M, Keshavarzi P, Mozaffari S (2021) Fast level set method for glioma brain tumor segmentation based on superpixel fuzzy clustering and lattice boltzmann method. Comput Methods Prog Biomed 198:105809. https://doi.org/10.1016/j.cmpb.2020.105809
Article Google Scholar
Kipf TN, Welling M (2017) Semi-supervised classification with graph convolutional networks. In: international conference on learning representations (ICLR), Toulon, France, April 24-26, 2017
Lafferty J, McCallum A, Pereira F (2001) Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of the 18th International Conference on Machine Learning (ICML), Williams College, Williamstown, MA, June 28–July 1, 2001, pp 282–289
Li Z, Chen J (2015) Superpixel segmentation using linear spectral clustering. In: proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), Boston, MA, USA, June 7-12, 2015: 1356-1363. https://doi.org/10.1109/CVPR.2015.7298741
Li S, He J, Wang Y et al (2018) Blind CT image quality assessment via deep learning strategy: initial study. In: medical imaging 2018: image perception, observer performance, and technology assessment, Houston, Texas, United States, February 10-15, 2018, 10577: 105771A. https://doi.org/10.1117/12.2293240
Ling H, Gao J, Kar A, Chen W, Fidler S (2019) Fast interactive object annotation with curve-GCN. In: proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), Long Beach, CA, USA, June 16-20, 2019: 5257-5266. https://doi.org/10.1109/CVPR.2019.00540
Ma F, Gao F, Sun J, Zhou H, Hussain A (2019) Attention graph convolution network for image segmentation in big SAR imagery data. Remote Sens 11(21):2586. https://doi.org/10.3390/rs11212586
Article ADS Google Scholar
Mahani GK, Li R, Evangelou N et al (2022) Bounding box based weakly supervised deep convolutional neural network for medical image segmentation using an uncertainty guided and spatially constrained loss. In: 19th international symposium on biomedical imaging (ISBI). Kolkata, India, march 28-31, 2022: 1-5. https://doi.org/10.1109/ISBI52829.2022.9761558
Martins A, Figueiredo M, Aguiar P, Smith N, **ng E (2015) AD³: alternating directions dual decomposition for map inference in graphical models. J Mach Learn Res 16:495–545. https://doi.org/10.5555/2789272.2789288
Article MathSciNet Google Scholar
Milletari F, Navab N, Ahmadi SA (2016) V-net: fully convolutional neural networks for volumetric medical image segmentation. In: proceedings of the IEEE fourth international conference on 3D vision (3DV), Stanford, CA, USA, October 25-28, 2016: 565-571. https://doi.org/10.1109/3DV.2016.79
Morgan N, Van Gerven A, Smolders A et al (2022) Convolutional neural network for automatic maxillary sinus segmentation on cone-beam computed tomographic images. Sci Rep 12:7523. https://doi.org/10.1038/s41598-022-11483-3
Article ADS CAS PubMed PubMed Central Google Scholar
Qu H, Wu P, Huang Q, Yi J, Yan Z et al (2020) Weakly supervised deep nuclei segmentation using partial points annotation in histopathology images. IEEE Trans Med Imaging 39(11):3655–3666. https://doi.org/10.1109/TMI.2020.3002244
Article PubMed Google Scholar
Rajchl M, Lee MCH, Oktay O et al (2017) DeepCut: object segmentation from bounding box annotations using convolutional neural networks. IEEE Trans Med Imaging 36(2):674–683. https://doi.org/10.1109/TMI.2016.2621185
Article PubMed Google Scholar
Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation, in: medical image computing and computer-assisted intervention, MICCAI 2015, 18th international conference Munich, Germany, October 5-9, 2015: 234–241. https://doi.org/10.1007/978-3-319-24574-4\_28
Rother C, Kolmogorov V, Blake A (2004) “GrabCut”: interactive foreground extraction using iterated graph cuts. ACM Trans Graph 23(3):309–314. https://doi.org/10.1145/1015706.1015720
Article Google Scholar
Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):888–905. https://doi.org/10.1109/34.868688
Article Google Scholar
Shi H, Scarfe WC, Farman AG (2006) Maxillary sinus 3D segmentation and reconstruction from cone beam CT data sets. Int J CARS 1(2):83–89. https://doi.org/10.1007/s11548-006-0041-9
Article Google Scholar
Tang W, Qiu G (2021) Dense graph convolutional neural networks on 3D meshes for 3D object segmentation and classification. Image Vis Comput 114:10425. https://doi.org/10.1016/j.imavis.2021.104265
Article Google Scholar
Tolstikhin I O, Houlsby N, Kolesnikov A et al (2021) MLP-Mixer: an all-MLP architecture for vision. In: Annual Conference on Neural Information Processing Systems (NeurIPS), December 6–14, 2021, virtual, pp 24261–24272
Valvano G, Leo A, Tsaftaris SA (2021) Learning to segment from scribbles using multi-scale adversarial attention gates. IEEE Trans Med Imaging 40(8):1990–2001. https://doi.org/10.1109/TMI.2021.3069634
Article PubMed Google Scholar
Veličković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y (2018) Graph attention networks. In international conference on learning representations (ICLR), Vancouver, BC, Canada, April 30 - May 3, 2018
Wei J, Hu Y, Li G, Cui S et al (2022) BoxPolyp: boost generalized polyp segmentation using extra coarse bounding box annotations. In: medical image computing and computer assisted intervention (MICCAI), Singapore, September 18-22, 2022, 13433: 67-77. https://doi.org/10.1007/978-3-031-16437-8\_7
Xu J, Wang S, Zhou Z, Liu J, Chen X (2020) Automatic CT image segmentation of maxillary sinus based on VGG network and improved V-net. Int J Comput Assist Radiol Surg 15(9):1457–1465. https://doi.org/10.1007/s11548-020-02228-6
Article PubMed Google Scholar

Download references

Acknowledgments

This work was supported partly by National Natural Science Foundation of China (No. 62106225 and No. U20A20171), partly by Natural Science Foundation of Zhejiang Province (No. LY21F020027), partly by Zhejiang Province Public Welfare Technology Application Research Project (No. LGG20F020017), and partly by Key Programs for Science and Technology Development of Zhejiang Province (No. 2022C03113).

Author information

Authors and Affiliations

School of Computer Science and Technology, Zhejiang University of Technology, Hangzhou, 310023, China
**nli Xu, Kaidong Wang, Ruihao Chen, Haixia Long & Qiu Guan
Stomatology Hospital, School of Stomatology, Zhejiang University School of Medicine, Clinical Research Center for Oral Diseases of Zhejiang Province, Key Laboratory of Oral Biomedical Research of Zhejiang Province, Cancer Center of Zhejiang University, Hangzhou, 310006, China
Chengze Wang & Fudong Zhu

Authors

**nli Xu
View author publications
You can also search for this author in PubMed Google Scholar
Kaidong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Chengze Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ruihao Chen
View author publications
You can also search for this author in PubMed Google Scholar
Fudong Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Haixia Long
View author publications
You can also search for this author in PubMed Google Scholar
Qiu Guan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Fudong Zhu or Haixia Long.

Ethics declarations

Conflict of interests

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Xu, X., Wang, K., Wang, C. et al. Iterative learning for maxillary sinus segmentation based on bounding box annotations. Multimed Tools Appl 83, 33263–33293 (2024). https://doi.org/10.1007/s11042-023-16544-x

Download citation

Received: 02 November 2022
Revised: 06 August 2023
Accepted: 13 August 2023
Published: 16 September 2023
Issue Date: March 2024
DOI: https://doi.org/10.1007/s11042-023-16544-x

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price includes VAT (Germany)

Instant access to the full article PDF.

Institutional subscriptions

Iterative learning for maxillary sinus segmentation based on bounding box annotations

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Convolutional neural network for automatic maxillary sinus segmentation on cone-beam computed tomographic images

Deep learning-based fully automatic segmentation of the maxillary sinus on cone-beam computed tomographic images

Comparison of 2D, 2.5D, and 3D segmentation networks for maxillary sinuses and lesions in CBCT images

Data availability

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Conflict of interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Iterative learning for maxillary sinus segmentation based on bounding box annotations

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Convolutional neural network for automatic maxillary sinus segmentation on cone-beam computed tomographic images

Deep learning-based fully automatic segmentation of the maxillary sinus on cone-beam computed tomographic images

Comparison of 2D, 2.5D, and 3D segmentation networks for maxillary sinuses and lesions in CBCT images

Data availability

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Conflict of interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation