Log in

Improving detection and classification of diabetic retinopathy using CUDA and Mask RCNN

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

Diabetic retinopathy (DR) is an eye disease caused by diabetes and can progress to certain degrees. Because DR’s the final stage can cause blindness, early detection is crucial to prevent visual disturbances. With the development of GPU technology, image classification and object detection can be done faster. Particularly on medical images, these processes play an important role in disease detection. In this work, we improved our previous work to detect diabetic retinopathy using Faster RCNN and attention layer. In the detection phase, firstly non-used area of DR image was extracted using compute unified device architecture with gradient-based edge detection method. Then Mask RCNN was used instead of faster region-based convolutional neural networks (Faster RCNN) to detect lesion areas more successful. With the proposed method, more successful results were obtained than the our previous work in DenseNet, MobileNet and ResNet networks. In addition, more successful results were obtained than other works in the literature in ACC and AUC metrics obtained by using VGG19.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data availability

Previously reported diabetic retinopathy datasets were used to support this study and are available at https://www.adcis.net/en/third-party/messidor/, https://www.kaggle.com/c/diabetic-retinopathy-detection/data, https://www.it.lut.fi/project/imageret/diaretdb0/, https://www.it.lut.fi/project/imageret/diaretdb1/, and https://ieee-dataport.org/open-access/indian-diabetic-retinopathy-image-dataset-idrid. These datasets are cited at relevant places within the text as references [41–44].

References

  1. Doganay, S., Evereklioglu, C., Er, H., et al.: Comparison of serum NO, TNF-\(\alpha \), IL-1\(\beta \), sIL-2R, IL-6 and IL-8 levels with grades of retinopathy in patients with diabetes mellitus. Eye 16, 163–170 (2002). https://doi.org/10.1038/sj.eye.6700095

    Article  Google Scholar 

  2. **feng, G., Qummar, S., Junming, Z., Ruxian, Y., Khan, F.G.: Ensemble framework of deep CNNs for diabetic retinopathy detection. Comput. Intell. Neurosci. (2020). https://doi.org/10.1155/2020/8864698

    Article  Google Scholar 

  3. Lee, D.K., Oh, S.J.: Variable block size motion estimation implementation on compute unified device architecture (CUDA). In: IEEE International Conference on Consumer Electronics (ICCE), vol. 2013, 633–634 (2013). https://doi.org/10.1109/ICCE.2013.6487048

  4. Lecun, Y., Bengio, Y.: Convolutional networks for images, speech, and time-series. In: Arbib, M.A. (Ed.) The Handbook of Brain Theory and Neural Networks. MIT Press (1995)

  5. Krizhevsky, A.: Convolutional Neural Networks for Object Classification in CUDA, EECE1742S: Programming Massively Parallel Multiprocessors Using CUDA (2009)

  6. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet Classification with Deep Convolutional Neural Networks. NIPS’12 Curran Associates Inc., USA (2012)

  7. Szegedy, C., Liu, W., Jia, Y., et al.: Going deeper with convolutions. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–9, Boston, MA, USA (2015)

  8. Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2818–2826, Las Vegas, NV, USA (2016)

  9. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: The 3rd International Conference on Learning Representations (ICLR2015) (2015). ar**v:1409.1556

  10. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778, Las Vegas, NV, USA (2016)

  11. Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA (2017)

  12. Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587, Columbus, OH, USA (2014)

  13. Girshick, R.B.: Fast R-CNN, Tech. Rep., CoRR (2015). ar**v:1504.08083

  14. Wang, L., Wang, H., Huang, Y., Yan, B., Chang, Z., Liu, Z., Zhao, M., Cui, L., Song, J., Li, F.: Trends in the application of deep learning networks in medical image analysis: evolution between 2012 and 2020. Eur. J. Radiol. 146, 66 (2022)

    Article  Google Scholar 

  15. Wei, H., Sehgal, A., Kehtarnavaz, N.: A deep learning-based smartphone app for real-time detection of retinal abnormalities in fundus images. In: Real-Time Image Processing and Deep Learning 2019, vol. 10996, p. 1099602. International Society for Optics and Photonics (2019)

  16. Hacisoftaoglu, R.E., Karakaya, M., Sallam, A.B.: Deep learning frameworks for diabetic retinopathy detection with smartphone-based retinal imaging systems. Pattern Recognit. Lett. 135, 409–417 (2020)

    Article  Google Scholar 

  17. Erciyas, A., Barışçı, N.: An effective method for detecting and classifying diabetic retinopathy lesions based on deep learning. Comput. Math. Methods Med. 34, 66 (2021)

    Google Scholar 

  18. Toğaçar, M.: Detection of retinopathy disease using morphological gradient and segmentation approaches in fundus images. Comput. Methods Programs Biomed. 214, 106579 (2022). https://doi.org/10.1016/j.cmpb.2021.106579

    Article  Google Scholar 

  19. Bhuiyan, A., Govindaiah, A., Deobhakta, A., Hossain, M., Rosen, R., Smith, T.: Automated diabetic retinopathy screening for primary care settings using deep learning. Intell. Based Med. 5, 100045 (2021)

    Article  Google Scholar 

  20. Zhang, C., Lei, T., Chen, P.: Diabetic retinopathy grading by a source-free transfer learning approach. Biomed. Signal Process. Control 73, 103423 (2022)

    Article  Google Scholar 

  21. Gulshan, V., Peng, L., Coram, M., Stumpe, M.C., Wu, D., Narayanaswamy, A., et al.: Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316(22), 2402–2410 (2016)

    Article  Google Scholar 

  22. Costa, P., Campilho, A.: Convolutional bag of words for diabetic retinopathy detection from eye fundus images. IPSJ Trans. Comput. Vision Appl. 9(1), 10 (2017)

    Article  Google Scholar 

  23. Gargeya, R., Leng, T.: Automated identification of diabetic retinopathy using deep learning. Ophthalmology 124(7), 962–969 (2017)

    Article  Google Scholar 

  24. Wang, Z., Yin, Y., Shi, J., Fang, W., Li, H., Wang, X.: Zoom-in-net: deep mining lesions for diabetic retinopathy detection. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 267–275. Springer (2017)

  25. Chen, Y.-W., Wu, T.-Y., Wong, W.-H., Lee, C.-Y.: Diabetic retinopathy detection based on deep convolutional neural networks. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1030–1034. IEEE (2018)

  26. Mansour, R.F.: Deep-learning-based automatic computer-aided diagnosis system for diabetic retinopathy. Biomed. Eng. Lett. 8(1), 41–57 (2018)

    Article  Google Scholar 

  27. Quellec, G., Charrière, K., Boudi, Y., Cochener, B., Lamard, M.: Deep image mining for diabetic retinopathy screening. Med Image Anal. 39, 178–193 (2017). https://doi.org/10.1016/j.media.2017.04.012

    Article  Google Scholar 

  28. Gutiérrez, J., Che, Z., Zhai, G., Callet, P.L.: Saliency4ASD: challenge, dataset and tools for visual attention modeling for autism spectrum disorder. Signal Process. Image Commun. 92, 116092 (2021)

    Article  Google Scholar 

  29. Cai, Y., Droste, R., Sharma, H., Chatelain, P., Drukker, L., Papageorghiou, A.T., Noble, J.A.: Spatio-temporal visual attention modelling of standard biometry plane-finding navigation Med. Image Anal. 65, 101762 (2020)

    Article  Google Scholar 

  30. Obeso, A.M., Benois-Pineau, J., Vázquez, M.S.G., Acosta, A.Á.R.: Visual vs internal attention mechanisms in deep neural networks for image classification and object detection. Pattern Recognit. 123, 108411 (2022)

  31. Zhao, B., Wu, X., Feng, J., Peng, Q., Yan, S.: Diversified visual attention networks for fine-grained object classification. IEEE Trans. Multimed. 19(6), 1245–1256 (2017). https://doi.org/10.1109/TMM.2017.2648498

    Article  Google Scholar 

  32. Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A., Salakhutdinov, R., Zemel, R., Bengio, Y.: Show, attend and tell: neural image caption generation with visual attention. IEEE Trans. Multimed. 19(6), 66 (2017)

    Google Scholar 

  33. Liu, S., Wang, Q., Li, X.: Attention based network for remote sensing scene classification. In: IGARSS 2018—2018 IEEE International Geoscience and Remote Sensing Symposium, pp. 4740–4743 (2018). https://doi.org/10.1109/IGARSS.2018.8519232.

  34. Cao, C., et al.: Look and think twice: capturing top-down visual attention with feedback convolutional neural networks. In: IEEE International Conference on Computer Vision (ICCV), vol. 2015, pp. 2956–2964 (2015). https://doi.org/10.1109/ICCV.2015.338

  35. Redmon, J., Divvala, S., Girshick,R., Farhadi, A.: ’You only look once: unified real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788 (2016)

  36. Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C. Y. et al.: ’Ssd: single shot multibox detector. In: European Conference on Computer Vision, pp. 21–37 (2016)

  37. Ren, S., He, K., Girshick, R., Sun, J.: Faster r-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems (2015), pp. 91–99

  38. He, K., Gkioxari, G., Dollar, P., Girshick, R.: Mask R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV) (2017), pp. 2961–2969

  39. Kamble, V.V., Kokate, R.D.: Automated diabetic retinopathy detection using radial basis function. Procedia Comput. Sci. 167, 799–808 (2020)

    Article  Google Scholar 

  40. Porwal, P., Pachade, S., Kamble , R., et al.: Indian diyabetic retinopathy image dataset (IDRid). In: IEEE Dataport (2018)

  41. Kaggle Diabetic Retinopathy Detection Competition: https://www.kaggle.com/c/diabetic-retinopathy-detection

  42. Decencière, E., Zhang, X., Cazuguel, G., et al.: Feedback on a publicly distributed image database: the Messidor database. Image Anal. Stereol. 33(3), 231–234 (2014)

    Article  MATH  Google Scholar 

  43. Díaz-Pernil, D., Berciano, A., Peña-Cantillana, F., Gutiérrez-Naranjob, M.A.: Segmenting images with gradient-based edge detection using membrane computing. Pattern Recognit. Lett. 34(8), 846–855 (2013)

    Article  Google Scholar 

  44. Erciyas, A., Erdal, E., Ergüzen, A.: Medical image processing with CUDA. In: International Advanced Researches & Engineering Congress (IAREC’17) (2017). ISBN: 978-605-245-037-6

Download references

Acknowledgements

We thank the editors, reviewers, and all cited authors.

Funding

No funds, grants, or other support was received.

Author information

Authors and Affiliations

Authors

Contributions

N.B. and A.E. designed the project, main conceptual ideas and project outline. A.E. solved all the technical details and did the numerical calculations for the proposed method and got the results. N.B., H.M.Ü., and H.P. took the lead in writing the article. A.E and N.B. wrote the article template. H.M.Ü and H.P. helped moderate the article. All authors provided critical feedback and assisted in research, analysis, and finalization of the manuscript. All authors discussed the results and contributed to the final article.

Corresponding author

Correspondence to Abdüssamed Erciyas.

Ethics declarations

Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Erciyas, A., Barışçı, N., Ünver, H.M. et al. Improving detection and classification of diabetic retinopathy using CUDA and Mask RCNN. SIViP 17, 1265–1273 (2023). https://doi.org/10.1007/s11760-022-02334-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-022-02334-9

Keywords

Navigation