Abstract
Shadow detection is essential for scene understanding and image restoration. Existing paradigms for producing shadow detection training data usually rely on densely labeling each image pixel, which will lead to a bottleneck when scaling up the number of images. To tackle this problem, by labeling shadow images with only a few strokes, this paper designs a learning framework for Weakly supervised Shadow Detection, namely WSD. Firstly, it creates two shadow detection datasets with scribble annotations, namely Scr-SBU and Scr-ISTD. Secondly, it proposes an uncertainty-guided label augmentation scheme based on graph convolutional networks, which can propagate the sparse scribble annotations to more reliable regions, and then avoid the model converging to an undesired local minima as intra-class discontinuity. Finally, it introduces a multi-task learning framework to jointly learn for shadow detection and edge detection, which encourages generated shadow maps to be comprehensive and well aligned with shadow boundaries. Experimental results on benchmark datasets demonstrate that our framework even outperforms existing semi-supervised and fully supervised shadow detectors requiring only 2% pixels to be labeled.
Similar content being viewed by others
Data availability
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Achanta, R., Shaji, A., Smith, K., Lucchi, A., Fua, P., Süsstrunk, S.: SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Trans. Pattern Anal. Mach. Intell. 34(11), 2274–2282 (2012)
Al-Amaren, A., Ahmad, M.O., Swamy, M.: A low-complexity residual deep neural network for image edge detection. Appl. Intell. 53(9), 11282–11299 (2022)
Al-Huda, Z., Peng, B., Algburi, R.N.A., Alfasly, S., Li, T.: Weakly supervised pavement crack semantic segmentation based on multi-scale object localization and incremental annotation refinement. Appl. Intell. 53(11), 14527–14546 (2023)
Chen, X.D., Wu, W., Yang, W., Qin, H., Wu, X., Mao, X.: Make segment anything model perfect on shadow detection. IEEE Trans. Geosci. Remote Sens. 61, 1–13 (2023)
Chen, Z., Zhu, L., Wan, L., Wang, S., Feng, W., Heng, P.A.: A multi-task mean teacher for semi-supervised shadow detection. In: Proceedings of Computer Vision and Pattern Recognition, 5611–5620. IEEE (2020)
Codella, N., Rotemberg, V., Tschandl, P., Celebi, M.E., Dusza, S., Gutman, D., Helba, B., Kalloo, A., Liopyris, K., Marchetti, M., et al.: Skin lesion analysis toward melanoma detection 2018: A challenge hosted by the international skin imaging collaboration (isic). ar**v preprint ar**v:1902.03368 (2019)
Cucchiara, R., Grana, C., Piccardi, M., Prati, A., Sirotti, S.: Improving shadow suppression in moving object detection with hsv color information. In: Proceedings of Intelligent Transportation Systems, 334–339. IEEE (2001)
Ecins, A., Fermüller, C., Aloimonos, Y.: Shadow free segmentation in still images using local density measure. In: Proceedings of International Conference on Computational Photography, 1–8. IEEE (2014)
Ge, Y., Zhou, Q., Wang, X., Shen, C., Wang, Z., Li, H.: Point-teaching: weakly semi-supervised object detection with point annotations. In: Proceedings of the AAAI Conference on Artificial Intelligence, 37, 667–675 (2023)
Guan, Y.P.: Wavelet multi-scale transform based foreground segmentation and shadow elimination. Open Signal Process. J. 1, 1–6 (2008)
Gulshan, V., Rother, C., Criminisi, A., Blake, A., Zisserman, A.: Geodesic star convexity for interactive image segmentation. In: Proceedings of Computer Vision and Pattern Recognition, 3129–3136. IEEE (2010)
Guo, M.H., Lu, C.Z., Hou, Q., Liu, Z., Cheng, M.M., Hu, S.M.: Segnext: Rethinking convolutional attention design for semantic segmentation. In: Proceedings of Advances in Neural Information Processing Systems. MIT Press (2022)
Guo, R., Dai, Q., Hoiem, D.: Single-image shadow detection and removal using paired regions. In: Proceedings of Computer Vision and Pattern Recognition, 2033–2040. IEEE (2011)
He, R., Dong, Q., Lin, J., Lau, R.W.: Weakly-supervised camouflaged object detection with scribble annotations. In: Proceedings of the AAAI Conference on Artificial Intelligence, 37, 781–789 (2023)
Hu, X., Wang, T., Fu, C.W., Jiang, Y., Wang, Q., Heng, P.A.: Revisiting shadow detection: a new benchmark dataset for complex world. IEEE Trans. Image Process. 30, 1925–1934 (2021)
Hu, X., Zhu, L., Fu, C.W., Qin, J., Heng, P.A.: Direction-aware spatial context features for shadow detection. In: Proceedings of Computer Vision and Pattern Recognition, 7454–7462. IEEE (2018)
Huang, X., Hua, G., Tumblin, J., Williams, L.: What characterizes a shadow boundary under the sun and sky? In: Proceedings of International Conference on Computer Vision, 898–905. IEEE (2011)
Joshi, I., Kothari, R., Utkarsh, A., Kurmi, V.K., Dantcheva, A., Roy, S.D., Kalra, P.K.: Explainable fingerprint roi segmentation using monte carlo dropout. In: Proceedings of Winter Conference on Applications of Computer Vision, 60–69 (2021)
Junejo, I.N., Foroosh, H.: Estimating geo-temporal location of stationary cameras using shadow trajectories. In: Proceedings of European Conference on Computer Vision, 318–331. Springer (2008)
Kang, S., Kim, J., Jang, I.S., Lee, B.D.: C2shadowgan: cycle-in-cycle generative adversarial network for shadow removal using unpaired data. Appl. Intell. 53(12), 15067–15079 (2023)
Karsch, K., Hedau, V., Forsyth, D., Hoiem, D.: Rendering synthetic objects into legacy photographs. ACM Trans. Graph. 30(6), 1–12 (2011)
Kendall, A., Gal, Y.: What uncertainties do we need in bayesian deep learning for computer vision? In: Proceedings of Advances in Neural Information Processing Systems. MIT Press (2017)
Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. In: Proceedings of International Conference on Learning Representation, 1–14 (2017)
Kittler, J.: On the accuracy of the Sobel edge detector. Image Vis. Comput. 1(1), 37–42 (1983)
Krähenbühl, P., Koltun, V.: Efficient inference in fully connected crfs with gaussian edge potentials. In: Proceedings of Advances in Neural Information Processing Systems. MIT Press (2011)
Lalonde, J.F., Efros, A.A., Narasimhan, S.G.: Detecting ground shadows in outdoor consumer photographs. In: Proceedings of European Conference on Computer Vision, 322–335. Springer (2010)
Lalonde, J.F., Efros, A.A., Narasimhan, S.G.: Estimating the natural illumination conditions from a single outdoor image. Int. J. Comput. Vis. 98(2), 123–145 (2012)
Le, H., Vicente, T.F.Y., Nguyen, V., Hoai, M., Samaras, D.: A+d net: Training a shadow detector with adversarial shadow attenuation. In: Proceedings of European Conference on Computer Vision, 662–678. Springer (2018)
Liu, D., Cui, Y., Yan, L., Mousas, C., Yang, B., Chen, Y.: Densernet: Weakly supervised visual localization using multi-scale feature aggregation. In: Proceedings of the AAAI Conference on Artificial Intelligence, 35, 6101–6109 (2021)
Liu, Y., Cheng, M.M., Hu, X., Wang, K., Bai, X.: Richer convolutional features for edge detection. In: Proceedings of Computer Vision and Pattern Recognition, 3000–3009. IEEE (2017)
Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., Guo, B.: Swin transformer: Hierarchical vision transformer using shifted windows. In: Proceedings of International Conference on Computer Vision, 10012–10022. IEEE (2021)
Luo, D., Liu, G., Bavirisetti, D.P., Cao, Y.: Infrared and visible image fusion based on VPDE model and VGG network. Appl. Intell. 53(21), 24739–24764 (2023)
Meng, Q., Sinclair, M., Zimmer, V., Hou, B., Rajchl, M., Toussaint, N., Oktay, O., Schlemper, J., Gomez, A., Housden, J., et al.: Weakly supervised estimation of shadow confidence maps in fetal ultrasound imaging. IEEE Trans. Med. Imaging 38(12), 2755–2767 (2019)
Mikic, I., Cosman, P.C., Kogut, G.T., Trivedi, M.M.: Moving shadow and object detection in traffic scenes. In: Proceedings of International Conference on Pattern Recognition, P1, 321–324. IEEE (2000)
Nguyen, V., Yago Vicente, T.F., Zhao, M., Hoai, M., Samaras, D.: Shadow detection with conditional generative adversarial networks. In: Proceedings of International Conference on Computer Vision, 4510–4518. IEEE (2017)
Okabe, T., Sato, I., Sato, Y.: Attached shadow coding: Estimating surface normals from shadows under unknown reflectance and lighting conditions. In: Proceedings of International Conference on Computer Vision, 1693–1700. IEEE (2009)
Panagopoulos, A., Samaras, D., Paragios, N.: Robust shadow and illumination estimation using a mixture model. In: Proceedings of Computer Vision and Pattern Recognition, 651–658. IEEE (2009)
Pei, J., Tang, H., Wang, W., Cheng, T., Chen, C.: Salient instance segmentation with region and box-level annotations. Neurocomputing 507, 332–344 (2022)
Pu, M., Huang, Y., Liu, Y., Guan, Q., Ling, H.: Edter: Edge detection with transformer. In: Proceedings of Computer Vision and Pattern Recognition, 1402–1412. IEEE (2022)
Suykens, J.A., Vandewalle, J.: Least squares support vector machine classifiers. Neural Process. Lett. 9(3), 293–300 (1999)
Tan, M., Le, Q.: Efficientnet: Rethinking model scaling for convolutional neural networks. In: Proceedings of International Conference on Machine Learning, 6105–6114. PMLR (2019)
Tang, M., Djelouah, A., Perazzi, F., Boykov, Y., Schroers, C.: Normalized cut loss for weakly-supervised cnn segmentation. In: Proceedings of Computer Vision and Pattern Recognition, 1818–1827. IEEE (2018)
Unal, O., Dai, D., Van Gool, L.: Scribble-supervised lidar semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2697–2707 (2022)
Vicente, T.F.Y.: Large-scale weakly-supervised shadow detection. Ph.D. thesis, State University of New York at Stony Brook (2018)
Vicente, T.F.Y., Hoai, M., Samaras, D.: Leave-one-out kernel optimization for shadow detection and removal. IEEE Trans. Pattern Anal. Mach. Intell. 40(3), 682–695 (2017)
Vicente, T.F.Y., Hou, L., Yu, C.P., Hoai, M., Samaras, D.: Large-scale training of shadow detectors with noisily-annotated shadow examples. In: Proceedings of European Conference on Computer Vision, 816–832. Springer (2016)
Wang, J., Li, X., Yang, J.: Stacked conditional generative adversarial networks for jointly learning shadow detection and shadow removal. In: Proceedings of Computer Vision and Pattern Recognition, 1788–1797. IEEE (2018)
Wang, Y., Yang, Y., Yang, Z., Zhao, L., Wang, P., Xu, W.: Occlusion aware unsupervised learning of optical flow. In: Proceedings of Computer Vision and Pattern Recognition, 4884–4893. IEEE (2018)
Wu, L., Cao, X., Foroosh, H.: Camera calibration and geo-location estimation from two shadow trajectories. Comput. Vis. Image Underst. 114(8), 915–927 (2010)
Wu, W., Chen, X.D., Yang, W., Yong, J.H.: Exploring better target for shadow detection. Knowl.-Based Syst. 273, 110614 (2023)
Wu, W., Wu, X., Wan, Y.: Single-image shadow removal using detail extraction and illumination estimation. Vis. Comput. 38(5), 1677–1687 (2022)
Wu, W., Yang, W., Ma, W., Chen, X.D.: How many annotations do we need for generalizing new-coming shadow images? IEEE Transactions on Circuits and Systems for Video Technology 1–12 (2023)
Wu, W., Zhang, S., Tian, M., Tan, D., Wu, X., Wan, Y.: Learning to detect soft shadow from limited data. Vis. Comput. 38(5), 1665–1675 (2022)
Wu, W., Zhang, S., Zhou, K., Yang, J., Wu, X., Wan, Y.: Shadow removal via dual module network and low error shadow dataset. Comput. Graph. 95, 156–163 (2021)
**e, E., Wang, W., Yu, Z., Anandkumar, A., Alvarez, J.M., Luo, P.: Segformer: Simple and Efficient Design for Semantic Segmentation with Transformers. MIT Press, Cambridge (2021)
**e, S., Girshick, R., Dollár, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. In: Proceedings of Computer Vision and Pattern Recognition, 1492–1500. IEEE (2017)
Xu, B., Liang, H., Gong, W., Liang, R., Chen, P.: A visual representation-guided framework with global affinity for weakly supervised salient object detection. IEEE Trans. Circ. Syst. Video Technol. 1–12 (2023)
Yang, W., Wu, W., Chen, X.D., Tao, X., Mao, X.: How to use extra training data for better edge detection? Appl. Intell. 1–15 (2023)
Ye, S., Chen, D., Han, S., Liao, J.: Learning with noisy labels for robust point cloud segmentation. In: Proceedings of International Conference on Computer Vision, 6443–6452 (2021)
Zhang, B., **ao, J., Jiao, J., Wei, Y., Zhao, Y.: Affinity attention graph neural network for weakly supervised semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 44(11), 8082–8096 (2022)
Zhang, J., Yu, X., Li, A., Song, P., Liu, B., Dai, Y.: Weakly-supervised salient object detection via scribble annotations. In: Proceedings of Computer Vision and Pattern Recognition, 12546–12555 (2020)
Zheng, Q., Qiao, X., Cao, Y., Lau, R.W.: Distraction-aware shadow detection. In: Proceedings of Computer Vision and Pattern Recognition, 5167–5176. IEEE (2019)
Zheng, S., Lu, J., Zhao, H., Zhu, X., Luo, Z., Wang, Y., Fu, Y., Feng, J., **ang, T., Torr, P.H., et al.: Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers. In: Proceedings of Computer Vision and Pattern Recognition, 6881–6890. IEEE (2021)
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., Torralba, A.: Learning deep features for discriminative localization. In: Proceedings of Computer Vision and Pattern Recognition, 2921–2929. IEEE (2016)
Zhu, J., Samuel, K.G., Masood, S.Z., Tappen, M.F.: Learning to recognize shadows in monochromatic natural images. In: Proceedings of Computer Vision and Pattern Recognition, 223–230. IEEE (2010)
Zhu, L., Deng, Z., Hu, X., Fu, C.W., Xu, X., Qin, J., Heng, P.A.: Bidirectional feature pyramid network with recurrent attention residual modules for shadow detection. In: Proceedings of European Conference on Computer Vision, 121–136. Springer (2018)
Zhu, L., Xu, K., Ke, Z., Lau, R.W.: Mitigating intensity bias in shadow detection via feature decomposition and reweighting. In: Proceedings of International Conference on Computer Vision, 4702–4711. IEEE (2021)
Zhu, Y., Fu, X., Cao, C., Wang, X., Sun, Q., Zha, Z.J.: Single image shadow detection via complementary mechanism. In: Proceedings of the ACM International Conference on Multimedia, 6717–6726 (2022)
Funding
The work was supported by the National Natural Science Foundation of China (61972120), the General Research Project of Zhejiang Provincial Department of Education (Y202044861), the Zhejiang Provincial Natural Science Foundation (Regional Innovation Joint Funds) (LQZSZ24E050001).
Author information
Authors and Affiliations
Contributions
WW was involved in conceptualization, writing—original draft, writing—review & editing. HC helped in validation, methodology, project administration, funding acquisition. X-DC contributed to visualization, formal analysis, data curation. WY and XM helped in supervision.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethics approval
The work follows appropriate ethical standards in conducting research and writing the manuscript. This work presents computational models trained with publicly available data, for which no ethical approval was required.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Chen, H., Chen, XD., Wu, W. et al. Annotate less but perform better: weakly supervised shadow detection via label augmentation. Vis Comput (2024). https://doi.org/10.1007/s00371-024-03278-6
Accepted:
Published:
DOI: https://doi.org/10.1007/s00371-024-03278-6