Abstract
When the faces of individuals are not clearly identifiable in surveillance videos due to variations in poses, camera viewpoints and occlusions, the appearances of people play a vital role in their identification. Appearance based person re-identification (re-id) summarizes appearances of persons to identify them across multiple non-overlap** camera views. Existing person re-id solutions work on the cropped person images to learn the salient features of a person instead of working on the raw surveillance images, hence these solutions need an independent preliminary phase of preparing cropped person datasets for the training and evaluation purposes. In contrast, the proposed solution works on the raw surveillance images instead of prerequisite of the cropped person dataset and the proposed hierarchical association building among various local parts of the images results in rich person representations for person re-id. In the proposed solution of Smart Surveillance with Simultaneous Person Detection and Re-identification (SSPDR), the complete surveillance video scenes are processed to perform simultaneous person detection and re-identification for all of the persons captured by a surveillance network. We use region proposals based localization scheme for person detection with an increased confidence strategy about the estimation of bounding boxes locations and the person re-identification module learns the hierarchical associations among local salient body parts of a person. Firstly, the proposed re-id module establishes associations among local horizontal strips of two persons, and afterwards it builds associations among local salient sub-patches of already associated pairs of horizontal strips. We address two major re-id challenges i.e. background noise and scale differences using the proposed re-id solution. In context of simultaneous person detection and re-identification, the proposed method is evaluated on publicly available re-id benchmark Person Re-identification in Wild (PRW) as well as on a local surveillance dataset, and attains state-of-the performance.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-022-13458-y/MediaObjects/11042_2022_13458_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-022-13458-y/MediaObjects/11042_2022_13458_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-022-13458-y/MediaObjects/11042_2022_13458_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-022-13458-y/MediaObjects/11042_2022_13458_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-022-13458-y/MediaObjects/11042_2022_13458_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-022-13458-y/MediaObjects/11042_2022_13458_Fig6_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-022-13458-y/MediaObjects/11042_2022_13458_Fig7_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-022-13458-y/MediaObjects/11042_2022_13458_Fig8_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-022-13458-y/MediaObjects/11042_2022_13458_Fig9_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11042-022-13458-y/MediaObjects/11042_2022_13458_Fig10_HTML.png)
Similar content being viewed by others
Change history
25 August 2022
Muhammad Shahzad’s email was change from muhammad.shehzad@tum.de to muhammad.shahzad@tum.de
References
Ansar W, Fraz M M, Shahzad M, Gohar I, Javed S, Jung S K (2018) Two stream deep CNN-RNN attentive pooling architecture for video-based person re-identification. In: Progress in pattern recognition, image analysis, computer vision, and applications - 23rd Iberoamerican Congress, CIARP 2018, Madrid, Spain, November 19-22, 2018, Proceedings, pp 654–661
Batool S, Ali M Z, Shahzad M, Fraz M M (2018) End to end person re-identification for automated visual surveillance. In: IEEE international conference on image processing, applications and systems, IPAS 2018, Sophia Antipolis, France, December 12-14, 2018, pp 220–225
Cheng D, Gong Y, Shi W, Zhang S (2018) Person re-identification by the asymmetric triplet and identification loss function. Multimed Tools Appl 77(3):3533–3550
Cheng D, Gong Y, Zhou S, Wang J, Zheng N (2016) Person re-identification by multi-channel parts-based CNN with improved triplet loss function. In: 2016 IEEE Conference on computer vision and pattern recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016, pp 1335–1344
Cho H, Rybski P E, Bar-Hillel A, Zhang W (2012) Real-time pedestrian detection with deformable part models. In: Intelligent vehicles symposium. IEEE, pp 1035–1042
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: CVPR (1). IEEE Computer Society, pp 886–893
Dollár P, Appel R, Belongie S J, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–1545
Dollár P, Wojek C, Schiele B, Perona P (2012) Pedestrian detection: an evaluation of the state of the art. IEEE Trans Pattern Anal Mach Intell 34(4):743–761
Felzenszwalb P F, Girshick R B, McAllester D A, Ramanan D (2010) Object detection with discriminatively trained part-based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645
He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition. ar**v:1512.03385
He Y, Zhu C, Wang J, Savvides M, Zhang X (2019) Bounding box regression with uncertainty for accurate object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2888–2897
He Z, Jung C, Fu Q, Zhang Z (2019) Deep feature embedding learning for person re-identification based on lifted structured loss. Multimed Tools Appl 78(5):5863–5880
Kwak J Y, Ko B, Nam J-Y (2015) Multi-person tracking based on body parts and online random ferns learning of thermal images. In: 2015 IEEE Winter conference on applications of computer vision, WACV 2015, Waikoloa, HI, USA, January 5-9, 2015, pp 41–46
Leng Q, Ye M, Tian Q (2019) A survey of open-world person re-identification. IEEE Trans Circuits Syst Video Technol, 1–1. https://doi.org/10.1109/TCSVT.2019.2898940, https://ieeexplore.ieee.org/document/8640834/
Li D, Chen X, Zhang Z, Huang K (2017) Learning deep context-aware features over body and latent parts for person re-identification. ar**v:http://arxiv.org/
Li T, Sun L, Han C, Guo J (2018) Person re-identification using salient region matching game. Multimed Tools Appl 77(16):21393–21415
Li W, Zhao R, **ao T, Wang X (2014) Deepreid: deep filter pairing neural network for person re-identification. In: CVPR. IEEE Computer Society, pp 152–159
Liu H, Feng J, Jie Z, Karlekar J, Zhao B, Qi M, Jiang J, Yan S (2017) Neural person search machines. In: IEEE International conference on computer vision, ICCV 2017, Venice, Italy, October 22-29, 2017, pp 493–501
Luo H, Jiang W, Zhang X, Fan X, Qian J, Zhang C (2019) Alignedreid++: dynamically matching local information for person re-identification. Pattern Recogn 94:53–61
Mubariz N, Mumtaz S, Hamayun M M, Fraz M M (2018) Optimization of person re-identification through visual descriptors. In: Proceedings of (VISIGRAPP 2018) - volume 4: VISAPP, Funchal, Madeira, Portugal, January 27-29, 2018, pp 348–355
Mumtaz S, Mubariz N, Saleem S, Fraz M M (2017) Weighted hybrid features for person re-identification. In: Seventh international conference on image processing theory, tools and Applications, IPTA 2017, Montreal, QC, Canada, November 28 - December 1, 2017, pp 1–6
Munjal B, Amin S, Tombari F, Galasso F (2019) Query-guided end-to-end person search. ar**v:1905.01203
Nam W, Dollár P, Han J H (2014) Local decorrelation for improved pedestrian detection. In: Advances in neural information processing systems 27: annual conference on neural information processing systems 2014, December 8-13 2014, Montreal, Quebec, Canada, pp 424–432
Ou X, Ma Q, Wang Y (2019) Improving person re-identification by multi-task learning. Multimed Tools Appl 78(19):28257–28283
Perwaiz N, Fraz M M, Shahzad M (2019) Hierarchical refined local associations for robust person re-identification. In: 2019 International conference on robotics and automation in industry (ICRAI), pp 1–6
Perwaiz N, Fraz MM, Shahzad M (2020) Smart visual surveillance: proactive person re-identification instead of impulsive person search. In: 2020 IEEE 23rd international multitopic conference (INMIC). IEEE, pp 1–6
Perwaiz N, Fraz MM, Shahzad M (2021) Stochastic attentions and context learning for person re-identification. In: 2021 PeerJ Computer Science. PeerJ
Perwaiz N, Fraz M M, Shahzad M (2018) Person re-identification using hybrid representation reinforced by metric learning. IEEE Access 6:77334–77349
Qi M, Han J, Jiang J, Liu H (2019) Deep feature representation and multiple metric ensembles for person re-identification in security surveillance system. Multimed Tools Appl 78(19):27029–27043
Ren S, He K, Girshick R B, Sun J (2017) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149
Tang S, Ye M, Xu P, Li X (2019) Adaptive pedestrian detection by predicting classifier. Neural Comput Applic 31(4):1189–1200
Tian Y, Li Q, Wang D, Wan B (2019) Robust joint learning network: improved deep representation learning for person re-identification. Multimed Tools Appl 78(17):24187–24203
Verma A, Hebbalaguppe R, Vig L, Kumar S, Hassan E (2015) Pedestrian detection via mixture of CNN experts and thresholded aggregated channel features. In: ICCV workshops. IEEE Computer Society, pp 555–563
Wu J, Jiang J, Qi M, Liu H (2019) Independent metric learning with aligned multi-part features for video-based person re-identification. Multimed Tools Appl 78(20):29323–29341
**ao J, **e Y, Tillo T, Huang K, Wei Y, Feng J (2017) IAN: the individual aggregation network for person search. ar**v:1705.0555
**ao T, Li S, Wang B, Lin L, Wang X (2017) Joint detection and identification feature learning for person search. In: CVPR. IEEE Computer Society, pp 3376–3385
Xu Y, Ma B, Huang R, Lin L (2014) Person search in a scene by jointly modeling people commonness and person uniqueness. In: ACM Multimedia. ACM, pp 937–940
Yang D, Zhang J, Xu S, Ge S, Kumar G H, Zhang X (2018) Real-time pedestrian detection via hierarchical convolutional feature. Multimed Tools Appl 77(19):25841–25860
Zhai S, Liu S, Wang X, Tang J (2019) Fmt: fusing multi-task convolutional neural network for person search. Multimed Tools Appl
Zhang N, Paluri M, Taigman Y, Fergus R, Bourdev L D (2015) Beyond frontal faces: improving person recognition using multiple cues. In: CVPR. IEEE Computer Society, pp 4804–4813
Zhang S, Benenson R, Schiele B (2015) Filtered channel features for pedestrian detection. In: CVPR. IEEE Computer Society, pp 1751–1760
Zhang S, Benenson R, Schiele B (2017) Citypersons: a diverse dataset for pedestrian detection. In: CVPR. IEEE Computer Society, pp 4457–4465
Zhang S, Yang J, Schiele B (2018) Occluded pedestrian detection through guided attention in cnns. In: CVPR. IEEE Computer Society, pp 6995–7003
Zhao K, Deng J, Cheng D (2018) Real-time moving pedestrian detection using contour features. Multimedia Tools Appl 77(23):30891–30910
Zheng L, Yang Y, Hauptmann A G (2016) Person re-identification: past, present and future. ar**v:1610.02984
Zheng L, Zhang H, Sun S, Chandraker M, Yang Y, Tian Q (2017) Person re-identification in the wild. In: 2017 IEEE Conference on computer vision and pattern recognition (CVPR). https://doi.org/10.1109/CVPR.2017.357, http://ieeexplore.ieee.org/document/8099840/. IEEE, Honolulu, pp 3346–3355
Zhu F, Kong X, Wu Q, Fu H, Li M (2018) A loss combination based deep model for person re-identification. Multimed Tools Appl 77(3):3049–3069
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Perwaiz, N., Fraz, M.M. & Shahzad, M. Smart surveillance with simultaneous person detection and re-identification. Multimed Tools Appl 83, 15461–15482 (2024). https://doi.org/10.1007/s11042-022-13458-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-13458-y