Exert Diversity and Mitigate Bias: Domain Generalizable Person Re-identification with a Comprehensive Benchmark

Hu, Bingyu; Liu, Jiawei; Zheng, Yufei; Zheng, Kecheng; Zha, Zheng-Jun

doi:10.1007/s11263-024-02124-5

Published: 03 June 2024

International Journal of Computer Vision

Bingyu Hu¹, Jiawei Liu¹, Yufei Zheng¹, Kecheng Zheng¹ & Zheng-Jun Zha¹

Abstract

Person re-identification (ReID), aiming at retrieving persons of the same identity across non-overlapping cameras, holds immense practical significance for security and surveillance applications. In pursuit of a more general and practical solution, recent research attention has gradually shifted from the traditional single-domain ReID to the domain generalizable person re-identification (DG-ReID). However, the DG-ReID landscape lacks a meticulously designed and all-encompassing benchmark to provide a common ground for competing approaches. To this end, in this paper, we first delve into the intricate challenges of DG-ReID and introduce a comprehensive and large-scale benchmark with enhanced distributional variety and shifts to facilitate the research progress. Furthermore, in response to the highlighted challenges, a novel DG-ReID framework based on diverse feature space learning with domain factorization is proposed to effectively learn rich domain-adaptive discriminative features through the two designed blocks with fairly limited additional cost in both memory and computation. Firstly, the feature diversification block promotes a diverse feature space capable of learning domain-specific characteristics under the rich distributional variety. Secondly, the domain-adaptive shielding block applies channel-wise shielding operations based on subspace-based domain factorization in order to prevent the model from prediction bias caused by distributional shifts. Our extensive experiments demonstrate the effectiveness of the proposed framework, surpassing the performance of current state-of-the-art methods under various evaluation protocols.
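
To make the retrieval task in the abstract concrete, the following is a minimal sketch of cross-camera ReID ranking. It is not the authors' framework: it assumes feature vectors have already been extracted by some model, and the names `cosine`, `rank_gallery`, and the dictionary keys `feat`, `cam`, and `pid` are hypothetical. Skipping every gallery image from the query's own camera is a simplification of common evaluation protocols, used here only to reflect the "non-overlapping cameras" setting.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero vector as matching nothing
    return dot / (norm_a * norm_b)


def rank_gallery(query, gallery):
    """Return gallery indices ordered from most to least similar to the query.

    Assumption: gallery images taken by the query's own camera are skipped,
    so only cross-camera matches are ranked.
    """
    scored = [
        (cosine(query["feat"], item["feat"]), index)
        for index, item in enumerate(gallery)
        if item["cam"] != query["cam"]
    ]
    # list.sort is stable, so ties keep their original gallery order.
    scored.sort(key=lambda pair: -pair[0])
    return [index for _, index in scored]


# Toy data: two-dimensional features stand in for real embeddings.
query = {"feat": [1.0, 0.0], "cam": 0}
gallery = [
    {"feat": [0.9, 0.1], "cam": 1, "pid": 7},  # same person, other camera
    {"feat": [0.0, 1.0], "cam": 2, "pid": 3},
    {"feat": [1.0, 0.0], "cam": 0, "pid": 7},  # same camera, skipped
    {"feat": [0.5, 0.5], "cam": 1, "pid": 5},
]

print(rank_gallery(query, gallery))  # [0, 3, 1]
```

In a domain generalizable setting, the model producing `feat` would be trained on source datasets and evaluated on a gallery from an unseen target domain, with the ranking step itself unchanged.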

Funding

This work was supported by the National Natural Science Foundation of China (NSFC) under Grants 62225207 and 62106245.

Author information

Authors and Affiliations

University of Science and Technology of China, Hefei, China
Bingyu Hu, Jiawei Liu, Yufei Zheng, Kecheng Zheng & Zheng-Jun Zha

Corresponding author

Correspondence to Jiawei Liu.

Additional information

Communicated by Zhun Zhong.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Hu, B., Liu, J., Zheng, Y. et al. Exert Diversity and Mitigate Bias: Domain Generalizable Person Re-identification with a Comprehensive Benchmark. Int J Comput Vis (2024). https://doi.org/10.1007/s11263-024-02124-5

Received: 15 October 2023
Accepted: 14 May 2024
Published: 03 June 2024
DOI: https://doi.org/10.1007/s11263-024-02124-5
