Attribute Space Analysis forĀ Image Editing

  • Conference paper
  • First Online:
Image and Graphics (ICIG 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14356))

Included in the following conference series:

  • 350 Accesses

Abstract

Image editing is a widely studied topic in computer vision, which enables the modification of specific attributes in images without altering other crucial information. One popular unsupervised technique currently used is feature decomposition in the latent space of Generative Adversarial Networks (GANs), which provides editing directions that can control attribute changes to achieve desired image editing results. However, this method often does not allow for the direct acquisition of the desired editing direction by setting the target attribute in advance. In this work, we propose a method to finding editing directions in the attribute space by analyzing image differences. This enables users to obtain target directions by actively defining the attribute they want to change. Specifically, this method discovers semantic directions suitable for target attribute editing by applying Principal Component Analysis (PCA) on the difference of image latent codes embedded in the latent space. Through experiments, our method can effectively find the target editing direction according to user needs and achieve satisfactory editing effects at the same time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Goodfellow, I., et al.: Generative adversarial networks. Commun. ACM 63(11), 139ā€“144 (2020)

    ArticleĀ  MathSciNetĀ  Google ScholarĀ 

  2. Kingma, D.P., Welling, M.: Auto-encoding variational bayes. ar**v preprint ar**v:1312.6114 (2013)

  3. Ho, J., Jain, A., Abbeel, P.: Denoising diffusion probabilistic models. Adv. Neural. Inf. Process. Syst. 33, 6840ā€“6851 (2020)

    Google ScholarĀ 

  4. Karras, T., Laine, S., Aila, T.: A style-based generator architecture for generative adversarial networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4401ā€“4410 (2019)

    Google ScholarĀ 

  5. Karras, T., Laine, S., Aittala, M., Hellsten, J., Lehtinen, J., Aila, T.: Analyzing and improving the image quality of styleGAN. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8110ā€“8119 (2020)

    Google ScholarĀ 

  6. Brock, A., Donahue, J., Simonyan, K.: Large scale GAN training for high fidelity natural image synthesis. ar**v preprint ar**v:1809.11096 (2018)

  7. Lee, C.H., Liu, Z., Wu, L., Luo, P.: MaskGAN: towards diverse and interactive facial image manipulation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5549ā€“5558 (2020)

    Google ScholarĀ 

  8. Tan, Z., Ye, Z., Yang, X., Wang, Q., Yan, Y., Huang, K.: Towards better text-image consistency in text-to-image generation. ar**v preprint ar**v:2210.15235 (2022)

  9. Shen, Y., Gu, J., Tang, X., Zhou, B.: Interpreting the latent space of GANs for semantic face editing. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9243ā€“9252 (2020)

    Google ScholarĀ 

  10. Goetschalckx, L., Andonian, A., Oliva, A., Isola, P.: GANalyze: toward visual definitions of cognitive image properties. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 5744ā€“5753 (2019)

    Google ScholarĀ 

  11. Shen, Y., Zhou, B.: Closed-form factorization of latent semantics in GANs. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1532ā€“1540 (2021)

    Google ScholarĀ 

  12. HƤrkƶnen, E., Hertzmann, A., Lehtinen, J., Paris, S.: GANspace: discovering interpretable GAN controls. Adv. Neural. Inf. Process. Syst. 33, 9841ā€“9850 (2020)

    Google ScholarĀ 

  13. Noble, W.S.: What is a support vector machine? Nat. Biotechnol. 24(12), 1565ā€“1567 (2006)

    ArticleĀ  Google ScholarĀ 

  14. Wold, S., Esbensen, K., Geladi, P.: Principal component analysis. Chemom. Intell. Lab. Syst. 2(1ā€“3), 37ā€“52 (1987)

    ArticleĀ  Google ScholarĀ 

  15. Dinh, T.M., Tran, A.T., Nguyen, R., Hua, B.S.: Hyperinverter: improving styleGAN inversion via hypernetwork. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11389ā€“11398 (2022)

    Google ScholarĀ 

  16. Alaluf, Y., Tov, O., Mokady, R., Gal, R., Bermano, A.: HyperStyle: StyleGAN inversion with hypernetworks for real image editing. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 18511ā€“18521 (2022)

    Google ScholarĀ 

  17. Karras, T., Aila, T., Laine, S., Lehtinen, J.: Progressive growing of GANs for improved quality, stability, and variation. ar**v preprint ar**v:1710.10196 (2017)

  18. Yu, F., Seff, A., Zhang, Y., Song, S., Funkhouser, T., **ao, J.: LSUN: construction of a large-scale image dataset using deep learning with humans in the loop. ar**v preprint ar**v:1506.03365 (2015)

  19. Krause, J., Stark, M., Deng, J., Fei-Fei, L.: 3d object representations for fine-grained categorization. In: Proceedings of the IEEE International Conference on Computer Vision Workshops, pp. 554ā€“561 (2013)

    Google ScholarĀ 

  20. Liu, Z., Luo, P., Wang, X., Tang, X.: Deep learning face attributes in the wild. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3730ā€“3738 (2015)

    Google ScholarĀ 

Download references

Acknowledgements

This work was supported by the Qingdao Natural Science Foundation (No. 23-2-1-161-zyyd-jch), the Shandong Natural Science Foundation (No. ZR2023MF008, No. ZR2023QF046), the Major Scientific and Technological Projects of CNPC (No. ZD2019-183-008) and the National Natural Science Foundation of China (No. 61671480).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Weifeng Liu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chen, Y., Yang, S., Liu, B., Liu, W. (2023). Attribute Space Analysis forĀ Image Editing. In: Lu, H., et al. Image and Graphics. ICIG 2023. Lecture Notes in Computer Science, vol 14356. Springer, Cham. https://doi.org/10.1007/978-3-031-46308-2_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-46308-2_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-46307-5

  • Online ISBN: 978-3-031-46308-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation