Abstract
Image editing is a widely studied topic in computer vision that enables the modification of specific attributes in images without altering other crucial information. One popular unsupervised technique is feature decomposition in the latent space of Generative Adversarial Networks (GANs), which yields editing directions that control attribute changes to achieve the desired editing results. However, this approach typically does not allow the user to obtain the desired editing direction directly by specifying the target attribute in advance. In this work, we propose a method for finding editing directions in the attribute space by analyzing image differences, which lets users obtain target directions by actively defining the attribute they want to change. Specifically, the method discovers semantic directions suitable for editing a target attribute by applying Principal Component Analysis (PCA) to the differences of image latent codes embedded in the latent space. Experiments show that our method can effectively find the target editing direction according to user needs while achieving satisfactory editing results.
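The core idea described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes two sets of latent codes for paired images that differ mainly in the target attribute (e.g., obtained from a StyleGAN encoder), and applies PCA (via SVD) to their differences to recover the dominant semantic direction. All function and variable names are hypothetical.

```python
import numpy as np

def find_edit_direction(codes_with_attr, codes_without_attr, n_components=1):
    """Sketch of the abstract's idea: PCA on latent-code differences.

    codes_with_attr / codes_without_attr: (N, d) arrays of latent codes for
    paired images that differ mainly in the target attribute.
    Returns the top n_components principal directions of the differences,
    shape (n_components, d).
    """
    diffs = codes_with_attr - codes_without_attr        # (N, d)
    diffs = diffs - diffs.mean(axis=0, keepdims=True)   # center before PCA
    # PCA via SVD: rows of vt are the principal axes, ordered by variance.
    _, _, vt = np.linalg.svd(diffs, full_matrices=False)
    return vt[:n_components]

def edit_latent(w, direction, alpha):
    """Move a latent code along the discovered direction by strength alpha;
    the edited code would then be decoded by the generator."""
    return w + alpha * direction
```

In practice, the strength `alpha` controls how far the attribute is pushed, and the edited latent code is fed back through the generator to produce the edited image.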
Acknowledgements
This work was supported by the Qingdao Natural Science Foundation (No. 23-2-1-161-zyyd-jch), the Shandong Natural Science Foundation (No. ZR2023MF008, No. ZR2023QF046), the Major Scientific and Technological Projects of CNPC (No. ZD2019-183-008) and the National Natural Science Foundation of China (No. 61671480).
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, Y., Yang, S., Liu, B., Liu, W. (2023). Attribute Space Analysis for Image Editing. In: Lu, H., et al. Image and Graphics. ICIG 2023. Lecture Notes in Computer Science, vol 14356. Springer, Cham. https://doi.org/10.1007/978-3-031-46308-2_9
DOI: https://doi.org/10.1007/978-3-031-46308-2_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-46307-5
Online ISBN: 978-3-031-46308-2
eBook Packages: Computer Science (R0)