Styled Comic Portrait Synthesis Based on GAN

Chen, Yen-Chia; Chen, Lieu-Hen; Shibata, Hiroki; Takama, Yasufumi

doi:10.1007/978-3-030-96451-1_7

Yen-Chia Chen²⁴,
Lieu-Hen Chen²⁵,
Hiroki Shibata²⁴ &
…
Yasufumi Takama²⁴

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1423))

Included in the following conference series:

Annual Conference of the Japanese Society for Artificial Intelligence

Abstract

Although there are many studies on NPR (Non-Photorealistic Rendering) image synthesis using GANs (Generative Adversarial Networks), it is still difficult to create high-quality comic portraits of a real person. Moreover, there are few studies focused on the painting styles of comic authors, which is the style that makes a comic visually unique. It is expected that for comic readers, the synthesized comic portraits can be more attractive and meaningful if the portraits are presented in the user-preferred comic styles. Therefore, this paper proposes a styled comic portraits synthesis system based on CycleGAN and PIX2PIX. By integrating Deep Learning and NPR techniques, the proposed system aims to transform user’s real pictures into comic portraits with features preserved and defined painting style presented. CNN (Convolutional Neural Networks) is trained to classify the painting styles of comic authors. After that, two sets of GANs are trained with classified and augmented dataset, which is generated by mapping comic characters’ 2D texture onto perturbed and deformed 3D facial models. The experiment results show that the proposed system can successfully create clear and vivid comic portraits, which has a great potential to serve as a useful tool for social network and comic industry.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 20591; Price includes VAT (Japan)

Softcover Book: JPY 25739; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Aesthetic Evaluation of Facial Portraits Using Compositional Augmentation for Deep CNNs

Realistic real-time processing of anime portraits based on generative adversarial networks

Article 06 June 2024

Immersive Traditional Chinese Portrait Painting: Research on Style Transfer and Face Replacement

References

Ledig, C., et al.: Photo-realistic single image super-resolution using a generative adversarial network. In: Conference on Computer Vision and Pattern Recognition (2017)
Google Scholar
Zhu, J., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: International Conference on Computer Vision (2017)
Google Scholar
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Chapter Google Scholar
Isola, P., Zhu, J., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. In: Conference on Computer Vision and Pattern Recognition (2016)
Google Scholar
Zhang, H., et al.: StackGAN: text to photo-realistic image synthesis with stacked generative adversarial networks. In: International Conference on Computer Vision (2017)
Google Scholar
Zhang, H., et al.: StackGAN++: realistic image synthesis with stacked generative adversarial networks. Inst. Electr. Electron. Eng. Trans. Pattern Anal. Mach. Intell. 41, 1947-1962 (2018)
Google Scholar
Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks. In: International Conference on Learning Representations (2016)
Google Scholar
Chen, L., Chen, Y., Lin, S., Liu, T., Hsieh, W.: Synthesizing non-photorealistic rendering effects of volumetric strokes. J. Inf. Sci. Eng. 28, 521–535 (2012)
Google Scholar
Sun, C., Chen, L., Takama, Y.: Synthesizing NPR styled street view animation based on deep learning. In: International Conference on Technologies and Applications of Artificial Intelligence (2019)
Google Scholar
Chen, S., Su, W., Gao, L., Xia, S., Fu, H.: Deep generation of face images from sketches. In: Special Interest Group on Computer Graphics and Interactive Techniques (2020)
Google Scholar
Karras, T., Laine, S., Ail, T.: A style-based generator architecture for generative adversarial networks. In: Conference on Computer Vision and Pattern Recognition (2019)
Google Scholar
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Conference on Computer Vision and Pattern Recognition (2015)
Google Scholar
Demir, U., Unal, G.: Patch-based image inpainting with generative adversarial networks. arXiv preprint, arXiv:1803.07422 (2018)
Goodfellow, I.J., et al.: Image augmentation using radial transform for training deep neural network. In: Institute of Electrical and Electronics Engineers International Conference on Acoustics, Speech, and Signal Processing (2014)
Google Scholar
Perez, L., Wang, J.: The Effectiveness of data augmentation in image classification using deep learning. arXiv preprint, arXiv:1712.04621 (2017)
Bloice, M.D., Stocker, C., Holzinger, A.: Augmentor: an image augmentation library for machine learning. J. Open Sour. Softw. (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Tokyo Metropolitan University, Hachioji, Japan
Yen-Chia Chen, Hiroki Shibata & Yasufumi Takama
National Chi Nan University, Nantou County, Taiwan
Lieu-Hen Chen

Authors

Yen-Chia Chen
View author publications
You can also search for this author in PubMed Google Scholar
Lieu-Hen Chen
View author publications
You can also search for this author in PubMed Google Scholar
Hiroki Shibata
View author publications
You can also search for this author in PubMed Google Scholar
Yasufumi Takama
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Graduate School of System Design, Tokyo Metropolitan University, Hino, Tokyo, Japan
Yasufumi Takama
Graduate School of Economics, Osaka University, Toyonaka, Osaka, Japan
Naohiro Matsumura
Faculty of Business and Commerce, Kansai University, Osaka, Japan
Katsutoshi Yada
Faculty of Informatics, Kansai University, Takatsuki, Osaka, Japan
Mitsunori Matsushita
Department of Applied Computer Science, Tokyo Polytechnic University, Atsugi, Kanagawa, Japan
Daisuke Katagami
Division of Behavioral Science, Chiba University, Chiba, Japan
Akinori Abe
Department of Intelligence Science and Technology, Kyoto University, Kyoto, Japan
Hisashi Kashima
Institute of Industrial Science, The University of Tokyo, Meguro-ku, Tokyo, Japan
Toshihiro Hiraoka
Department of Computer Science and Engineering, Nagoya Institute of Technology, Nagoya, Japan
Takahiro Uchiya
Graduate School of Information Science and Technology Sapporo Hokkaido Japan, Hokkaido University, Sapporo, Hokkaido, Japan
Rafal Rzepka

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, YC., Chen, LH., Shibata, H., Takama, Y. (2022). Styled Comic Portrait Synthesis Based on GAN. In: Takama, Y., et al. Advances in Artificial Intelligence. JSAI 2021. Advances in Intelligent Systems and Computing, vol 1423. Springer, Cham. https://doi.org/10.1007/978-3-030-96451-1_7

Download citation

DOI: https://doi.org/10.1007/978-3-030-96451-1_7
Published: 26 February 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-96450-4
Online ISBN: 978-3-030-96451-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics