StyleGAN2

StyleGAN2: fixing several shortcomings of StyleGAN and taking it one step further

Released in: Analyzing and Improving the Image Quality of StyleGAN

Contributor:

Summary

The style-based GAN architecture (StyleGAN) yields state-of-the-art results in data-driven unconditional generative image modeling. Here, the authors expose and analyze several of its characteristic artifacts, and propose changes in both model architecture and training methods to address them. In particular, they redesign the generator normalization, revisit progressive growing, and regularize the generator to encourage good conditioning in the mapping from latent codes to images. In addition to improving image quality, this path length regularizer yields the additional benefit that the generator becomes significantly easier to invert. This makes it possible to reliably attribute a generated image to a particular network. The authors furthermore visualize how well the generator utilizes its output resolution, and identify a capacity problem, motivating to train larger models for additional quality improvements. Overall, the improved model redefines the state of the art in unconditional image modeling, both in terms of existing distribution quality metrics as well as perceived image quality.

2019

Year Released

Key Links & Stats

StyleGAN2

StyleGAN2

Analyzing and Improving the Image Quality of StyleGAN

@article{DBLP:journals/corr/abs-1912-04958, author = {Tero Karras and Samuli Laine and Miika Aittala and Janne Hellsten and Jaakko Lehtinen and Timo Aila}, title = {Analyzing and Improving the Image Quality of StyleGAN}, journal = {CoRR}, volume = {abs/1912.04958}, year = {2019}, url = {http://arxiv.org/abs/1912.04958}, eprinttype = {arXiv}, eprint = {1912.04958}, timestamp = {Thu, 02 Jan 2020 18:08:18 +0100}, biburl = {https://dblp.org/rec/journals/corr/abs-1912-04958.bib}, bibsource = {dblp computer science bibliography, https://dblp.org} }

ML Tasks

  1. General
  2. Image Generation
  3. Style Transfer

ML Platform

  1. Not Applicable

Modalities

  1. Still Image

Verticals

  1. Facial
  2. Digital Human

CG Platform

  1. Not Applicable

Related organizations

NVIDIA