Pluralistic (Free-Form) Image Completion

Nanyang Technological University

CVPR, 2019IJCV, 2021

Abstract

In this paper, we present an approach for pluralistic image completion the task of generating multiple diverse and plausible solutions for image completion.

A major challenge faced by learning-based approaches is that here the conditional label itself is a partial image, and there is usually only one ground truth training instance per label. As such, sampling from conditional VAEs still leads to minimal diversity.

To overcome this, we propose a novel and probabilistically principled framework with two parallel paths. One is a reconstructive path that extends the VAE through a latent space that covers all partial images with different mask sizes, and imposes priors that adapt to the number of pixels. The other is a generative path for which the conditional prior is coupled to distributions obtained in the reconstructive path. Both are supported by GANs. We also introduce a new short+long term attention layer that exploits distant relations among decoder and encoder features, improving appearance consistency.

Framework

Given a masked images, the generative pipeline (blue line) infers the conditional distribution of missing regions, that can be sampled during the testing to generate multiple and diverse results. During the training, the missing regions are encodered to a distribution, that can be sampled to rebuild the original input by combing with the features of visible part (yellow line). This structure is designed on a probabilistically principled freamework. The details can be found in the paper.

Video

Citation

@inproceedings{zheng2019pluralistic,
    title={Pluralistic image completion},
    author={Zheng, Chuanxia and Cham, Tat-Jen and Cai, Jianfei},
    booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
    pages={1438--1447},
    year={2019}
  }
  @article{zheng2021pluralistic,
    title={Pluralistic free-form image completion},
    author={Zheng, Chuanxia and Cham, Tat-Jen and Cai, Jianfei},
    journal={International Journal of Computer Vision},
    volume={129},
    number={10},
    pages={2786--2805},
    year={2021},
    publisher={Springer}
  }