Multi-Scale Contrastive Learning for Complex Scene Generation

Information

Title Multi-Scale Contrastive Learning for Complex Scene Generation
Authors
Hanbit Lee, Youna Kim, Sang-goo Lee
Year 2023 / 1
Keywords computer vision, scene generation
Publication Type International Conference
Publication IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2023)
Link url

Abstract

Recent advances in Generative Adversarial Networks (GANs) have enabled photo-realistic synthesis of single object images. Yet, modeling more complex distributions, such as scenes with multiple objects, remains challenging. The difficulty stems from the incalculable variety of scene configurations which contain multiple objects of different categories placed at various locations. In this paper, we aim to alleviate the difficulty by enhancing the discriminative ability of the discriminator through a locally defined self-supervised pretext task. To this end, we design a discriminator to leverage multi-scale local feedback that guides the generator to better model local semantic structures in the scene. Then, we require the discriminator to carry out pixel-level contrastive learning at multiple scales to enhance discriminative capability on local regions. Experimental results on several challenging scene datasets show that our method improves the synthesis quality by a substantial margin compared to state-of-the-art baselines.