Cross-view image synthesis using geometry-guided conditional GANs

Krishna Regmi, Ali Borji

2018-08-14Image Generation Cross-View Image-to-Image Translation

Abstract

We address the problem of generating images across two drastically different views, namely ground (street) and aerial (overhead) views. Image synthesis by itself is a very challenging computer vision task and is even more so when generation is conditioned on an image in another view. Due the difference in viewpoints, there is small overlapping field of view and little common content between these two views. Here, we try to preserve the pixel information between the views so that the generated image is a realistic representation of cross view input image. For this, we propose to use homography as a guide to map the images between the views based on the common field of view to preserve the details in the input image. We then use generative adversarial networks to inpaint the missing regions in the transformed image and add realism to it. Our exhaustive evaluation and model comparison demonstrate that utilizing geometry constraints adds fine details to the generated images and can be a better approach for cross view image synthesis than purely pixel based synthesis methods.

Results

Task	Dataset	Metric	Value	Model
Image-to-Image Translation	Dayton (256×256) - ground-to-aerial	SSIM	0.2763	X-Fork
Image Generation	Dayton (256×256) - ground-to-aerial	SSIM	0.2763	X-Fork
1 Image, 2*2 Stitching	Dayton (256×256) - ground-to-aerial	SSIM	0.2763	X-Fork

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17 Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17 FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17 A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17 Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17 FADE: Adversarial Concept Erasure in Flow Models2025-07-16 CharaConsist: Fine-Grained Consistent Character Generation2025-07-15 CATVis: Context-Aware Thought Visualization2025-07-15