TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/VTGAN: Semi-supervised Retinal Image Synthesis and Disease...

VTGAN: Semi-supervised Retinal Image Synthesis and Disease Prediction using Vision Transformers

Sharif Amit Kamran, Khondker Fariha Hossain, Alireza Tavakkoli, Stewart Lee Zuckerbrod, Salah A. Baker

2021-04-14Image GenerationDisease Prediction
PaperPDFCode(official)Code

Abstract

In Fluorescein Angiography (FA), an exogenous dye is injected in the bloodstream to image the vascular structure of the retina. The injected dye can cause adverse reactions such as nausea, vomiting, anaphylactic shock, and even death. In contrast, color fundus imaging is a non-invasive technique used for photographing the retina but does not have sufficient fidelity for capturing its vascular structure. The only non-invasive method for capturing retinal vasculature is optical coherence tomography-angiography (OCTA). However, OCTA equipment is quite expensive, and stable imaging is limited to small areas on the retina. In this paper, we propose a novel conditional generative adversarial network (GAN) capable of simultaneously synthesizing FA images from fundus photographs while predicting retinal degeneration. The proposed system has the benefit of addressing the problem of imaging retinal vasculature in a non-invasive manner as well as predicting the existence of retinal abnormalities. We use a semi-supervised approach to train our GAN using multiple weighted losses on different modalities of data. Our experiments validate that the proposed architecture exceeds recent state-of-the-art generative networks for fundus-to-angiography synthesis. Moreover, our vision transformer-based discriminators generalize quite well on out-of-distribution data sets for retinal disease prediction.

Results

TaskDatasetMetricValueModel
Image-to-Image TranslationFundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic PatientsFID17.3VTGAN
Image-to-Image TranslationFundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic PatientsKernel Inception Distance0.00053VTGAN
Image GenerationFundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic PatientsFID17.3VTGAN
Image GenerationFundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic PatientsKernel Inception Distance0.00053VTGAN
1 Image, 2*2 StitchingFundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic PatientsFID17.3VTGAN
1 Image, 2*2 StitchingFundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic PatientsKernel Inception Distance0.00053VTGAN

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17Analysis of Image-and-Text Uncertainty Propagation in Multimodal Large Language Models with Cardiac MR-Based Applications2025-07-17FADE: Adversarial Concept Erasure in Flow Models2025-07-16CharaConsist: Fine-Grained Consistent Character Generation2025-07-15