Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Image Reconstruction on ImageNet

Metric: FID (lower is better)
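The page does not say how the FID values are computed. As a rough sketch, the Fréchet distance behind FID fits a Gaussian to each of two feature sets and compares their means and covariances. In common practice the features come from an Inception network applied to the original and reconstructed images; that step is omitted here and assumed to have been done already. The function name `frechet_distance` is hypothetical and not taken from any of the listed papers.

```python
import numpy as np


def frechet_distance(feats_a: np.ndarray, feats_b: np.ndarray) -> float:
    """Frechet distance between Gaussians fitted to two feature sets.

    Each input has shape (n_samples, n_features). Feature extraction
    (for example with an Inception network) is assumed to be done upstream.
    """
    mu_a, mu_b = feats_a.mean(axis=0), feats_b.mean(axis=0)
    cov_a = np.atleast_2d(np.cov(feats_a, rowvar=False))
    cov_b = np.atleast_2d(np.cov(feats_b, rowvar=False))

    # Tr((cov_a cov_b)^(1/2)) equals the sum of the square roots of the
    # eigenvalues of cov_a @ cov_b, which are real and non-negative for
    # positive semi-definite inputs. Clip tiny negatives from round-off.
    eigvals = np.linalg.eigvals(cov_a @ cov_b).real
    trace_sqrt = np.sqrt(np.clip(eigvals, 0.0, None)).sum()

    diff = mu_a - mu_b
    return float(diff @ diff + np.trace(cov_a) + np.trace(cov_b) - 2.0 * trace_sqrt)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    original = rng.normal(size=(500, 4))               # stand-in features
    reconstructed = original + 0.05 * rng.normal(size=(500, 4))
    print(frechet_distance(original, reconstructed))   # small positive value
```

Identical feature sets give a distance of zero, and the value grows as the reconstructed features drift from the originals, which is why lower is better on this leaderboard.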


Results

| # | Model | FID | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | MGVQ (16x16x8) | 0.49 | No | MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tok... | 2025-07-10 | Yes |
| 2 | MGVQ (16x16x4) | 0.64 | No | MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tok... | 2025-07-10 | Yes |
| 3 | GigaTok-XL-XXL | 0.79 | No | GigaTok: Scaling Visual Tokenizers to 3 Billion ... | 2025-04-11 | Yes |
| 4 | OptVQ (16x16x8) | 0.91 | No | Preventing Local Pitfalls in Vector Quantization... | 2024-12-19 | Yes |
| 5 | OptVQ (16x16x4) | 1 | No | Preventing Local Pitfalls in Vector Quantization... | 2024-12-19 | Yes |
| 6 | IBQ (16x16) | 1 | No | Taming Scalable Visual Tokenizer for Autoregress... | 2024-12-03 | Yes |
| 7 | Mo-VQGAN (16x16x4) | 1.12 | No | MoVQ: Modulating Quantized Vectors for High-Fide... | 2022-09-19 | Yes |
| 8 | Open-Magvit2 (16x16) | 1.17 | No | Open-MAGVIT2: An Open-Source Project Toward Demo... | 2024-09-06 | Yes |
| 9 | ViT-VQGAN (16x16) | 1.28 | No | Vector-quantized Image Modeling with Improved VQ... | 2021-10-09 | Yes |
| 10 | MaskBit (16x16) | 1.66 | No | MaskBit: Embedding-free Image Generation via Bit... | 2024-09-24 | Yes |
| 11 | TiTok-S-128 | 1.71 | No | An Image is Worth 32 Tokens for Reconstruction a... | 2024-06-11 | Yes |
| 12 | RQ-VAE (8x8x16) | 1.83 | No | Autoregressive Image Generation using Residual Q... | 2022-03-03 | Yes |
| 13 | MaskGIT-VQGAN (16x16) | 2.28 | No | MaskGIT: Masked Generative Image Transformer | 2022-02-08 | Yes |
| 14 | VQGAN-LC (16x16) | 2.62 | No | Scaling the Codebook Size of VQGAN to 100,000 wi... | 2024-06-17 | Yes |
| 15 | Taming-VQGAN (16x16) | 3.64 | No | Taming Transformers for High-Resolution Image Sy... | 2020-12-17 | Yes |