Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Normalizing Flows are Capable Generative Models

Shuangfei Zhai, Ruixiang Zhang, Preetum Nakkiran, David Berthelot, Jiatao Gu, Huangjie Zheng, Tianrong Chen, Miguel Angel Bautista, Navdeep Jaitly, Josh Susskind

2024-12-09
Tasks: Density Estimation, Unconditional Image Generation, Conditional Image Generation

Abstract

Normalizing Flows (NFs) are likelihood-based models for continuous inputs. They have demonstrated promising results on both density estimation and generative modeling tasks, but have received relatively little attention in recent years. In this work, we demonstrate that NFs are more powerful than previously believed. We present TarFlow: a simple and scalable architecture that enables highly performant NF models. TarFlow can be thought of as a Transformer-based variant of Masked Autoregressive Flows (MAFs): it consists of a stack of autoregressive Transformer blocks on image patches, alternating the autoregression direction between layers. TarFlow is straightforward to train end-to-end, and capable of directly modeling and generating pixels. We also propose three key techniques to improve sample quality: Gaussian noise augmentation during training, a post-training denoising procedure, and an effective guidance method for both class-conditional and unconditional settings. Putting these together, TarFlow sets new state-of-the-art results on likelihood estimation for images, beating the previous best methods by a large margin, and generates samples with quality and diversity comparable to diffusion models, for the first time with a stand-alone NF model. We make our code available at https://github.com/apple/ml-tarflow.
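To make the architecture description concrete, here is a minimal NumPy sketch of a MAF-style affine autoregressive flow over a sequence of patch vectors, with the autoregression direction reversed between blocks. It is an illustration under stated assumptions, not the code from the repository above: `causal_params` is a hypothetical toy conditioner (a running mean of earlier patches passed through fixed weight matrices) standing in for a causal Transformer, and all function names are invented for this example.

```python
import numpy as np

def causal_params(x, w_mu, w_alpha):
    """Toy causal conditioner (assumption: stands in for a causal Transformer).
    Position t only sees the mean of x[:t]; position 0 sees zeros."""
    T, _ = x.shape
    csum = np.cumsum(x, axis=0)
    ctx = np.zeros_like(x)
    ctx[1:] = csum[:-1] / np.arange(1, T)[:, None]
    mu = ctx @ w_mu                   # shift for each position
    alpha = np.tanh(ctx @ w_alpha)    # bounded log-scale for each position
    return mu, alpha

def block_forward(x, w_mu, w_alpha):
    """Data -> latent. All positions are computed in parallel."""
    mu, alpha = causal_params(x, w_mu, w_alpha)
    z = (x - mu) * np.exp(-alpha)
    return z, -alpha.sum()            # log|det J| of a triangular Jacobian

def block_inverse(z, w_mu, w_alpha):
    """Latent -> data. Sequential: x[t] needs x[:t] to be known first."""
    x = np.zeros_like(z)
    for t in range(z.shape[0]):
        mu, alpha = causal_params(x, w_mu, w_alpha)
        x[t] = z[t] * np.exp(alpha[t]) + mu[t]
    return x

def flow_forward(x, params):
    """Stack of blocks; flipping the sequence alternates the direction."""
    logdet = 0.0
    for w_mu, w_alpha in params:
        x, ld = block_forward(x, w_mu, w_alpha)
        x = x[::-1]                   # a permutation, so |det| = 1
        logdet += ld
    return x, logdet

def flow_inverse(z, params):
    """Undo flow_forward by inverting the blocks in reverse order."""
    for w_mu, w_alpha in reversed(params):
        z = block_inverse(z[::-1], w_mu, w_alpha)
    return z

def log_prob(x, params):
    """Exact log-density under a standard normal prior on the latent."""
    z, logdet = flow_forward(x, params)
    log_prior = -0.5 * np.sum(z ** 2) - 0.5 * z.size * np.log(2 * np.pi)
    return log_prior + logdet

rng = np.random.default_rng(0)
params = [(0.1 * rng.normal(size=(2, 2)), 0.1 * rng.normal(size=(2, 2)))
          for _ in range(3)]
x = rng.normal(size=(4, 2))           # 4 "patches" of dimension 2
z, logdet = flow_forward(x, params)
x_rec = flow_inverse(z, params)       # recovers x up to rounding error
```

The forward (density) direction is parallel across positions, while sampling through `block_inverse` is sequential, which is the usual trade-off for this family of flows.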

Results

Task              Dataset           Metric        Value  Model
Image Generation  ImageNet 64x64    Bits per dim  2.99   TarFlow
Image Generation  ImageNet 64x64    FID           2.9    TarFlow
Image Generation  ImageNet 128x128  FID           5.03   TarFlow
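For readers unfamiliar with the bits-per-dim metric, the sketch below shows the standard conversion between a per-image negative log-likelihood in nats and bits per dimension. The function names are hypothetical, and the sketch assumes 64x64 RGB images (64 * 64 * 3 dimensions); it does not cover any dequantization or noise-related corrections the paper may apply.

```python
import math

def nats_to_bpd(nll_nats, num_dims):
    """Convert a per-example NLL in nats to bits per dimension."""
    return nll_nats / (num_dims * math.log(2))

def bpd_to_nats(bpd, num_dims):
    """Inverse conversion: bits per dimension to per-example NLL in nats."""
    return bpd * num_dims * math.log(2)

num_dims = 64 * 64 * 3                         # assumption: 64x64 RGB image
nll_nats = bpd_to_nats(2.99, num_dims)         # NLL implied by 2.99 bits/dim
bpd_roundtrip = nats_to_bpd(nll_nats, num_dims)
```

Lower is better for this metric: it is the average number of bits the model needs to encode one pixel channel value.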

Related Papers

Missing value imputation with adversarial random forests -- MissARF (2025-07-21)
3C-FBI: A Combinatorial method using Convolutions for Circle Fitting in Blurry Images (2025-07-15)
Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis (2025-07-02)
Binned semiparametric Bayesian networks (2025-06-27)
Distilling Normalizing Flows (2025-06-26)
EBC-ZIP: Improving Blockwise Crowd Counting with Zero-Inflated Poisson Regression (2025-06-24)
SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning (2025-06-17)
Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset (2025-06-17)