TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/UltraEdit: Instruction-based Fine-Grained Image Editing at...

UltraEdit: Instruction-based Fine-Grained Image Editing at Scale

Haozhe Zhao, Xiaojian Ma, Liang Chen, Shuzheng Si, Rujie Wu, Kaikai An, Peiyu Yu, Minjia Zhang, Qing Li, Baobao Chang

2024-07-07Image Editing
PaperPDFCode(official)

Abstract

This paper presents UltraEdit, a large-scale (approximately 4 million editing samples), automatically generated dataset for instruction-based image editing. Our key idea is to address the drawbacks in existing image editing datasets like InstructPix2Pix and MagicBrush, and provide a systematic approach to producing massive and high-quality image editing samples. UltraEdit offers several distinct advantages: 1) It features a broader range of editing instructions by leveraging the creativity of large language models (LLMs) alongside in-context editing examples from human raters; 2) Its data sources are based on real images, including photographs and artworks, which provide greater diversity and reduced bias compared to datasets solely generated by text-to-image models; 3) It also supports region-based editing, enhanced by high-quality, automatically produced region annotations. Our experiments show that canonical diffusion-based editing baselines trained on UltraEdit set new records on MagicBrush and Emu-Edit benchmarks. Our analysis further confirms the crucial role of real image anchors and region-based editing data. The dataset, code, and models can be found in https://ultra-editing.github.io.

Results

TaskDatasetMetricValueModel
Image EditingImgEdit-DataAction2.98UltraEdit
Image EditingImgEdit-DataAdd3.44UltraEdit
Image EditingImgEdit-DataAdjust2.81UltraEdit
Image EditingImgEdit-DataBackground2.83UltraEdit
Image EditingImgEdit-DataExtract2.13UltraEdit
Image EditingImgEdit-DataHybrid1.91UltraEdit
Image EditingImgEdit-DataOverall2.7UltraEdit
Image EditingImgEdit-DataRemove1.45UltraEdit
Image EditingImgEdit-DataReplace2.96UltraEdit
Image EditingImgEdit-DataStyle3.76UltraEdit

Related Papers

NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining2025-07-18UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation2025-06-03ImgEdit: A Unified Image Editing Dataset and Benchmark2025-05-26Emerging Properties in Unified Multimodal Pretraining2025-05-20Step1X-Edit: A Practical Framework for General Image Editing2025-04-24AnyEdit: Edit Any Knowledge Encoded in Language Models2025-02-08In-Context Editing: Learning Knowledge from Self-Induced Distributions2024-06-17MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing2023-06-16