TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Simple Copy-Paste is a Strong Data Augmentation Method for...

Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation

Golnaz Ghiasi, Yin Cui, Aravind Srinivas, Rui Qian, Tsung-Yi Lin, Ekin D. Cubuk, Quoc V. Le, Barret Zoph

2020-12-13CVPR 2021 1Image AugmentationData AugmentationSegmentationSemantic SegmentationInstance SegmentationObject Detection
PaperPDFCodeCodeCodeCodeCode(official)

Abstract

Building instance segmentation models that are data-efficient and can handle rare object categories is an important challenge in computer vision. Leveraging data augmentations is a promising direction towards addressing this challenge. Here, we perform a systematic study of the Copy-Paste augmentation ([13, 12]) for instance segmentation where we randomly paste objects onto an image. Prior studies on Copy-Paste relied on modeling the surrounding visual context for pasting the objects. However, we find that the simple mechanism of pasting objects randomly is good enough and can provide solid gains on top of strong baselines. Furthermore, we show Copy-Paste is additive with semi-supervised methods that leverage extra data through pseudo labeling (e.g. self-training). On COCO instance segmentation, we achieve 49.1 mask AP and 57.3 box AP, an improvement of +0.6 mask AP and +1.5 box AP over the previous state-of-the-art. We further demonstrate that Copy-Paste can lead to significant improvements on the LVIS benchmark. Our baseline model outperforms the LVIS 2020 Challenge winning entry by +3.6 mask AP on rare categories.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devbox mAP57.3Cascade Eff-B7 NAS-FPN (1280, self-training Copy Paste, single-scale)
Object DetectionCOCO test-devbox mAP54.8Cascade Eff-B7 NAS-FPN (1280)
Object DetectionCOCO minivalbox AP57Cascade Eff-B7 NAS-FPN (1280, self-training Copy Paste, single-scale)
Object DetectionCOCO minivalbox AP54.5Cascade Eff-B7 NAS-FPN (1280)
Object DetectionLVIS v1.0 valbox AP41.6Eff-B7 NAS-FPN (1280, Copy-Paste pre-training))
3DCOCO test-devbox mAP57.3Cascade Eff-B7 NAS-FPN (1280, self-training Copy Paste, single-scale)
3DCOCO test-devbox mAP54.8Cascade Eff-B7 NAS-FPN (1280)
3DCOCO minivalbox AP57Cascade Eff-B7 NAS-FPN (1280, self-training Copy Paste, single-scale)
3DCOCO minivalbox AP54.5Cascade Eff-B7 NAS-FPN (1280)
3DLVIS v1.0 valbox AP41.6Eff-B7 NAS-FPN (1280, Copy-Paste pre-training))
Instance SegmentationCOCO minivalmask AP48.9Cascade Eff-B7 NAS-FPN (1280, self-training Copy Paste, single-scale)
Instance SegmentationCOCO minivalmask AP46.8Cascade Eff-B7 NAS-FPN (1280)
Instance SegmentationCOCO test-devmask AP49.1Cascade Eff-B7 NAS-FPN (1280, self-training Copy Paste, single-scale)
Instance SegmentationCOCO test-devmask AP46.9Cascade Eff-B7 NAS-FPN (1280)
Instance SegmentationLVIS v1.0 valmask AP38.1Eff-B7 NAS-FPN (1280, Copy-Paste pre-training))
2D ClassificationCOCO test-devbox mAP57.3Cascade Eff-B7 NAS-FPN (1280, self-training Copy Paste, single-scale)
2D ClassificationCOCO test-devbox mAP54.8Cascade Eff-B7 NAS-FPN (1280)
2D ClassificationCOCO minivalbox AP57Cascade Eff-B7 NAS-FPN (1280, self-training Copy Paste, single-scale)
2D ClassificationCOCO minivalbox AP54.5Cascade Eff-B7 NAS-FPN (1280)
2D ClassificationLVIS v1.0 valbox AP41.6Eff-B7 NAS-FPN (1280, Copy-Paste pre-training))
2D Object DetectionCOCO test-devbox mAP57.3Cascade Eff-B7 NAS-FPN (1280, self-training Copy Paste, single-scale)
2D Object DetectionCOCO test-devbox mAP54.8Cascade Eff-B7 NAS-FPN (1280)
2D Object DetectionCOCO minivalbox AP57Cascade Eff-B7 NAS-FPN (1280, self-training Copy Paste, single-scale)
2D Object DetectionCOCO minivalbox AP54.5Cascade Eff-B7 NAS-FPN (1280)
2D Object DetectionLVIS v1.0 valbox AP41.6Eff-B7 NAS-FPN (1280, Copy-Paste pre-training))
16kCOCO test-devbox mAP57.3Cascade Eff-B7 NAS-FPN (1280, self-training Copy Paste, single-scale)
16kCOCO test-devbox mAP54.8Cascade Eff-B7 NAS-FPN (1280)
16kCOCO minivalbox AP57Cascade Eff-B7 NAS-FPN (1280, self-training Copy Paste, single-scale)
16kCOCO minivalbox AP54.5Cascade Eff-B7 NAS-FPN (1280)
16kLVIS v1.0 valbox AP41.6Eff-B7 NAS-FPN (1280, Copy-Paste pre-training))

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17