Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, Han Hu
The Non-Local Network (NLNet) presents a pioneering approach for capturing long-range dependencies within an image, via aggregating query-specific global context to each query position. However, through a rigorous empirical analysis, we have found that the global contexts modeled by the non-local network are almost the same for different query positions. In this paper, we take advantage of this finding to create a simplified network based on a query-independent formulation, which maintains the accuracy of NLNet but with significantly less computation. We further replace the one-layer transformation function of the non-local block by a two-layer bottleneck, which further reduces the parameter number considerably. The resulting network element, called the global context (GC) block, effectively models global context in a lightweight manner, allowing it to be applied at multiple layers of a backbone network to form a global context network (GCNet). Experiments show that GCNet generally outperforms NLNet on major benchmarks for various recognition tasks. The code and network configurations are available at https://github.com/xvjiarui/GCNet.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Object Detection | COCO test-dev | AP50 | 70.9 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| Object Detection | COCO test-dev | AP75 | 56.9 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| Object Detection | COCO test-dev | box mAP | 52.3 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| Object Detection | COCO minival | AP50 | 70.4 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| Object Detection | COCO minival | AP75 | 56.1 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| Object Detection | COCO minival | box AP | 51.8 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 3D | COCO test-dev | AP50 | 70.9 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 3D | COCO test-dev | AP75 | 56.9 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 3D | COCO test-dev | box mAP | 52.3 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 3D | COCO minival | AP50 | 70.4 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 3D | COCO minival | AP75 | 56.1 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 3D | COCO minival | box AP | 51.8 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| Instance Segmentation | COCO minival | AP50 | 67.9 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| Instance Segmentation | COCO minival | mask AP | 44.7 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| Instance Segmentation | COCO test-dev | AP50 | 68.9 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| Instance Segmentation | COCO test-dev | AP75 | 49.6 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| Instance Segmentation | COCO test-dev | mask AP | 45.4 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 2D Classification | COCO test-dev | AP50 | 70.9 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 2D Classification | COCO test-dev | AP75 | 56.9 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 2D Classification | COCO test-dev | box mAP | 52.3 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 2D Classification | COCO minival | AP50 | 70.4 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 2D Classification | COCO minival | AP75 | 56.1 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 2D Classification | COCO minival | box AP | 51.8 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 2D Object Detection | COCO test-dev | AP50 | 70.9 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 2D Object Detection | COCO test-dev | AP75 | 56.9 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 2D Object Detection | COCO test-dev | box mAP | 52.3 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 2D Object Detection | COCO minival | AP50 | 70.4 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 2D Object Detection | COCO minival | AP75 | 56.1 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 2D Object Detection | COCO minival | box AP | 51.8 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 16k | COCO test-dev | AP50 | 70.9 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 16k | COCO test-dev | AP75 | 56.9 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 16k | COCO test-dev | box mAP | 52.3 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 16k | COCO minival | AP50 | 70.4 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 16k | COCO minival | AP75 | 56.1 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |
| 16k | COCO minival | box AP | 51.8 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) |