Yuki Kondo, Norimichi Ukita, Takayuki Yamaguchi, Hao-Yu Hou, Mu-Yi Shen, Chia-Chi Hsu, En-Ming Huang, Yu-Chen Huang, Yu-Cheng Xia, Chien-Yao Wang, Chun-Yi Lee, Da Huo, Marc A. Kastner, TingWei Liu, Yasutomo Kawanishi, Takatsugu Hirayama, Takahiro Komamizu, Ichiro Ide, Yosuke Shinya, Xinyao Liu, Guang Liang, Syusuke Yasui
Small Object Detection (SOD) is an important machine vision topic because (i) a variety of real-world applications require object detection for distant objects and (ii) SOD is a challenging task due to the noisy, blurred, and less-informative image appearances of small objects. This paper proposes a new SOD dataset consisting of 39,070 images including 137,121 bird instances, which is called the Small Object Detection for Spotting Birds (SOD4SB) dataset. The detail of the challenge with the SOD4SB dataset is introduced in this paper. In total, 223 participants joined this challenge. This paper briefly introduces the award-winning methods. The dataset, the baseline code, and the website for evaluation on the public testset are publicly available.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Object Detection | SOD4SB Private Test | AP50 | 22.9 | DL method (YOLOv8 + Ensamble) |
| Object Detection | SOD4SB Private Test | AP50 | 22.1 | E2 method (Normalized Gaussian Wasserstein Distance + Switch Hard Augmentation + Multi scale train + Weight Moving Average + CenterNet + VarifocalNet) |
| Object Detection | SOD4SB Public Test | AP50 | 73.1 | DL method (YOLOv8 + Ensamble) |
| Object Detection | SOD4SB Public Test | AP50 | 69.6 | E2 method (Normalized Gaussian Wasserstein Distance + Switch Hard Augmentation + Multi scale train + Weight Moving Average + CenterNet + VarifocalNet) |
| 3D | SOD4SB Private Test | AP50 | 22.9 | DL method (YOLOv8 + Ensamble) |
| 3D | SOD4SB Private Test | AP50 | 22.1 | E2 method (Normalized Gaussian Wasserstein Distance + Switch Hard Augmentation + Multi scale train + Weight Moving Average + CenterNet + VarifocalNet) |
| 3D | SOD4SB Public Test | AP50 | 73.1 | DL method (YOLOv8 + Ensamble) |
| 3D | SOD4SB Public Test | AP50 | 69.6 | E2 method (Normalized Gaussian Wasserstein Distance + Switch Hard Augmentation + Multi scale train + Weight Moving Average + CenterNet + VarifocalNet) |
| Small Object Detection | SOD4SB Private Test | AP50 | 22.9 | DL method (YOLOv8 + Ensamble) |
| Small Object Detection | SOD4SB Private Test | AP50 | 22.1 | E2 method (Normalized Gaussian Wasserstein Distance + Switch Hard Augmentation + Multi scale train + Weight Moving Average + CenterNet + VarifocalNet) |
| Small Object Detection | SOD4SB Public Test | AP50 | 73.1 | DL method (YOLOv8 + Ensamble) |
| Small Object Detection | SOD4SB Public Test | AP50 | 69.6 | E2 method (Normalized Gaussian Wasserstein Distance + Switch Hard Augmentation + Multi scale train + Weight Moving Average + CenterNet + VarifocalNet) |
| 2D Classification | SOD4SB Private Test | AP50 | 22.9 | DL method (YOLOv8 + Ensamble) |
| 2D Classification | SOD4SB Private Test | AP50 | 22.1 | E2 method (Normalized Gaussian Wasserstein Distance + Switch Hard Augmentation + Multi scale train + Weight Moving Average + CenterNet + VarifocalNet) |
| 2D Classification | SOD4SB Public Test | AP50 | 73.1 | DL method (YOLOv8 + Ensamble) |
| 2D Classification | SOD4SB Public Test | AP50 | 69.6 | E2 method (Normalized Gaussian Wasserstein Distance + Switch Hard Augmentation + Multi scale train + Weight Moving Average + CenterNet + VarifocalNet) |
| 2D Object Detection | SOD4SB Private Test | AP50 | 22.9 | DL method (YOLOv8 + Ensamble) |
| 2D Object Detection | SOD4SB Private Test | AP50 | 22.1 | E2 method (Normalized Gaussian Wasserstein Distance + Switch Hard Augmentation + Multi scale train + Weight Moving Average + CenterNet + VarifocalNet) |
| 2D Object Detection | SOD4SB Public Test | AP50 | 73.1 | DL method (YOLOv8 + Ensamble) |
| 2D Object Detection | SOD4SB Public Test | AP50 | 69.6 | E2 method (Normalized Gaussian Wasserstein Distance + Switch Hard Augmentation + Multi scale train + Weight Moving Average + CenterNet + VarifocalNet) |
| 16k | SOD4SB Private Test | AP50 | 22.9 | DL method (YOLOv8 + Ensamble) |
| 16k | SOD4SB Private Test | AP50 | 22.1 | E2 method (Normalized Gaussian Wasserstein Distance + Switch Hard Augmentation + Multi scale train + Weight Moving Average + CenterNet + VarifocalNet) |
| 16k | SOD4SB Public Test | AP50 | 73.1 | DL method (YOLOv8 + Ensamble) |
| 16k | SOD4SB Public Test | AP50 | 69.6 | E2 method (Normalized Gaussian Wasserstein Distance + Switch Hard Augmentation + Multi scale train + Weight Moving Average + CenterNet + VarifocalNet) |