Xinlong Wang, Tete Xiao, Yuning Jiang, Shuai Shao, Jian Sun, Chunhua Shen
Detecting individual pedestrians in a crowd remains a challenging problem since the pedestrians often gather together and occlude each other in real-world scenarios. In this paper, we first explore how a state-of-the-art pedestrian detector is harmed by crowd occlusion via experimentation, providing insights into the crowd occlusion problem. Then, we propose a novel bounding box regression loss specifically designed for crowd scenes, termed repulsion loss. This loss is driven by two motivations: the attraction by target, and the repulsion by other surrounding objects. The repulsion term prevents the proposal from shifting to surrounding objects thus leading to more crowd-robust localization. Our detector trained by repulsion loss outperforms all the state-of-the-art methods with a significant improvement in occlusion cases.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Autonomous Vehicles | Caltech | Reasonable Miss Rate | 4 | RepLoss + CityPersons dataset |
| Autonomous Vehicles | Caltech | Reasonable Miss Rate | 5 | RepLoss |
| Autonomous Vehicles | CityPersons | Bare MR^-2 | 7.6 | RepLoss |
| Autonomous Vehicles | CityPersons | Heavy MR^-2 | 56.9 | RepLoss |
| Autonomous Vehicles | CityPersons | Partial MR^-2 | 16.8 | RepLoss |
| Autonomous Vehicles | CityPersons | Reasonable MR^-2 | 13.2 | RepLoss |
| Pedestrian Detection | Caltech | Reasonable Miss Rate | 4 | RepLoss + CityPersons dataset |
| Pedestrian Detection | Caltech | Reasonable Miss Rate | 5 | RepLoss |
| Pedestrian Detection | CityPersons | Bare MR^-2 | 7.6 | RepLoss |
| Pedestrian Detection | CityPersons | Heavy MR^-2 | 56.9 | RepLoss |
| Pedestrian Detection | CityPersons | Partial MR^-2 | 16.8 | RepLoss |
| Pedestrian Detection | CityPersons | Reasonable MR^-2 | 13.2 | RepLoss |