TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SaccadeNet: A Fast and Accurate Object Detector

SaccadeNet: A Fast and Accurate Object Detector

Shiyi Lan, Zhou Ren, Yi Wu, Larry S. Davis, Gang Hua

2020-03-26CVPR 2020 6Scene Understandingobject-detectionObject Detection
PaperPDFCode

Abstract

Object detection is an essential step towards holistic scene understanding. Most existing object detection algorithms attend to certain object areas once and then predict the object locations. However, neuroscientists have revealed that humans do not look at the scene in fixed steadiness. Instead, human eyes move around, locating informative parts to understand the object location. This active perceiving movement process is called \textit{saccade}. %In this paper, Inspired by such mechanism, we propose a fast and accurate object detector called \textit{SaccadeNet}. It contains four main modules, the \cenam, the \coram, the \atm, and the \aggatt, which allows it to attend to different informative object keypoints, and predict object locations from coarse to fine. The \coram~is used only during training to extract more informative corner features which brings free-lunch performance boost. On the MS COCO dataset, we achieve the performance of 40.4\% mAP at 28 FPS and 30.5\% mAP at 118 FPS. Among all the real-time object detectors, %that can run faster than 25 FPS, our SaccadeNet achieves the best detection performance, which demonstrates the effectiveness of the proposed detection mechanism.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devAP5055.6SaccadeNet (DLA-34-DCN)
Object DetectionCOCO test-devAP7541.4SaccadeNet (DLA-34-DCN)
Object DetectionCOCO test-devAPL50.6SaccadeNet (DLA-34-DCN)
Object DetectionCOCO test-devAPM42.1SaccadeNet (DLA-34-DCN)
Object DetectionCOCO test-devAPS19.2SaccadeNet (DLA-34-DCN)
Object DetectionCOCO test-devbox mAP38.5SaccadeNet (DLA-34-DCN)
3DCOCO test-devAP5055.6SaccadeNet (DLA-34-DCN)
3DCOCO test-devAP7541.4SaccadeNet (DLA-34-DCN)
3DCOCO test-devAPL50.6SaccadeNet (DLA-34-DCN)
3DCOCO test-devAPM42.1SaccadeNet (DLA-34-DCN)
3DCOCO test-devAPS19.2SaccadeNet (DLA-34-DCN)
3DCOCO test-devbox mAP38.5SaccadeNet (DLA-34-DCN)
2D ClassificationCOCO test-devAP5055.6SaccadeNet (DLA-34-DCN)
2D ClassificationCOCO test-devAP7541.4SaccadeNet (DLA-34-DCN)
2D ClassificationCOCO test-devAPL50.6SaccadeNet (DLA-34-DCN)
2D ClassificationCOCO test-devAPM42.1SaccadeNet (DLA-34-DCN)
2D ClassificationCOCO test-devAPS19.2SaccadeNet (DLA-34-DCN)
2D ClassificationCOCO test-devbox mAP38.5SaccadeNet (DLA-34-DCN)
2D Object DetectionCOCO test-devAP5055.6SaccadeNet (DLA-34-DCN)
2D Object DetectionCOCO test-devAP7541.4SaccadeNet (DLA-34-DCN)
2D Object DetectionCOCO test-devAPL50.6SaccadeNet (DLA-34-DCN)
2D Object DetectionCOCO test-devAPM42.1SaccadeNet (DLA-34-DCN)
2D Object DetectionCOCO test-devAPS19.2SaccadeNet (DLA-34-DCN)
2D Object DetectionCOCO test-devbox mAP38.5SaccadeNet (DLA-34-DCN)
16kCOCO test-devAP5055.6SaccadeNet (DLA-34-DCN)
16kCOCO test-devAP7541.4SaccadeNet (DLA-34-DCN)
16kCOCO test-devAPL50.6SaccadeNet (DLA-34-DCN)
16kCOCO test-devAPM42.1SaccadeNet (DLA-34-DCN)
16kCOCO test-devAPS19.2SaccadeNet (DLA-34-DCN)
16kCOCO test-devbox mAP38.5SaccadeNet (DLA-34-DCN)

Related Papers

Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection2025-07-17Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16