TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MGTANet: Encoding Sequential LiDAR Points Using Long Short...

MGTANet: Encoding Sequential LiDAR Points Using Long Short-Term Motion-Guided Temporal Attention for 3D Object Detection

Junho Koh, Junhyung Lee, Youngwoo Lee, Jaekyum Kim, Jun Won Choi

2022-12-01object-detection3D Object DetectionObject Detection
PaperPDFCode(official)

Abstract

Most scanning LiDAR sensors generate a sequence of point clouds in real-time. While conventional 3D object detectors use a set of unordered LiDAR points acquired over a fixed time interval, recent studies have revealed that substantial performance improvement can be achieved by exploiting the spatio-temporal context present in a sequence of LiDAR point sets. In this paper, we propose a novel 3D object detection architecture, which can encode LiDAR point cloud sequences acquired by multiple successive scans. The encoding process of the point cloud sequence is performed on two different time scales. We first design a short-term motion-aware voxel encoding that captures the short-term temporal changes of point clouds driven by the motion of objects in each voxel. We also propose long-term motion-guided bird's eye view (BEV) feature enhancement that adaptively aligns and aggregates the BEV feature maps obtained by the short-term voxel encoding by utilizing the dynamic motion context inferred from the sequence of the feature maps. The experiments conducted on the public nuScenes benchmark demonstrate that the proposed 3D object detector offers significant improvements in performance compared to the baseline methods and that it sets a state-of-the-art performance for certain 3D object detection categories. Code is available at https://github.com/HYjhkoh/MGTANet.git

Results

TaskDatasetMetricValueModel
Object DetectionnuScenesNDS0.73MGTANet
Object DetectionnuScenesmAAE0.12MGTANet
Object DetectionnuScenesmAOE0.31MGTANet
Object DetectionnuScenesmAP0.67MGTANet
Object DetectionnuScenesmASE0.23MGTANet
Object DetectionnuScenesmATE0.25MGTANet
Object DetectionnuScenesmAVE0.19MGTANet
3DnuScenesNDS0.73MGTANet
3DnuScenesmAAE0.12MGTANet
3DnuScenesmAOE0.31MGTANet
3DnuScenesmAP0.67MGTANet
3DnuScenesmASE0.23MGTANet
3DnuScenesmATE0.25MGTANet
3DnuScenesmAVE0.19MGTANet
3D Object DetectionnuScenesNDS0.73MGTANet
3D Object DetectionnuScenesmAAE0.12MGTANet
3D Object DetectionnuScenesmAOE0.31MGTANet
3D Object DetectionnuScenesmAP0.67MGTANet
3D Object DetectionnuScenesmASE0.23MGTANet
3D Object DetectionnuScenesmATE0.25MGTANet
3D Object DetectionnuScenesmAVE0.19MGTANet
2D ClassificationnuScenesNDS0.73MGTANet
2D ClassificationnuScenesmAAE0.12MGTANet
2D ClassificationnuScenesmAOE0.31MGTANet
2D ClassificationnuScenesmAP0.67MGTANet
2D ClassificationnuScenesmASE0.23MGTANet
2D ClassificationnuScenesmATE0.25MGTANet
2D ClassificationnuScenesmAVE0.19MGTANet
2D Object DetectionnuScenesNDS0.73MGTANet
2D Object DetectionnuScenesmAAE0.12MGTANet
2D Object DetectionnuScenesmAOE0.31MGTANet
2D Object DetectionnuScenesmAP0.67MGTANet
2D Object DetectionnuScenesmASE0.23MGTANet
2D Object DetectionnuScenesmATE0.25MGTANet
2D Object DetectionnuScenesmAVE0.19MGTANet
16knuScenesNDS0.73MGTANet
16knuScenesmAAE0.12MGTANet
16knuScenesmAOE0.31MGTANet
16knuScenesmAP0.67MGTANet
16knuScenesmASE0.23MGTANet
16knuScenesmATE0.25MGTANet
16knuScenesmAVE0.19MGTANet

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07