Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

2nd Place Solution for Waymo Open Dataset Challenge - Real-time 2D Object Detection

Yueming Zhang, Xiaolin Song, Bing Bai, Tengfei Xing, Chao Liu, Xin Gao, Zhihui Wang, Yawei Wen, Haojin Liao, Guoshan Zhang, Pengfei Xu

2021-06-16 | Autonomous Driving | 2D Object Detection | Object Detection

Abstract

In an autonomous driving system, it is essential to recognize vehicles, pedestrians and cyclists from images. Beyond high prediction accuracy, the real-time requirement brings new challenges for convolutional network models. In this report, we introduce a real-time method to detect 2D objects in images. We aggregate several popular one-stage object detectors and train the models independently with a variety of input strategies, to yield better performance for accurate multi-scale detection of each category, especially for small objects. For model acceleration, we leverage TensorRT to optimize the inference time of our detection pipeline. As shown in the leaderboard, our proposed detection framework ranks 2nd with 75.00% L1 mAP and 69.72% L2 mAP in the real-time 2D detection track of the Waymo Open Dataset Challenges, while achieving a latency of 45.8 ms/frame on an Nvidia Tesla V100 GPU.
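The abstract does not say how the outputs of the aggregated detectors are combined. As a minimal sketch only, assuming the detectors' boxes are pooled and de-duplicated with class-wise greedy non-maximum suppression (a common choice, not confirmed as the authors' method), the aggregation step could look like the following. The `Detection` tuple layout, the `merge_detections` name, and the 0.5 IoU threshold are all hypothetical.

```python
from typing import List, Tuple

# Hypothetical detection format: (x1, y1, x2, y2, score, label).
Detection = Tuple[float, float, float, float, float, str]


def iou(a: Detection, b: Detection) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def merge_detections(per_model: List[List[Detection]],
                     iou_thr: float = 0.5) -> List[Detection]:
    """Pool boxes from all models, then apply class-wise greedy NMS.

    This is an assumed aggregation scheme for illustration, not the
    procedure described in the report.
    """
    pooled = [det for dets in per_model for det in dets]
    pooled.sort(key=lambda d: d[4], reverse=True)  # highest score first
    kept: List[Detection] = []
    for det in pooled:
        # Keep a box unless a higher-scoring kept box of the same class
        # overlaps it by at least iou_thr.
        if all(det[5] != k[5] or iou(det, k) < iou_thr for k in kept):
            kept.append(det)
    return kept


# Toy outputs from two hypothetical detectors on the same frame.
model_a = [(0, 0, 10, 10, 0.9, "vehicle"), (50, 50, 60, 60, 0.6, "pedestrian")]
model_b = [(1, 1, 11, 11, 0.8, "vehicle"), (100, 100, 120, 120, 0.7, "cyclist")]

merged = merge_detections([model_a, model_b])
print([d[5] for d in merged])  # ['vehicle', 'cyclist', 'pedestrian']
```

The two overlapping vehicle boxes collapse into the higher-scoring one, while boxes found by only one detector survive, which is how pooling can help with objects that a single model misses.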

Results

Dataset: Waymo Open Dataset. The same results are listed under each of these task labels: Object Detection, 3D, 2D Classification, 2D Object Detection, 16k.

Model                    AP/L2    Latency, ms
LeapMotor_Det            70.41    6.16
YOLOR_TensorRT (Ours)    69.72    4.58
YOLOR_P6_TRT             69.56    3.74
dereyly_self_ensemble    65.65    6.87
YOLO_v5                  64.14    3.81
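To read the accuracy/latency trade-off in these results, one option is to pick out the entries that no other entry beats on both metrics at once (a Pareto front). The sketch below uses the AP/L2 and latency values exactly as listed above; the `pareto_front` helper is a hypothetical name introduced for illustration.

```python
# (model, AP/L2, latency) copied from the results listed above.
results = [
    ("LeapMotor_Det", 70.41, 6.16),
    ("YOLOR_TensorRT (Ours)", 69.72, 4.58),
    ("YOLOR_P6_TRT", 69.56, 3.74),
    ("dereyly_self_ensemble", 65.65, 6.87),
    ("YOLO_v5", 64.14, 3.81),
]


def pareto_front(rows):
    """Return models not dominated in (higher AP/L2, lower latency)."""
    front = []
    for name, ap, lat in rows:
        dominated = any(
            o_ap >= ap and o_lat <= lat and (o_ap > ap or o_lat < lat)
            for _, o_ap, o_lat in rows
        )
        if not dominated:
            front.append(name)
    return front


print(pareto_front(results))
# ['LeapMotor_Det', 'YOLOR_TensorRT (Ours)', 'YOLOR_P6_TRT']
```

Under this reading, dereyly_self_ensemble is dominated by LeapMotor_Det and YOLO_v5 is dominated by YOLOR_P6_TRT, while the remaining three entries each trade some accuracy for lower latency.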

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving (2025-07-19)
AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework (2025-07-18)
World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving (2025-07-17)
Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models (2025-07-17)
Channel-wise Motion Features for Efficient Motion Segmentation (2025-07-17)
LaViPlan: Language-Guided Visual Path Planning with RLVR (2025-07-17)
A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images (2025-07-17)