TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Unified Perception: Efficient Depth-Aware Video Panoptic S...

Unified Perception: Efficient Depth-Aware Video Panoptic Segmentation with Minimal Annotation Costs

Kurt Stolle, Gijs Dubbelman

2023-03-03Panoptic SegmentationVideo Panoptic SegmentationScene UnderstandingDepth-aware Video Panoptic Segmentation
PaperPDF

Abstract

Depth-aware video panoptic segmentation is a promising approach to camera based scene understanding. However, the current state-of-the-art methods require costly video annotations and use a complex training pipeline compared to their image-based equivalents. In this paper, we present a new approach titled Unified Perception that achieves state-of-the-art performance without requiring video-based training. Our method employs a simple two-stage cascaded tracking algorithm that (re)uses object embeddings computed in an image-based network. Experimental results on the Cityscapes-DVPS dataset demonstrate that our method achieves an overall DVPQ of 57.1, surpassing state-of-the-art methods. Furthermore, we show that our tracking strategies are effective for long-term object association on KITTI-STEP, achieving an STQ of 59.1 which exceeded the performance of state-of-the-art methods that employ the same backbone network. Code is available at: https://tue-mps.github.io/unipercept

Results

TaskDatasetMetricValueModel
Semantic SegmentationKITTI-STEPAQ56.4Unified Perception
Semantic SegmentationKITTI-STEPSQ61.9Unified Perception
Semantic SegmentationKITTI-STEPSTQ59.1Unified Perception
10-shot image generationKITTI-STEPAQ56.4Unified Perception
10-shot image generationKITTI-STEPSQ61.9Unified Perception
10-shot image generationKITTI-STEPSTQ59.1Unified Perception
Panoptic SegmentationKITTI-STEPAQ56.4Unified Perception
Panoptic SegmentationKITTI-STEPSQ61.9Unified Perception
Panoptic SegmentationKITTI-STEPSTQ59.1Unified Perception

Related Papers

Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection2025-07-17Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation2025-07-15Tactical Decision for Multi-UGV Confrontation with a Vision-Language Model-Based Commander2025-07-15Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis2025-07-15DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation2025-07-14EmbRACE-3K: Embodied Reasoning and Action in Complex Environments2025-07-14