TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/CABiNet

CABiNet

Context Aggregated Bi-lateral Network for Semantic Segmentation

Computer VisionIntroduced 200023 papers

Description

With the increasing demand of autonomous systems, pixelwise semantic segmentation for visual scene understanding needs to be not only accurate but also efficient for potential real-time applications. In this paper, we propose Context Aggregation Network, a dual branch convolutional neural network, with significantly lower computational costs as compared to the state-of-the-art, while maintaining a competitive prediction accuracy. Building upon the existing dual branch architectures for high-speed semantic segmentation, we design a high resolution branch for effective spatial detailing and a context branch with light-weight versions of global aggregation and local distribution blocks, potent to capture both long-range and local contextual dependencies required for accurate semantic segmentation, with low computational overheads. We evaluate our method on two semantic segmentation datasets, namely Cityscapes dataset and UAVid dataset. For Cityscapes test set, our model achieves state-of-the-art results with mIOU of 75.9%, at 76 FPS on an NVIDIA RTX 2080Ti and 8 FPS on a Jetson Xavier NX. With regards to UAVid dataset, our proposed network achieves mIOU score of 63.5% with high execution speed (15 FPS).

Papers Using This Method

A Training-Free Framework for Precise Mobile Manipulation of Small Everyday Objects2025-02-19From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach2024-12-16Refined and Segmented Price Sentiment Indices from Survey Comments2024-11-15Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments2024-09-09CABINET: Content Relevance based Noise Reduction for Table Question Answering2024-02-02AccessLens: Auto-detecting Inaccessibility of Everyday Objects2024-01-29BreastRegNet: A Deep Learning Framework for Registration of Breast Faxitron and Histopathology Images2024-01-18Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation2024-01-04USA-Net: Unified Semantic and Affordance Representations for Robot Memory2023-04-24CabiNet: Scaling Neural Collision Detection for Object Rearrangement with Procedural Scene Generation2023-04-18Orbit: A Unified Simulation Framework for Interactive Robot Learning Environments2023-01-10OpenD: A Benchmark for Language-Driven Door and Drawer Opening2022-12-10Toward an Intelligent Tutoring System for Argument Mining in Legal Texts2022-10-24Robot Active Neural Sensing and Planning in Unknown Cluttered Environments2022-08-23MINSU (Mobile Inventory And Scanning Unit):Computer Vision and AI2022-04-14Estimation of Evaporator Valve Sizes in Supermarket Refrigeration Cabinets2022-02-21Modeling Long-horizon Tasks as Sequential Interaction Landscapes2020-06-08Who mentions whom? Recognizing political actors in proceedings2020-05-01Customizing Pareto Simulated Annealing for Multi-objective Optimization of Control Cabinet Layout2019-06-04Toward Ergonomic Risk Prediction via Segmentation of Indoor Object Manipulation Actions Using Spatiotemporal Convolutional Networks2019-02-14