Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/CABiNet

CABiNet

Context Aggregated Bi-lateral Network for Semantic Segmentation

Computer VisionIntroduced 200023 papers

Description

With the increasing demand of autonomous systems, pixelwise semantic segmentation for visual scene understanding needs to be not only accurate but also efficient for potential real-time applications. In this paper, we propose Context Aggregation Network, a dual branch convolutional neural network, with significantly lower computational costs as compared to the state-of-the-art, while maintaining a competitive prediction accuracy. Building upon the existing dual branch architectures for high-speed semantic segmentation, we design a high resolution branch for effective spatial detailing and a context branch with light-weight versions of global aggregation and local distribution blocks, potent to capture both long-range and local contextual dependencies required for accurate semantic segmentation, with low computational overheads. We evaluate our method on two semantic segmentation datasets, namely Cityscapes dataset and UAVid dataset. For Cityscapes test set, our model achieves state-of-the-art results with mIOU of 75.9%, at 76 FPS on an NVIDIA RTX 2080Ti and 8 FPS on a Jetson Xavier NX. With regards to UAVid dataset, our proposed network achieves mIOU score of 63.5% with high execution speed (15 FPS).

Papers Using This Method

A Training-Free Framework for Precise Mobile Manipulation of Small Everyday Objects2025-02-19 From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach2024-12-16 Refined and Segmented Price Sentiment Indices from Survey Comments2024-11-15 Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments2024-09-09 CABINET: Content Relevance based Noise Reduction for Table Question Answering2024-02-02 AccessLens: Auto-detecting Inaccessibility of Everyday Objects2024-01-29 BreastRegNet: A Deep Learning Framework for Registration of Breast Faxitron and Histopathology Images2024-01-18 Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation2024-01-04 USA-Net: Unified Semantic and Affordance Representations for Robot Memory2023-04-24 CabiNet: Scaling Neural Collision Detection for Object Rearrangement with Procedural Scene Generation2023-04-18 Orbit: A Unified Simulation Framework for Interactive Robot Learning Environments2023-01-10 OpenD: A Benchmark for Language-Driven Door and Drawer Opening2022-12-10 Toward an Intelligent Tutoring System for Argument Mining in Legal Texts2022-10-24 Robot Active Neural Sensing and Planning in Unknown Cluttered Environments2022-08-23 MINSU (Mobile Inventory And Scanning Unit):Computer Vision and AI2022-04-14 Estimation of Evaporator Valve Sizes in Supermarket Refrigeration Cabinets2022-02-21 Modeling Long-horizon Tasks as Sequential Interaction Landscapes2020-06-08 Who mentions whom? Recognizing political actors in proceedings2020-05-01 Customizing Pareto Simulated Annealing for Multi-objective Optimization of Control Cabinet Layout2019-06-04 Toward Ergonomic Risk Prediction via Segmentation of Indoor Object Manipulation Actions Using Spatiotemporal Convolutional Networks2019-02-14