TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Affordance Transfer Learning for Human-Object Interaction ...

Affordance Transfer Learning for Human-Object Interaction Detection

Zhi Hou, Baosheng Yu, Yu Qiao, Xiaojiang Peng, DaCheng Tao

2021-04-07CVPR 2021 1Affordance RecognitionHuman-Object Interaction DetectionScene UnderstandingTransfer LearningHuman-Object Interaction Concept DiscoveryAffordance Detection
PaperPDFCode(official)Code(official)

Abstract

Reasoning the human-object interactions (HOI) is essential for deeper scene understanding, while object affordances (or functionalities) are of great importance for human to discover unseen HOIs with novel objects. Inspired by this, we introduce an affordance transfer learning approach to jointly detect HOIs with novel objects and recognize affordances. Specifically, HOI representations can be decoupled into a combination of affordance and object representations, making it possible to compose novel interactions by combining affordance representations and novel object representations from additional images, i.e. transferring the affordance to novel objects. With the proposed affordance transfer learning, the model is also capable of inferring the affordances of novel objects from known affordance representations. The proposed method can thus be used to 1) improve the performance of HOI detection, especially for the HOIs with unseen objects; and 2) infer the affordances of novel objects. Experimental results on two datasets, HICO-DET and HOI-COCO (from V-COCO), demonstrate significant improvements over recent state-of-the-art methods for HOI detection and object affordance detection. Code is available at https://github.com/zhihou7/HOI-CL

Results

TaskDatasetMetricValueModel
Human-Object Interaction DetectionHICO-DET(Unknown Concepts)COCO-Val201736.8ATL
Human-Object Interaction DetectionHICO-DET(Unknown Concepts)HICO42ATL
Human-Object Interaction DetectionHICO-DET(Unknown Concepts)Novel Classes15.64ATL
Human-Object Interaction DetectionHICO-DET(Unknown Concepts)Obj36534.38ATL
Human-Object Interaction DetectionHICO-DETCOCO-Val201752.01ATL
Human-Object Interaction DetectionHICO-DETHICO59.44ATL
Human-Object Interaction DetectionHICO-DETNovel classes15.64ATL
Human-Object Interaction DetectionHICO-DETObject36550.94ATL
Affordance RecognitionHICO-DET(Unknown Concepts)COCO-Val201736.8ATL
Affordance RecognitionHICO-DET(Unknown Concepts)HICO42ATL
Affordance RecognitionHICO-DET(Unknown Concepts)Novel Classes15.64ATL
Affordance RecognitionHICO-DET(Unknown Concepts)Obj36534.38ATL
Affordance RecognitionHICO-DETCOCO-Val201752.01ATL
Affordance RecognitionHICO-DETHICO59.44ATL
Affordance RecognitionHICO-DETNovel classes15.64ATL
Affordance RecognitionHICO-DETObject36550.94ATL
Human-Object Interaction Concept DiscoveryHICO-DETUnknown (AP)24.38Affordance Transfer

Related Papers

RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction2025-07-18Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection2025-07-17Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows2025-07-16Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation2025-07-15Tactical Decision for Multi-UGV Confrontation with a Vision-Language Model-Based Commander2025-07-15