Visual Compositional Learning for Human-Object Interaction Detection

Zhi Hou, Xiaojiang Peng, Yu Qiao, DaCheng Tao

Published: 2020-07-24 | Venue: ECCV 2020 | Tasks: Affordance Recognition, Human-Object Interaction Detection

Abstract

Human-Object Interaction (HOI) detection aims to localize humans and objects in an image and infer the relationships between them. It is challenging because the enormous number of possible combinations of object and verb types forms a long-tail distribution. We devise a deep Visual Compositional Learning (VCL) framework, a simple yet efficient framework that addresses this problem. VCL first decomposes an HOI representation into object-specific and verb-specific features, and then composes new interaction samples in the feature space by stitching the decomposed features together. Integrating decomposition and composition enables VCL to share object and verb features among different HOI samples and images, and to generate new interaction samples and new types of HOI. This largely alleviates the long-tail distribution problem and benefits low-shot and zero-shot HOI detection. Extensive experiments demonstrate that the proposed VCL effectively improves the generalization of HOI detection on HICO-DET and V-COCO and outperforms recent state-of-the-art methods on HICO-DET. Code is available at https://github.com/zhihou7/VCL.
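
The decompose-then-compose idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation (see the repository linked above for that). It is a toy example in plain Python that assumes each HOI sample has already been decomposed into a verb feature and an object feature. The class `HOISample`, the function `compose`, the `feasible` set of allowed verb-object pairs, and the two-dimensional feature vectors are all hypothetical names and values chosen for illustration.

```python
from dataclasses import dataclass
from itertools import product
from typing import List, Sequence, Set, Tuple


@dataclass
class HOISample:
    """One decomposed human-object interaction (hypothetical structure)."""
    verb: str
    obj: str
    verb_feat: Tuple[float, ...]  # verb-specific feature vector
    obj_feat: Tuple[float, ...]   # object-specific feature vector


def compose(samples: Sequence[HOISample],
            feasible: Set[Tuple[str, str]]) -> List[HOISample]:
    """Stitch the verb feature of one sample to the object feature of another."""
    composed = []
    for a, b in product(samples, repeat=2):
        if a is b:
            continue  # pairing a sample with itself only reproduces the original
        if (a.verb, b.obj) not in feasible:
            continue  # assumption: drop pairs outside the allowed label space
        composed.append(HOISample(a.verb, b.obj, a.verb_feat, b.obj_feat))
    return composed


# Two toy samples with made-up 2-D features.
samples = [
    HOISample("ride", "bicycle", (0.9, 0.1), (0.2, 0.8)),
    HOISample("feed", "horse", (0.3, 0.7), (0.6, 0.4)),
]
# Assumed label space: "ride horse" is allowed, "feed bicycle" is not.
feasible = {("ride", "bicycle"), ("feed", "horse"), ("ride", "horse")}

new_samples = compose(samples, feasible)
for s in new_samples:
    print(s.verb, s.obj, s.verb_feat, s.obj_feat)
```

Here the verb feature of "ride bicycle" is paired with the object feature of "feed horse" to form a "ride horse" sample that appears in neither original sample, which is the kind of feature sharing the abstract credits with easing the long-tail problem. In a real detector the features would come from a deep network and the composed samples would feed an HOI classifier.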

Results

Task | Dataset | Metric | Value | Model
Human-Object Interaction Detection | HICO-DET (Unknown Concepts) | COCO-Val2017 | 28.71 | VCL
Human-Object Interaction Detection | HICO-DET (Unknown Concepts) | HICO | 32.76 | VCL
Human-Object Interaction Detection | HICO-DET (Unknown Concepts) | Novel Classes | 12.05 | VCL
Human-Object Interaction Detection | HICO-DET (Unknown Concepts) | Obj365 | 27.58 | VCL
Human-Object Interaction Detection | HICO-DET | COCO-Val2017 | 36.74 | VCL
Human-Object Interaction Detection | HICO-DET | HICO | 43.15 | VCL
Human-Object Interaction Detection | HICO-DET | Novel classes | 12.05 | VCL
Human-Object Interaction Detection | HICO-DET | Object365 | 35.73 | VCL
Affordance Recognition | HICO-DET (Unknown Concepts) | COCO-Val2017 | 28.71 | VCL
Affordance Recognition | HICO-DET (Unknown Concepts) | HICO | 32.76 | VCL
Affordance Recognition | HICO-DET (Unknown Concepts) | Novel Classes | 12.05 | VCL
Affordance Recognition | HICO-DET (Unknown Concepts) | Obj365 | 27.58 | VCL
Affordance Recognition | HICO-DET | COCO-Val2017 | 36.74 | VCL
Affordance Recognition | HICO-DET | HICO | 43.15 | VCL
Affordance Recognition | HICO-DET | Novel classes | 12.05 | VCL
Affordance Recognition | HICO-DET | Object365 | 35.73 | VCL

Related Papers

RoHOI: Robustness Benchmark for Human-Object Interaction Detection (2025-07-12)
Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection (2025-07-09)
VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions (2025-06-29)
HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions (2025-06-24)
On the Robustness of Human-Object Interaction Detection against Distribution Shift (2025-06-22)
Egocentric Human-Object Interaction Detection: A New Benchmark and Method (2025-06-17)
InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions (2025-06-11)
HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation (2025-06-10)