Video-to-image Affordance Grounding on OPRA

Metric: Top-1 Action Accuracy (higher is better)

LeaderboardDataset
Loading chart...
#ModelTop-1 Action AccuracyExtra DataPaperDateCode
1Afformer (ViTDet-B encoder)52.27NoAffordance Grounding from Demonstration Video to...2023-03-26Code
2Afformer (ResNet-50-FPN encoder)52.14NoAffordance Grounding from Demonstration Video to...2023-03-26Code
3Demo2Vec40.79No---