TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/RoboPoint

RoboPoint

Reported on 10 benchmarks across 2 tasks · 1 paper · 8 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing10 results

  • Visual Question Answering (VQA)on6-DoF SpatialBench
    Orientation-rel· 2024-06-15
    33.8
    best: 54.6 (SoFar)
    SOTA
    RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for RoboticsarXiv:2406.10721
  • Visual Question Answering (VQA)on6-DoF SpatialBench
    Position-abs· 2024-06-15
    30.8
    best: 33.8 (SoFar)
    SOTA
    RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for RoboticsarXiv:2406.10721
  • Visual Question Answering (VQA)on6-DoF SpatialBench
    Position-rel· 2024-06-15
    43.8
    best: 59.6 (SoFar)
    SOTA
    RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for RoboticsarXiv:2406.10721
  • Visual Question Answering (VQA)on6-DoF SpatialBench
    Total· 2024-06-15
    33.5
    best: 43.9 (SoFar)
    SOTA
    RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for RoboticsarXiv:2406.10721
  • Visual Question Answeringon6-DoF SpatialBench
    Orientation-rel· 2024-06-15
    33.8
    best: 54.6 (SoFar)
    SOTA
    RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for RoboticsarXiv:2406.10721
  • Visual Question Answeringon6-DoF SpatialBench
    Position-abs· 2024-06-15
    30.8
    best: 33.8 (SoFar)
    SOTA
    RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for RoboticsarXiv:2406.10721
  • Visual Question Answeringon6-DoF SpatialBench
    Position-rel· 2024-06-15
    43.8
    best: 59.6 (SoFar)
    SOTA
    RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for RoboticsarXiv:2406.10721
  • Visual Question Answeringon6-DoF SpatialBench
    Total· 2024-06-15
    33.5
    best: 43.9 (SoFar)
    SOTA
    RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for RoboticsarXiv:2406.10721
  • Visual Question Answering (VQA)on6-DoF SpatialBench
    Orientation-abs· 2024-06-15
    25.8
    best: 31.3 (SoFar)
    RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for RoboticsarXiv:2406.10721
  • Visual Question Answeringon6-DoF SpatialBench
    Orientation-abs· 2024-06-15
    25.8
    best: 31.3 (SoFar)
    RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for RoboticsarXiv:2406.10721