Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/RoboPoint

RoboPoint

Reported on 10 benchmarks across 2 tasks · 1 paper · 8 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing10 results

Visual Question Answering (VQA)on6-DoF SpatialBench
Orientation-rel· 2024-06-15
33.8
best: 54.6 (SoFar)
SOTA
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics arXiv:2406.10721
Visual Question Answering (VQA)on6-DoF SpatialBench
Position-abs· 2024-06-15
30.8
best: 33.8 (SoFar)
SOTA
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics arXiv:2406.10721
Visual Question Answering (VQA)on6-DoF SpatialBench
Position-rel· 2024-06-15
43.8
best: 59.6 (SoFar)
SOTA
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics arXiv:2406.10721
Visual Question Answering (VQA)on6-DoF SpatialBench
Total· 2024-06-15
33.5
best: 43.9 (SoFar)
SOTA
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics arXiv:2406.10721
Visual Question Answeringon6-DoF SpatialBench
Orientation-rel· 2024-06-15
33.8
best: 54.6 (SoFar)
SOTA
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics arXiv:2406.10721
Visual Question Answeringon6-DoF SpatialBench
Position-abs· 2024-06-15
30.8
best: 33.8 (SoFar)
SOTA
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics arXiv:2406.10721
Visual Question Answeringon6-DoF SpatialBench
Position-rel· 2024-06-15
43.8
best: 59.6 (SoFar)
SOTA
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics arXiv:2406.10721
Visual Question Answeringon6-DoF SpatialBench
Total· 2024-06-15
33.5
best: 43.9 (SoFar)
SOTA
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics arXiv:2406.10721
Visual Question Answering (VQA)on6-DoF SpatialBench
Orientation-abs· 2024-06-15
25.8
best: 31.3 (SoFar)
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics arXiv:2406.10721
Visual Question Answeringon6-DoF SpatialBench
Orientation-abs· 2024-06-15
25.8
best: 31.3 (SoFar)
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics arXiv:2406.10721