Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/RT-2-X

RT-2-X

Reported on 8 benchmarks across 1 task · 1 paper · 7 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Robots8 results

Robot ManipulationonSimplerEnv-Google Robot
Variant Aggregation· uses extra data· 2023-07-28
0.661
best: 0.688 (SpatialVLA)
SOTA
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control arXiv:2307.15818
Robot ManipulationonSimplerEnv-Google Robot
Variant Aggregation-Move Near· uses extra data· 2023-07-28
0.792
SOTA
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control arXiv:2307.15818
Robot ManipulationonSimplerEnv-Google Robot
Variant Aggregation-Pick Coke Can· uses extra data· 2023-07-28
0.823
best: 0.907 (SoFar)
SOTA
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control arXiv:2307.15818
Robot ManipulationonSimplerEnv-Google Robot
Visual Matching· uses extra data· 2023-07-28
0.606
best: 0.749 (SoFar)
SOTA
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control arXiv:2307.15818
Robot ManipulationonSimplerEnv-Google Robot
Visual Matching-Move Near· uses extra data· 2023-07-28
0.779
best: 0.917 (SoFar)
SOTA
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control arXiv:2307.15818
Robot ManipulationonSimplerEnv-Google Robot
Visual Matching-Open/Close Drawer· uses extra data· 2023-07-28
0.25
best: 0.227 (Octo-Base)
SOTA
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control arXiv:2307.15818
Robot ManipulationonSimplerEnv-Google Robot
Visual Matching-Pick Coke Can· uses extra data· 2023-07-28
0.787
best: 0.923 (SoFar)
SOTA
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control arXiv:2307.15818
Robot ManipulationonSimplerEnv-Google Robot
Variant Aggregation-Open/Close Drawer· uses extra data· 2023-07-28
0.353
best: 0.011 (Octo-Base)
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control arXiv:2307.15818