TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/RT-2-X

RT-2-X

Reported on 8 benchmarks across 1 task · 1 paper · 7 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Robots8 results

  • Robot ManipulationonSimplerEnv-Google Robot
    Variant Aggregation· uses extra data· 2023-07-28
    0.661
    best: 0.688 (SpatialVLA)
    SOTA
    RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic ControlarXiv:2307.15818
  • Robot ManipulationonSimplerEnv-Google Robot
    Variant Aggregation-Move Near· uses extra data· 2023-07-28
    0.792
    SOTA
    RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic ControlarXiv:2307.15818
  • Robot ManipulationonSimplerEnv-Google Robot
    Variant Aggregation-Pick Coke Can· uses extra data· 2023-07-28
    0.823
    best: 0.907 (SoFar)
    SOTA
    RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic ControlarXiv:2307.15818
  • Robot ManipulationonSimplerEnv-Google Robot
    Visual Matching· uses extra data· 2023-07-28
    0.606
    best: 0.749 (SoFar)
    SOTA
    RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic ControlarXiv:2307.15818
  • Robot ManipulationonSimplerEnv-Google Robot
    Visual Matching-Move Near· uses extra data· 2023-07-28
    0.779
    best: 0.917 (SoFar)
    SOTA
    RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic ControlarXiv:2307.15818
  • Robot ManipulationonSimplerEnv-Google Robot
    Visual Matching-Open/Close Drawer· uses extra data· 2023-07-28
    0.25
    best: 0.227 (Octo-Base)
    SOTA
    RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic ControlarXiv:2307.15818
  • Robot ManipulationonSimplerEnv-Google Robot
    Visual Matching-Pick Coke Can· uses extra data· 2023-07-28
    0.787
    best: 0.923 (SoFar)
    SOTA
    RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic ControlarXiv:2307.15818
  • Robot ManipulationonSimplerEnv-Google Robot
    Variant Aggregation-Open/Close Drawer· uses extra data· 2023-07-28
    0.353
    best: 0.011 (Octo-Base)
    RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic ControlarXiv:2307.15818