TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/MuZero Unplugged

MuZero Unplugged

Reported on 27 benchmarks across 3 tasks · 1 paper · 12 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Robots9 results

  • Continuous Controlonfish.swim
    Return· 2021-04-13
    681.6
    SOTA
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • Continuous Controlonmanipulator.insert_peg
    Return· 2021-04-13
    556
    SOTA
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • Continuous Controlonhumanoid.run
    Return· 2021-04-13
    643.1
    SOTA
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • Continuous Controlonmanipulator.insert_ball
    Return· 2021-04-13
    659.2
    SOTA
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • Continuous Controlonwalker.walk
    Return· 2021-04-13
    949.5
    best: 975.46 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • Continuous Controlonwalker.stand
    Return· 2021-04-13
    887.2
    best: 987.79 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • Continuous Controloncheetah.run
    Return· 2021-04-13
    869.9
    best: 914.39 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • Continuous Controloncartpole.swingup
    Return· 2021-04-13
    594.3
    best: 868.87 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • Continuous Controlonfinger.turn_hard
    Return· 2021-04-13
    759
    best: 963.07 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294

Methodology9 results

  • 3Donfish.swim
    Return· 2021-04-13
    681.6
    SOTA
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3Donmanipulator.insert_peg
    Return· 2021-04-13
    556
    SOTA
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3Donhumanoid.run
    Return· 2021-04-13
    643.1
    SOTA
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3Donmanipulator.insert_ball
    Return· 2021-04-13
    659.2
    SOTA
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3Donwalker.walk
    Return· 2021-04-13
    949.5
    best: 975.46 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3Donwalker.stand
    Return· 2021-04-13
    887.2
    best: 987.79 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3Doncheetah.run
    Return· 2021-04-13
    869.9
    best: 914.39 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3Doncartpole.swingup
    Return· 2021-04-13
    594.3
    best: 868.87 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3Donfinger.turn_hard
    Return· 2021-04-13
    759
    best: 963.07 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294

Medical9 results

  • 3D Face Modellingonfish.swim
    Return· 2021-04-13
    681.6
    SOTA
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3D Face Modellingonmanipulator.insert_peg
    Return· 2021-04-13
    556
    SOTA
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3D Face Modellingonhumanoid.run
    Return· 2021-04-13
    643.1
    SOTA
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3D Face Modellingonmanipulator.insert_ball
    Return· 2021-04-13
    659.2
    SOTA
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3D Face Modellingonwalker.walk
    Return· 2021-04-13
    949.5
    best: 975.46 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3D Face Modellingonwalker.stand
    Return· 2021-04-13
    887.2
    best: 987.79 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3D Face Modellingoncheetah.run
    Return· 2021-04-13
    869.9
    best: 914.39 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3D Face Modellingoncartpole.swingup
    Return· 2021-04-13
    594.3
    best: 868.87 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294
  • 3D Face Modellingonfinger.turn_hard
    Return· 2021-04-13
    759
    best: 963.07 (SMuZero)
    Online and Offline Reinforcement Learning by Planning with a Learned ModelarXiv:2104.06294