Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/MuZero Unplugged

MuZero Unplugged

Reported on 27 benchmarks across 3 tasks · 1 paper · 12 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Robots9 results

Continuous Controlonfish.swim
Return· 2021-04-13
681.6
SOTA
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
Continuous Controlonmanipulator.insert_peg
Return· 2021-04-13
556
SOTA
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
Continuous Controlonhumanoid.run
Return· 2021-04-13
643.1
SOTA
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
Continuous Controlonmanipulator.insert_ball
Return· 2021-04-13
659.2
SOTA
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
Continuous Controlonwalker.walk
Return· 2021-04-13
949.5
best: 975.46 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
Continuous Controlonwalker.stand
Return· 2021-04-13
887.2
best: 987.79 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
Continuous Controloncheetah.run
Return· 2021-04-13
869.9
best: 914.39 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
Continuous Controloncartpole.swingup
Return· 2021-04-13
594.3
best: 868.87 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
Continuous Controlonfinger.turn_hard
Return· 2021-04-13
759
best: 963.07 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294

Methodology9 results

3Donfish.swim
Return· 2021-04-13
681.6
SOTA
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3Donmanipulator.insert_peg
Return· 2021-04-13
556
SOTA
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3Donhumanoid.run
Return· 2021-04-13
643.1
SOTA
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3Donmanipulator.insert_ball
Return· 2021-04-13
659.2
SOTA
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3Donwalker.walk
Return· 2021-04-13
949.5
best: 975.46 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3Donwalker.stand
Return· 2021-04-13
887.2
best: 987.79 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3Doncheetah.run
Return· 2021-04-13
869.9
best: 914.39 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3Doncartpole.swingup
Return· 2021-04-13
594.3
best: 868.87 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3Donfinger.turn_hard
Return· 2021-04-13
759
best: 963.07 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294

Medical9 results

3D Face Modellingonfish.swim
Return· 2021-04-13
681.6
SOTA
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3D Face Modellingonmanipulator.insert_peg
Return· 2021-04-13
556
SOTA
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3D Face Modellingonhumanoid.run
Return· 2021-04-13
643.1
SOTA
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3D Face Modellingonmanipulator.insert_ball
Return· 2021-04-13
659.2
SOTA
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3D Face Modellingonwalker.walk
Return· 2021-04-13
949.5
best: 975.46 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3D Face Modellingonwalker.stand
Return· 2021-04-13
887.2
best: 987.79 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3D Face Modellingoncheetah.run
Return· 2021-04-13
869.9
best: 914.39 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3D Face Modellingoncartpole.swingup
Return· 2021-04-13
594.3
best: 868.87 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294
3D Face Modellingonfinger.turn_hard
Return· 2021-04-13
759
best: 963.07 (SMuZero)
Online and Offline Reinforcement Learning by Planning with a Learned Model arXiv:2104.06294