TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/SAC

SAC

Reported on 60 benchmarks across 9 tasks · 4 papers · 8 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Playing Games45 results

  • OpenAI GymonHumanoid-v4
    Average Return· 2018-01-04
    6211.5
    best: 6923.22 (MEow)
    SOTA
    Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic ActorarXiv:1801.01290
  • OpenAI GymonHalfCheetah-v4
    Average Return· 2018-01-04
    15836.04
    SOTA
    Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic ActorarXiv:1801.01290
  • OpenAI GymonAnt-v4
    Average Return· 2018-01-04
    5208.09
    best: 6586.33 (MEow)
    SOTA
    Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic ActorarXiv:1801.01290
  • OpenAI GymonWalker2d-v4
    Average Return· 2018-01-04
    5745.27
    SOTA
    Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic ActorarXiv:1801.01290
  • OpenAI GymonHopper-v4
    Average Return· 2018-01-04
    2882.56
    best: 3332.99 (MEow)
    SOTA
    Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic ActorarXiv:1801.01290
  • Atari GamesonAtari 2600 Ms. Pacman
    Score· 2019-10-16
    690.9
    best: 243401.1 (MuZero)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Freeway
    Score· 2019-10-16
    4.4
    best: 34 (TRPO-hash)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Pong
    Score· 2019-10-16
    -20.98
    best: 21 (Duel noop)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Enduro
    Score· 2019-10-16
    0.8
    best: 14330 (GDI-I3)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Breakout
    Score· 2019-10-16
    0.7
    best: 864 (GDI-H3(200M frames))
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Frostbite
    Score· 2019-10-16
    59.4
    best: 631378.53 (MuZero)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Space Invaders
    Score· 2019-10-16
    160.8
    best: 154380 (GDI-H3(200M frames))
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 James Bond
    Score· 2019-10-16
    68.3
    best: 620780 (GDI-H3)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Amidar
    Score· 2019-10-16
    7.9
    best: 29660.08 (Agent57)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Crazy Climber
    Score· 2019-10-16
    3668.7
    best: 565909.85 (Agent57)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Battle Zone
    Score· 2019-10-16
    4386.7
    best: 934134.88 (Agent57)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Beam Rider
    Score· 2019-10-16
    432.1
    best: 454993.53 (MuZero)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Asterix
    Score· 2019-10-16
    272
    best: 999999 (GDI-H3)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Kangaroo
    Score· 2019-10-16
    29.3
    best: 24034.16 (Agent57)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Assault
    Score· 2019-10-16
    350
    best: 143972.03 (MuZero)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Alien
    Score· 2019-10-16
    216.9
    best: 741812.63 (MuZero)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Seaquest
    Score· 2019-10-16
    211.6
    best: 1000000 (GDI-H3(200M frames))
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Q*Bert
    Score· 2019-10-16
    280.5
    best: 580328.14 (Agent57)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Road Runner
    Score· 2019-10-16
    305.3
    best: 999999 (GDI-H3)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Atari GamesonAtari 2600 Up and Down
    Score· 2019-10-16
    250.7
    best: 986440 (GDI-I3)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Ms. Pacman
    Score· 2019-10-16
    690.9
    best: 243401.1 (MuZero)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Freeway
    Score· 2019-10-16
    4.4
    best: 34 (TRPO-hash)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Pong
    Score· 2019-10-16
    -20.98
    best: 21 (Duel noop)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Enduro
    Score· 2019-10-16
    0.8
    best: 14330 (GDI-I3)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Breakout
    Score· 2019-10-16
    0.7
    best: 864 (GDI-H3(200M frames))
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Frostbite
    Score· 2019-10-16
    59.4
    best: 631378.53 (MuZero)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Space Invaders
    Score· 2019-10-16
    160.8
    best: 154380 (GDI-H3(200M frames))
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 James Bond
    Score· 2019-10-16
    68.3
    best: 620780 (GDI-H3)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Amidar
    Score· 2019-10-16
    7.9
    best: 29660.08 (Agent57)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Crazy Climber
    Score· 2019-10-16
    3668.7
    best: 565909.85 (Agent57)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Battle Zone
    Score· 2019-10-16
    4386.7
    best: 934134.88 (Agent57)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Beam Rider
    Score· 2019-10-16
    432.1
    best: 454993.53 (MuZero)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Asterix
    Score· 2019-10-16
    272
    best: 999999 (GDI-H3)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Kangaroo
    Score· 2019-10-16
    29.3
    best: 24034.16 (Agent57)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Assault
    Score· 2019-10-16
    350
    best: 143972.03 (MuZero)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Alien
    Score· 2019-10-16
    216.9
    best: 741812.63 (MuZero)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Seaquest
    Score· 2019-10-16
    211.6
    best: 1000000 (GDI-H3(200M frames))
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Q*Bert
    Score· 2019-10-16
    280.5
    best: 580328.14 (Agent57)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Road Runner
    Score· 2019-10-16
    305.3
    best: 999999 (GDI-H3)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207
  • Video GamesonAtari 2600 Up and Down
    Score· 2019-10-16
    250.7
    best: 986440 (GDI-I3)
    Soft Actor-Critic for Discrete Action SettingsarXiv:1910.07207

Medical5 results

  • 3D Face ModellingonPyBullet HalfCheetah
    Return· 2020-05-12
    2883
    SOTA
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • Image GenerationonGTAV-to-Cityscapes Labels
    mIoU· 2021-04-30
    53.8
    best: 77.7 (DCF)
    Self-supervised Augmentation Consistency for Adapting Semantic SegmentationarXiv:2105.00097
  • 3D Face ModellingonPyBullet Ant
    Return· 2020-05-12
    2859
    best: 3459 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • 3D Face ModellingonPyBullet Walker2D
    Return· 2020-05-12
    2215
    best: 2341 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • 3D Face ModellingonPyBullet Hopper
    Return· 2020-05-12
    2477
    best: 2646 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719

Robots4 results

  • Continuous ControlonPyBullet HalfCheetah
    Return· 2020-05-12
    2883
    SOTA
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • Continuous ControlonPyBullet Ant
    Return· 2020-05-12
    2859
    best: 3459 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • Continuous ControlonPyBullet Walker2D
    Return· 2020-05-12
    2215
    best: 2341 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • Continuous ControlonPyBullet Hopper
    Return· 2020-05-12
    2477
    best: 2646 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719

Methodology4 results

  • 3DonPyBullet HalfCheetah
    Return· 2020-05-12
    2883
    SOTA
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • 3DonPyBullet Ant
    Return· 2020-05-12
    2859
    best: 3459 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • 3DonPyBullet Walker2D
    Return· 2020-05-12
    2215
    best: 2341 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • 3DonPyBullet Hopper
    Return· 2020-05-12
    2477
    best: 2646 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719

Computer Vision1 result

  • Image-to-Image TranslationonGTAV-to-Cityscapes Labels
    mIoU· 2021-04-30
    53.8
    best: 77.7 (DCF)
    Self-supervised Augmentation Consistency for Adapting Semantic SegmentationarXiv:2105.00097

Miscellaneous1 result

  • 1 Image, 2*2 StitchingonGTAV-to-Cityscapes Labels
    mIoU· 2021-04-30
    53.8
    best: 77.7 (DCF)
    Self-supervised Augmentation Consistency for Adapting Semantic SegmentationarXiv:2105.00097