TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/TD3

TD3

Reported on 17 benchmarks across 4 tasks · 2 papers · 2 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Playing Games5 results

  • OpenAI GymonAnt-v4
    Average Return· 2018-02-26
    5942.55
    best: 6586.33 (MEow)
    SOTA
    Addressing Function Approximation Error in Actor-Critic MethodsarXiv:1802.09477
  • OpenAI GymonHopper-v4
    Average Return· 2018-02-26
    3319.98
    best: 3332.99 (MEow)
    SOTA
    Addressing Function Approximation Error in Actor-Critic MethodsarXiv:1802.09477
  • OpenAI GymonHumanoid-v4
    Average Return· 2018-02-26
    198.44
    best: 6923.22 (MEow)
    Addressing Function Approximation Error in Actor-Critic MethodsarXiv:1802.09477
  • OpenAI GymonHalfCheetah-v4
    Average Return· 2018-02-26
    12026.73
    best: 15836.04 (SAC)
    Addressing Function Approximation Error in Actor-Critic MethodsarXiv:1802.09477
  • OpenAI GymonWalker2d-v4
    Average Return· 2018-02-26
    2612.74
    best: 5745.27 (SAC)
    Addressing Function Approximation Error in Actor-Critic MethodsarXiv:1802.09477

Robots4 results

  • Continuous ControlonPyBullet HalfCheetah
    Return· 2020-05-12
    2687
    best: 2883 (SAC)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • Continuous ControlonPyBullet Ant
    Return· 2020-05-12
    2865
    best: 3459 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • Continuous ControlonPyBullet Walker2D
    Return· 2020-05-12
    2106
    best: 2341 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • Continuous ControlonPyBullet Hopper
    Return· 2020-05-12
    2470
    best: 2646 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719

Methodology4 results

  • 3DonPyBullet HalfCheetah
    Return· 2020-05-12
    2687
    best: 2883 (SAC)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • 3DonPyBullet Ant
    Return· 2020-05-12
    2865
    best: 3459 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • 3DonPyBullet Walker2D
    Return· 2020-05-12
    2106
    best: 2341 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • 3DonPyBullet Hopper
    Return· 2020-05-12
    2470
    best: 2646 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719

Medical4 results

  • 3D Face ModellingonPyBullet HalfCheetah
    Return· 2020-05-12
    2687
    best: 2883 (SAC)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • 3D Face ModellingonPyBullet Ant
    Return· 2020-05-12
    2865
    best: 3459 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • 3D Face ModellingonPyBullet Walker2D
    Return· 2020-05-12
    2106
    best: 2341 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719
  • 3D Face ModellingonPyBullet Hopper
    Return· 2020-05-12
    2470
    best: 2646 (SAC gSDE)
    Smooth Exploration for Robotic Reinforcement LearningarXiv:2005.05719