TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Soft Actor-Critic for Discrete Action Settings

Soft Actor-Critic for Discrete Action Settings

Petros Christodoulou

2019-10-16Reinforcement LearningAtari Gamesreinforcement-learning
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCode

Abstract

Soft Actor-Critic is a state-of-the-art reinforcement learning algorithm for continuous action settings that is not applicable to discrete action settings. Many important settings involve discrete actions, however, and so here we derive an alternative version of the Soft Actor-Critic algorithm that is applicable to discrete action settings. We then show that, even without any hyperparameter tuning, it is competitive with the tuned model-free state-of-the-art on a selection of games from the Atari suite.

Results

TaskDatasetMetricValueModel
Atari GamesAtari 2600 Ms. PacmanScore690.9SAC
Atari GamesAtari 2600 FreewayScore4.4SAC
Atari GamesAtari 2600 PongScore-20.98SAC
Atari GamesAtari 2600 EnduroScore0.8SAC
Atari GamesAtari 2600 BreakoutScore0.7SAC
Atari GamesAtari 2600 FrostbiteScore59.4SAC
Atari GamesAtari 2600 Space InvadersScore160.8SAC
Atari GamesAtari 2600 James BondScore68.3SAC
Atari GamesAtari 2600 AmidarScore7.9SAC
Atari GamesAtari 2600 Crazy ClimberScore3668.7SAC
Atari GamesAtari 2600 Battle ZoneScore4386.7SAC
Atari GamesAtari 2600 Beam RiderScore432.1SAC
Atari GamesAtari 2600 AsterixScore272SAC
Atari GamesAtari 2600 KangarooScore29.3SAC
Atari GamesAtari 2600 AssaultScore350SAC
Atari GamesAtari 2600 AlienScore216.9SAC
Atari GamesAtari 2600 SeaquestScore211.6SAC
Atari GamesAtari 2600 Q*BertScore280.5SAC
Atari GamesAtari 2600 Road RunnerScore305.3SAC
Atari GamesAtari 2600 Up and DownScore250.7SAC
Video GamesAtari 2600 Ms. PacmanScore690.9SAC
Video GamesAtari 2600 FreewayScore4.4SAC
Video GamesAtari 2600 PongScore-20.98SAC
Video GamesAtari 2600 EnduroScore0.8SAC
Video GamesAtari 2600 BreakoutScore0.7SAC
Video GamesAtari 2600 FrostbiteScore59.4SAC
Video GamesAtari 2600 Space InvadersScore160.8SAC
Video GamesAtari 2600 James BondScore68.3SAC
Video GamesAtari 2600 AmidarScore7.9SAC
Video GamesAtari 2600 Crazy ClimberScore3668.7SAC
Video GamesAtari 2600 Battle ZoneScore4386.7SAC
Video GamesAtari 2600 Beam RiderScore432.1SAC
Video GamesAtari 2600 AsterixScore272SAC
Video GamesAtari 2600 KangarooScore29.3SAC
Video GamesAtari 2600 AssaultScore350SAC
Video GamesAtari 2600 AlienScore216.9SAC
Video GamesAtari 2600 SeaquestScore211.6SAC
Video GamesAtari 2600 Q*BertScore280.5SAC
Video GamesAtari 2600 Road RunnerScore305.3SAC
Video GamesAtari 2600 Up and DownScore250.7SAC

Related Papers

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17