TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CURL: Contrastive Unsupervised Representations for Reinfor...

CURL: Contrastive Unsupervised Representations for Reinforcement Learning

Aravind Srinivas, Michael Laskin, Pieter Abbeel

2020-04-08Atari Games 100kReinforcement LearningAtari GamesContinuous ControlContrastive Learningreinforcement-learning
PaperPDFCode(official)CodeCodeCodeCodeCodeCode

Abstract

We present CURL: Contrastive Unsupervised Representations for Reinforcement Learning. CURL extracts high-level features from raw pixels using contrastive learning and performs off-policy control on top of the extracted features. CURL outperforms prior pixel-based methods, both model-based and model-free, on complex tasks in the DeepMind Control Suite and Atari Games showing 1.9x and 1.2x performance gains at the 100K environment and interaction steps benchmarks respectively. On the DeepMind Control Suite, CURL is the first image-based algorithm to nearly match the sample-efficiency of methods that use state-based features. Our code is open-sourced and available at https://github.com/MishaLaskin/curl.

Results

TaskDatasetMetricValueModel
Continuous ControlWalker, walk (DMControl100k)Score403CURL
Continuous ControlCartpole, swingup (DMControl100k)Score582CURL
Continuous ControlCheetah, run (DMControl500k)Score518CURL
Continuous ControlReacher, easy (DMControl500k)Score929CURL
Continuous ControlFinger, spin (DMControl100k)Score767CURL
Continuous ControlCheetah, run (DMControl100k)Score299CURL
Continuous ControlFinger, spin (DMControl500k)Score926CURL
Continuous ControlBall in cup, catch (DMControl500k)Score959CURL
Continuous ControlReacher, easy (DMControl100k)Score538CURL
Continuous ControlWalker, walk (DMControl500k)Score902CURL
Continuous ControlCartpole, swingup (DMControl500k)Score841CURL
Continuous ControlBall in cup, catch (DMControl100k)Score769CURL
Atari GamesAtari 2600 BoxingScore4.8CURL
Atari GamesAtari 2600 Ms. PacmanScore1492.8CURL
Atari GamesAtari 2600 FreewayScore27.9CURL
Atari GamesAtari 2600 PongScore2.1CURL
Atari GamesAtari 2600 KrullScore3833.6CURL
Atari GamesAtari 2600 BreakoutScore18.2CURL
Atari GamesAtari 2600 FrostbiteScore924CURL
Atari GamesAtari 2600 GopherScore801.4CURL
Atari GamesAtari 2600 James BondMedium Human-Normalized Score400.1CURL
Atari GamesAtari 2600 AmidarScore232.3CURL
Atari GamesAtari 2600 Crazy ClimberScore27805.6CURL
Atari GamesAtari 2600 Demon AttackScore834CURL
Atari GamesAtari 2600 Battle ZoneScore11208CURL
Atari GamesAtari 2600 AsterixScore524.3CURL
Atari GamesAtari 2600 Kung-Fu MasterScore14280CURL
Atari GamesAtari 2600 KangarooScore345.3CURL
Atari GamesAtari 2600 AssaultScore543.7CURL
Atari GamesAtari 2600 AlienScore1148.2CURL
Atari GamesAtari 2600 SeaquestScore408CURL
Atari GamesAtari 2600 Chopper CommandScore1198CURL
Atari GamesAtari 2600 HEROScore6235.1CURL
Atari GamesAtari 2600 Bank HeistScore193.7CURL
Atari GamesAtari 2600 Private EyeScore105.2CURL
Atari GamesAtari 2600 Q*BertScore1225.6CURL
Atari GamesAtari 2600 Road RunnerScore6786.7CURL
Atari GamesAtari 2600 Up and DownScore2735.2CURL
Video GamesAtari 2600 BoxingScore4.8CURL
Video GamesAtari 2600 Ms. PacmanScore1492.8CURL
Video GamesAtari 2600 FreewayScore27.9CURL
Video GamesAtari 2600 PongScore2.1CURL
Video GamesAtari 2600 KrullScore3833.6CURL
Video GamesAtari 2600 BreakoutScore18.2CURL
Video GamesAtari 2600 FrostbiteScore924CURL
Video GamesAtari 2600 GopherScore801.4CURL
Video GamesAtari 2600 James BondMedium Human-Normalized Score400.1CURL
Video GamesAtari 2600 AmidarScore232.3CURL
Video GamesAtari 2600 Crazy ClimberScore27805.6CURL
Video GamesAtari 2600 Demon AttackScore834CURL
Video GamesAtari 2600 Battle ZoneScore11208CURL
Video GamesAtari 2600 AsterixScore524.3CURL
Video GamesAtari 2600 Kung-Fu MasterScore14280CURL
Video GamesAtari 2600 KangarooScore345.3CURL
Video GamesAtari 2600 AssaultScore543.7CURL
Video GamesAtari 2600 AlienScore1148.2CURL
Video GamesAtari 2600 SeaquestScore408CURL
Video GamesAtari 2600 Chopper CommandScore1198CURL
Video GamesAtari 2600 HEROScore6235.1CURL
Video GamesAtari 2600 Bank HeistScore193.7CURL
Video GamesAtari 2600 Private EyeScore105.2CURL
Video GamesAtari 2600 Q*BertScore1225.6CURL
Video GamesAtari 2600 Road RunnerScore6786.7CURL
Video GamesAtari 2600 Up and DownScore2735.2CURL
3DWalker, walk (DMControl100k)Score403CURL
3DCartpole, swingup (DMControl100k)Score582CURL
3DCheetah, run (DMControl500k)Score518CURL
3DReacher, easy (DMControl500k)Score929CURL
3DFinger, spin (DMControl100k)Score767CURL
3DCheetah, run (DMControl100k)Score299CURL
3DFinger, spin (DMControl500k)Score926CURL
3DBall in cup, catch (DMControl500k)Score959CURL
3DReacher, easy (DMControl100k)Score538CURL
3DWalker, walk (DMControl500k)Score902CURL
3DCartpole, swingup (DMControl500k)Score841CURL
3DBall in cup, catch (DMControl100k)Score769CURL
3D Face ModellingWalker, walk (DMControl100k)Score403CURL
3D Face ModellingCartpole, swingup (DMControl100k)Score582CURL
3D Face ModellingCheetah, run (DMControl500k)Score518CURL
3D Face ModellingReacher, easy (DMControl500k)Score929CURL
3D Face ModellingFinger, spin (DMControl100k)Score767CURL
3D Face ModellingCheetah, run (DMControl100k)Score299CURL
3D Face ModellingFinger, spin (DMControl500k)Score926CURL
3D Face ModellingBall in cup, catch (DMControl500k)Score959CURL
3D Face ModellingReacher, easy (DMControl100k)Score538CURL
3D Face ModellingWalker, walk (DMControl500k)Score902CURL
3D Face ModellingCartpole, swingup (DMControl500k)Score841CURL
3D Face ModellingBall in cup, catch (DMControl100k)Score769CURL

Related Papers

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17