Science Cast

DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic

Dexter NeoOctober 27, 2023 9:45am

Views (96)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic

arXivPDFOctober 26, 2023 12:00am

Authors

Dexter Neo, Tsuhan Chen

Abstract

We present a novel extension to the family of Soft Actor-Critic (SAC) algorithms. We argue that based on the Maximum Entropy Principle, discrete SAC can be further improved via additional statistical constraints derived from a surrogate critic policy. Furthermore, our findings suggests that these constraints provide an added robustness against potential domain shifts, which are essential for safe deployment of reinforcement learning agents in the real-world. We provide theoretical analysis and show empirical results on low data regimes for both in-distribution and out-of-distribution variants of Atari 2600 games.

TwitterandLinkedIn

0 comments

Add comment

DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic

DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments