Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Authors: Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey Levine
Year: 2018
Topic: Reinforcement Learning, Sequential Decisions