Pulse Journal Club Trending Explore Researchers

Download the App

Join discussions, follow papers, and never miss your next session.

Download on theApp Store

© Synapse Social LLC, 2026

Home Explore Journal Club Trending

⌘+K

Planning in entropy-regularized Markov decision processes and games | Synapse

April 21, 2026Open Access

Planning in entropy-regularized Markov decision processes and games

Key Points

Key points are not available for this paper at this time.

Abstract

We propose SmoothCruiser, a new planning algorithm for estimating the value function in entropy-regularized Markov decision processes and two-player games, given a generative model of the environment. SmoothCruiser makes use of the smoothness of the Bellman operator promoted by the regularization to achieve problem-independent sample complexity of order O~ (1/epsilon⁴) for a desired accuracy epsilon, whereas for non-regularized settings there are no known algorithms with guaranteed polynomial sample complexity in the worst case.

Mark Helpful

Bookmark

Relay

View Full Paper

Mark Helpful

Bookmark

Relay

View Full Paper

Cite This Study

Grill et al. (Tue,) studied this question.

synapsesocial.com/papers/6a0f9dd32badbc352afe6f90 https://doi.org/https://doi.org/10.48550/arxiv.2604.19695