March 18, 2024 · Open Access

Interpretable Policy Extraction with Neuro-Symbolic Reinforcement Learning

Abstract

This paper presents S-REINFORCE, a novel reinforcement learning (RL) algorithm that uses two types of function approximators, a Neural Network (NN) and a Symbolic Regressor (SR), to produce numerical and symbolic policies, respectively, for dynamic decision-making tasks. The symbolic policy uncovers functional relations between the underlying states and the action probabilities. The symbolic policy is also used, through importance sampling (IS), to improve the rewards received during learning. S-REINFORCE has been validated on several dynamic decision-making problems with low- and high-dimensional action spaces. The results show that, by combining the complementary strengths of the NN and the SR, S-REINFORCE generates policies that are both well-performing and interpretable. This makes S-REINFORCE well suited to real-world applications where transparency and causality play a crucial role.
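The abstract does not spell out how importance sampling enters the update, so the following is only a minimal sketch of one plausible reading: actions are drawn from a behavior policy (standing in for the symbolic policy), and the REINFORCE gradient of a softmax policy (standing in for the NN) is reweighted by the ratio of the two action probabilities. The single-state setting and the names `softmax`, `is_reinforce_grad`, and `expected_is_grad` are assumptions made for illustration and are not taken from the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def is_reinforce_grad(logits, behavior_probs, action, ret):
    """Importance-weighted REINFORCE gradient w.r.t. the logits, for one sample.

    Assumption: `action` was sampled from `behavior_probs` (a stand-in for the
    symbolic policy), while `logits` parameterize the target softmax policy
    (a stand-in for the NN policy). `ret` is the return observed for the action.
    """
    pi = softmax(logits)
    # Importance weight corrects for sampling from the behavior policy.
    weight = pi[action] / behavior_probs[action]
    # For a softmax policy, d log pi(a) / d logit_k = 1[k == a] - pi[k].
    return [
        weight * ret * ((1.0 if k == action else 0.0) - pi[k])
        for k in range(len(logits))
    ]


def expected_is_grad(logits, behavior_probs, returns):
    """Exact expectation of the estimator over actions drawn from the behavior policy."""
    n = len(logits)
    total = [0.0] * n
    for a in range(n):
        g = is_reinforce_grad(logits, behavior_probs, a, returns[a])
        for k in range(n):
            total[k] += behavior_probs[a] * g[k]
    return total
```

Under these assumptions the importance weight cancels the behavior probability in expectation, so `expected_is_grad` gives the same result for any behavior policy with full support; that is the sense in which sampling from a second policy leaves the policy-gradient estimate unbiased.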

Cite this study

Dutta et al. (2024) studied this question.

www.synapsesocial.com/papers/68e7397eb6db6435876b2b3c — DOI: https://doi.org/10.1109/icassp48485.2024.10446037

Also consider

Synapse has enriched 4 closely related papers on similar questions. Consider them for comparative context:

Scaling Laws for Reward Model Overoptimization · 2022 · 36 citations
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play · 2018 · 3,498 citations
Advances in Neural Information Processing Systems 21 · 2009 · 1,859 citations
Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning · 1992 · 557 citations

Authors

Rajdeep Dutta

Qincheng Wang

Ankur Singh

Institutions

Nanyang Technological University

Agency for Science, Technology and Research

Institute for Infocomm Research
