We propose a diagnostic decomposition for reinforcement learning locomotion controllers, partitioning action determination into External Perception (E), Constraint Enforcement (C), and Policy Residue (R), normalized to E+C+R=1. Applied to the ROM-GRL framework (Liu & Lau, 2025), the decomposition provides a vocabulary for distinguishing sensor dependence, reward overfitting, and autonomous gait mastery. Four experiments with falsification criteria are proposed. No experimental results are presented.
Building similarity graph...
Analyzing shared references across papers
Loading...
Kevin Vaillancourt (Wed,) studied this question.
www.synapsesocial.com/papers/69d896566c1944d70ce07bff — DOI: https://doi.org/10.5281/zenodo.19474027
Kevin Vaillancourt
Building similarity graph...
Analyzing shared references across papers
Loading...