What question did this study set out to answer?

To address the limitations in conventional evolutionary reinforcement learning by introducing a co-evolutionary approach.

March 25, 2026Open Access

Co-Evolutionary Proximal Distilled Evolutionary Reinforcement Learning with Gated Knowledge Transfer

Key Points

To address the limitations in conventional evolutionary reinforcement learning by introducing a co-evolutionary approach.
Propose Co-PDERL for dual-population co-evolution of actors and critics.
Employ phenotype-aware operators for safer evolutionary modifications.
Utilize condition-gated synchronization to enhance knowledge transfer between populations.
Co-PDERL outperforms standard ERL and PDERL on MuJoCo benchmarks.
Improves sample efficiency and asymptotic performance.
Effectively mitigates issues related to single critic dependence.

Abstract

Evolutionary reinforcement learning (ERL) offers a compelling alternative for continuous control by combining the population-level exploration of evolutionary algorithms with the gradient-based exploitation of reinforcement learning. However, applying conventional genetic operators to deep networks can be highly destructive, often inducing abrupt behavioral shifts that erase previously learned skills. Proximal distilled evolutionary reinforcement learning (PDERL) addresses this issue with phenotype-aware operators, leveraging proximal mutation and distillation crossover to produce safer and more constructive variations. Despite these advances, PDERL and many ERL frameworks still exhibit a fundamental evaluation asymmetry: an evolving actor population is guided by a single, centralized critic for fitness evaluation and action filtering. This single-critic dependence creates a bottleneck and a potential single point of failure, where bias or instability in value estimation can misdirect the evolutionary search. To overcome this limitation, we propose co-evolutionary proximal distilled evolutionary reinforcement learning (Co-PDERL), a heterogeneous dual-population framework that co-evolves both actor and critic populations. Co-PDERL extends phenotype-aware evolution to the value-function landscape via a loss-filtered distillation crossover and a Jacobian-based proximal mutation tailored for critics, and employs a condition-gated synchronization mechanism to enable robust bidirectional knowledge transfer between the evolutionary populations and the reinforcement learning agent. Experiments on MuJoCo continuous control benchmarks show that Co-PDERL outperforms competitive baselines on most tasks, including standard ERL and PDERL, improving both sample efficiency and asymptotic performance by effectively alleviating the single-critic bottleneck.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Zhao et al. (Mon,) studied this question.

www.synapsesocial.com/papers/69c37bc2b34aaaeb1a67e89d — DOI: https://doi.org/10.3390/math14061078

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Actor-Critic With Synthesis Loss for Solving Approximation Biases· 2024 · 5 citations
Completely Derandomized Self-Adaptation in Evolution Strategies· 2001 · 4,220 citations
MuJoCo: A physics engine for model-based control· 2012 · 4,395 citations
Deep Reinforcement Learning for Robotics: A Survey of Real-World Successes· 2024 · 95 citations
Enhancing environmental modeling and maximum diffusion reinforcement learning using evolutionary computation for optimal performance

Authors

Ying Zhao

Yi Ding

Yinglong Dai

Journals

Mathematics

Actions

Institutions

Hunan Normal University

University of Aizu

Changsha Normal University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Co-Evolutionary Proximal Distilled Evolutionary Reinforcement Learning with Gated Knowledge Transfer

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion