What question did this study set out to answer?

The research aims to improve long-range motion planning for autonomous systems using an advanced deep reinforcement learning framework.

April 10, 2026

An Enhanced Deep Reinforcement Learning Approach to Motion Planning with Knowledge Transfer and Online Demonstrations

Key Points

The research aims to improve long-range motion planning for autonomous systems using an advanced deep reinforcement learning framework.
Developed a high-level planning action policy considering low-level control properties.
Implemented a parallel architecture with separate actor and critic neural networks.
Stored only high-level actions in an experience buffer for efficient learning in long-range tasks.
Integrated online demonstrations with global planning algorithms to enhance learning quality.
Enhanced agent performance significantly in long-range trajectory planning tasks.
Mitigated the sparse reward problem through high-level policy design.
Improved training efficiency and navigation optimality using knowledge transfer.

Abstract

Abstract Deep reinforcement learning is now widely applied in motion planning problems for autonomous systems due to its model-free nature and its ability to solve complex control problems through trial and error. However, the success of deep reinforcement learning depends heavily on the exploration policy and the design of the reward function. This dependence makes it challenging to solve long-range planning problems and requires careful reward function design to avoid the sparse reward problem. In this paper, we propose an enhanced deep reinforcement learning framework that learns the high-level planning action policy while considering the low-level control properties and improves training efficiency and navigation optimality. The high-level policy enables the agent to make long-term decisions with a flexible horizon. Using a high-level policy also mitigates the sparse reward problem in long-range planning tasks. By storing only high-level actions and transitions in the experience buffer, the agent can efficiently learn in long-range trajectory planning tasks. The proposed parallel architecture with separate actor and critic neural networks allows for the integration of high-level domain knowledge transfer while maintaining the ability to generate new knowledge tailored to specific problems. Integrating online demonstrations during training using global planning algorithms can significantly enhance the quality of experiences employed during reinforcement learning. Experimental results show that transferring high-level geometry knowledge and applying online error correction through demonstrations can significantly enhance the agent's performance in long-range trajectory planning tasks.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Chuanhui Hu

Yan Jin

Journals

Journal of Computing and Information Science in Engineering

Actions

Institutions

University of California, Los Angeles

Asian Pacific AIDS Intervention Team

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

An Enhanced Deep Reinforcement Learning Approach to Motion Planning with Knowledge Transfer and Online Demonstrations

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study