What question did this study set out to answer?

This research aims to optimize robotic surface processing tasks on complex 3D geometries using deep reinforcement learning.

March 25, 2026 · Open Access

Deep Reinforcement Learning Based Parameter Optimization for Processing Curved Surfaces

Key points

Developed a deep reinforcement learning policy that estimates parameters for the inclination and orientation of the polishing tool.
Defined the process trajectory by intermediate surface path points.
Evaluated the model on a variety of unseen basin geometries in simulation.
Compared the method against an iterative sampling-based approach.
Derived two process parameters relevant to polishing from the geometry of the workpiece and the tool in a time-efficient manner.
Trained the policy to avoid collisions between the workpiece and the tool while maximizing the polished surface.
Trained the policy to keep the tool angles as stable as possible along the path.

Abstract

Planning robotic surface processing tasks such as polishing, cleaning, and painting on 3D objects of varying shapes and sizes remains challenging. Traditional heuristic methods often run into computational limits, while supervised approaches such as learning from demonstrations require significant amounts of expert data for each task. Although reinforcement learning has shown success on flat or subtly curved surfaces, process planning on surfaces with complex and diverse shapes remains challenging because of their geometric variability, the high dimensionality of the action space, and the need to adapt tool angles to varying surface curvatures. In our work, we show how two process parameters relevant to a polishing process can be derived from the geometry of the workpiece and the tool in a time-efficient manner. Given a process trajectory defined by intermediate surface path points, we learned a deep reinforcement learning-based policy that estimates parameters related to the inclination and orientation of a polishing tool such that collisions between workpiece and tool are avoided, the polished surface is maximized, and the tool angles are as stable as possible along the path. We further evaluated our method in simulation on various unseen and diverse basin geometries and compared it with an iterative sampling-based method.
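
The abstract names three objectives for the policy (no collisions, maximal polished surface, stable tool angles) but does not state how they are combined. As a rough illustration only, one common way to express such objectives is a weighted per-step reward. The function name `step_reward`, its arguments, and the weights below are all hypothetical and are not taken from the paper.

```python
import math


def step_reward(collides, covered_fraction, angles, prev_angles,
                w_collision=1.0, w_coverage=1.0, w_stability=0.1):
    """Hypothetical per-step reward; not the paper's formulation.

    collides:         True if the tool intersects the workpiece at this path point
    covered_fraction: share of the local surface patch reached by the tool, in [0, 1]
    angles:           (inclination, orientation) chosen at this path point, in radians
    prev_angles:      the same pair at the previous path point
    """
    # Objective 1: penalize any collision between workpiece and tool.
    collision_penalty = w_collision if collides else 0.0
    # Objective 2: reward polished surface area.
    coverage_bonus = w_coverage * covered_fraction
    # Objective 3: penalize changes in the tool angles between path points.
    stability_penalty = w_stability * math.dist(angles, prev_angles)
    return coverage_bonus - collision_penalty - stability_penalty
```

With these assumed weights, a collision-free step that covers half of the local patch without changing the angles scores 0.5, and the same step with a collision scores -0.5. The relative weights would need tuning in any real setup.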
