Joint MECHATRONICS 2025, ROBOTICS 2025 Paper Abstract

Joint MECHATRONICS 2025, ROBOTICS 2025 Paper Abstract

Paper WeBT4.1

Sharma, Ankit (Indian Institute of Technology Roorkee, Hitachi India Pvt. Ltd.), Chakrabarty, Sohom (Indian Institute of Technology Roorkee)

Curriculum-Based Proximal Policy Optimization for Hovering UAVs: Controlling Slack-To-Taut Dynamics During Suspended Payload Release

Scheduled for presentation during the Regular Session "Aerial and Space Robots" (WeBT4), Wednesday, July 16, 2025, 14:00−14:20, Room 108

Joint 10th IFAC Symposium on Mechatronic Systems and 14th Symposium on Robotics, July 15-18, 2025, Paris, France

This information is tentative and subject to change. Compiled on July 16, 2025

Keywords Mobile robots and vehicles, Robust robot control, Modeling and identification

Abstract

Designing a unified controller for a UAV carrying a suspended payload presents significant challenges, particularly during transitions when the cable shifts between slack and taut states. These transitions generate abrupt jerks and introduce highly nonlinear dynamics. Conventional approaches typically depend on multiple specialized controllers and rely heavily on mathematical modelling, which complicates implementation. To address these issues, we propose a Curriculum-Based Proximal Policy Optimization (C-PPO) controller that employs a single integrated policy capable of managing both slack and taut cable conditions without requiring explicit system modelling. By leveraging model-free reinforcement learning, the controller learns directly from dynamic interactions within a MuJoCo virtual environment, encompassing UAV dynamics and payload movements. We demonstrate theoretically that the C-PPO’s learned policy is both unique and robust, and effective in mitigating instabilities caused by sudden jerks. Empirically, it was also shown that C-PPO maintains stable hovering performance even under previously unseen conditions, including variations in payload mass, rope length, and wind profiles, by conducting experiments within the MuJoCo environment.