Learning to Guide Online Multi-Contact Receding Horizon Planning

In Receding Horizon Planning (RHP), it is critical that the motion being executed facilitates the completion of the task, e.g. building momentum to overcome large obstacles. This requires a value function to inform the desirability of robot states. However, given the complex dynamics, value functions are often approximated by expensive computation of trajectories in an extended planning horizon. In this work, to achieve online multi-contact Receding Horizon Planning (RHP), we propose to learn an oracle that can predict local objectives (intermediate goals) for a given task based on the current robot state and the environment. Then, we use these local objectives to construct local value functions to guide a short-horizon RHP. To obtain the oracle, we take a supervised learning approach, and we present an incremental training scheme that can improve the prediction accuracy by adding demonstrations on how to recover from failures. We compare our approach against the baseline (long-horizon RHP) for planning centroidal trajectories of humanoid walking on moderate slopes as well as large slopes where static stability cannot be achieved. We validate these trajectories by tracking them via a whole-body inverse dynamics controller in simulation. We show that our approach can achieve online RHP for 95%-98.6% cycles, outperforming the baseline (8%-51.2%).

Learning to Guide Online Multi-Contact Receding Horizon Planning

Graph Chatbot

Chat with Graph Search

Self-Correcting Quadratic Programming-Based Robot Control

Closed-Loop Robotic Cooking of Scrambled Eggs with a Salinity-based ‘Taste’Sensor

Slow-fast Dynamics of Strongly Coupled Adaptive Frequency Oscillators*

Self-Correcting Quadratic Programming-Based Robot Control

Slow-fast Dynamics of Strongly Coupled Adaptive Frequency Oscillators*

Closed-Loop Robotic Cooking of Scrambled Eggs with a Salinity-based ‘Taste’Sensor