Learning to Guide Online Multi-Contact Receding Horizon Planning

In Receding Horizon Planning (RHP), it is critical that the motion being executed facilitates the completion of the task, e.g. by building momentum to overcome large obstacles. This requires a value function that indicates the desirability of robot states. However, because of the complex dynamics, value functions are often approximated by computing trajectories over an extended planning horizon, which is expensive. In this work, to achieve online multi-contact RHP, we propose to learn an oracle that predicts local objectives (intermediate goals) for a given task from the current robot state and the environment. We then use these local objectives to construct local value functions that guide a short-horizon RHP. To obtain the oracle, we take a supervised learning approach and present an incremental training scheme that improves prediction accuracy by adding demonstrations of how to recover from failures. We compare our approach against the baseline (long-horizon RHP) for planning centroidal trajectories of humanoid walking on moderate slopes as well as on large slopes where static stability cannot be achieved. We validate these trajectories by tracking them with a whole-body inverse dynamics controller in simulation. We show that our approach achieves online RHP for 95%-98.6% of cycles, outperforming the baseline (8%-51.2%).
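
The loop described above (query an oracle for a local objective, build a local value function from it, plan over a short horizon, execute the first action, repeat) can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a 1D point mass instead of centroidal humanoid dynamics, a hand-written rule in place of the learned oracle, a quadratic local value function, and brute-force enumeration in place of trajectory optimization. All names and constants (`oracle`, `local_value`, `plan_short_horizon`, `DT`, `CONTROLS`, `HORIZON`) are hypothetical.

```python
from itertools import product

DT = 0.1                      # integration step (assumed)
CONTROLS = (-1.0, 0.0, 1.0)   # discrete accelerations (assumed)
HORIZON = 3                   # short planning horizon, in steps (assumed)


def step(state, u):
    """Advance a 1D point mass (position, velocity) by one explicit Euler step."""
    pos, vel = state
    return (pos + vel * DT, vel + u * DT)


def oracle(state, goal_pos):
    """Stand-in for the learned oracle: returns a local objective (pos, vel).

    The paper learns this mapping from demonstrations; this placeholder
    simply puts an intermediate goal at most 0.5 ahead of the current position.
    """
    pos, _ = state
    delta = max(-0.5, min(0.5, goal_pos - pos))
    return (pos + delta, 0.0)


def local_value(state, objective):
    """Quadratic distance to the local objective, used as the terminal cost."""
    return (state[0] - objective[0]) ** 2 + 0.1 * (state[1] - objective[1]) ** 2


def plan_short_horizon(state, objective):
    """Enumerate all control sequences over the short horizon; return the cheapest."""
    best_cost, best_seq = float("inf"), None
    for seq in product(CONTROLS, repeat=HORIZON):
        s, effort = state, 0.0
        for u in seq:
            s = step(s, u)
            effort += 1e-3 * u * u          # small control-effort penalty
        cost = effort + local_value(s, objective)
        if cost < best_cost:
            best_cost, best_seq = cost, seq
    return best_seq


def run_rhp(start, goal_pos, cycles=200):
    """Receding-horizon loop: re-plan every cycle, execute only the first control."""
    state = start
    for _ in range(cycles):
        objective = oracle(state, goal_pos)       # local objective for this cycle
        seq = plan_short_horizon(state, objective)
        state = step(state, seq[0])
    return state


if __name__ == "__main__":
    print(run_rhp((0.0, 0.0), 1.0))
```

In this sketch the terminal cost is the only thing that tells the short-horizon planner where the task is heading, which is the role the abstract assigns to the local value function. The long-horizon baseline would instead extend `HORIZON`, at a cost that grows exponentially under this enumeration.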

Online Multicontact Receding Horizon Planning via Value Function Approximation

Exact Obstacle Avoidance for Robots in Complex and Dynamic Environments Using Local Modulation

Memento Mori: Reliable robustness in self-reconfigurable modular robots