Learning from Failed Demonstrations in Unreliable Systems

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

This paper presents a method to teach a robot to play Ping Pong from failed demonstrations in a highly noisy and uncertain setting. To infer useful information from failed demonstrations, we use a MultiDonut Algorithm [7] that minimises the probability of repeating a failed demonstration and generates new attempts similar but not quite the same as the demonstration. We compare human demonstrations against a random strategy and show that human demonstrations provide useful information and hence yield faster learning, especially in higher dimensions. We show that learning from observing failed attempts allows the robot to perform the task more reliably than any individual demonstrator did. We also show how this algorithm adapts to gradual deterioration in the system and increases the chances of success when interacting with an unreliable system.

Learning from Failed Demonstrations in Unreliable Systems

Graph Chatbot

Chat with Graph Search

Technosignatures Longevity and Lindy's Law

Exact Obstacle Avoidance for Robots in Complex and Dynamic Environments Using Local Modulation

Robot Learning using Tensor Networks

Technosignatures Longevity and Lindy's Law

Robot Learning using Tensor Networks

Exact Obstacle Avoidance for Robots in Complex and Dynamic Environments Using Local Modulation