Learning search behaviour from humans

Aude Billard
2013
Conference paper

Abstract

A common way to account for the partially observable nature of the environments in which robots act is to formulate the problem domain as a Partially Observable Markov Decision Process (POMDP). By having humans demonstrate how to act in such a partially observable context, we can leverage their prior knowledge, experience and intuition, which are difficult to encode directly in a controller, to solve a task formulated as a POMDP. In this work we learn search behaviours from human demonstrators and transfer this knowledge to a robot in a setting where no visual information is available. The task consists of finding a block on a table. This is a non-trivial problem: because no visual information is available, the demonstrator's belief about their state (their position in the environment) has to be inferred. We show that by representing the belief about the human's position in the environment with a particle filter (PF), and by learning a mapping from this belief to the human's end-effector velocities with a Gaussian Mixture Model (GMM), we can model the human's search process. To validate that the search process has been modelled successfully, we compare the different types of search behaviour demonstrated by the humans with those produced by our learned model. We then contrast the performance of this human-inspired search model with that of a greedy controller and show that, like the humans, the learned controller minimises uncertainty and is therefore more robust to false beliefs.
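The two ingredients named in the abstract, a particle-filter belief and a GMM that maps the belief to a velocity command, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a binary contact sensor, a known block position, a belief summarised by a single spread feature, and a scalar speed as output. The function names (`pf_step`, `belief_spread`, `gmr_speed`) and all numeric parameters are hypothetical, and the GMM parameters are hand-set placeholders where a real system would fit them to demonstration data (for example with EM).

```python
import math
import random

def pf_step(particles, velocity, contact, block, rng,
            dt=0.1, motion_noise=0.01, contact_radius=0.05):
    """One predict/update/resample step for a 2D position belief."""
    # Predict: shift each particle by the velocity, plus Gaussian motion noise.
    moved = [(x + velocity[0] * dt + rng.gauss(0.0, motion_noise),
              y + velocity[1] * dt + rng.gauss(0.0, motion_noise))
             for x, y in particles]
    # Update: weight particles by agreement with the binary contact reading
    # (assumed sensor model: 90% correct, 10% wrong).
    weights = []
    for x, y in moved:
        near = math.hypot(x - block[0], y - block[1]) < contact_radius
        weights.append(0.9 if near == contact else 0.1)
    # Resample with replacement in proportion to the weights.
    return rng.choices(moved, weights=weights, k=len(moved))

def belief_spread(particles):
    """Summarise the belief by the RMS distance of particles from their mean."""
    n = len(particles)
    mx = sum(x for x, _ in particles) / n
    my = sum(y for _, y in particles) / n
    return math.sqrt(sum((x - mx) ** 2 + (y - my) ** 2
                         for x, y in particles) / n)

def gmr_speed(spread, components):
    """Conditional mean E[speed | spread] under a joint 2D GMM.

    Each component is (prior, mean_spread, mean_speed, var_spread, cov),
    where cov is the spread-speed covariance of that component.
    """
    resp, preds = [], []
    for prior, mu_s, mu_v, var_s, cov in components:
        # Responsibility: prior times the marginal Gaussian density of the input.
        density = (math.exp(-0.5 * (spread - mu_s) ** 2 / var_s)
                   / math.sqrt(2.0 * math.pi * var_s))
        resp.append(prior * density)
        # Per-component linear regression of speed on spread.
        preds.append(mu_v + cov / var_s * (spread - mu_s))
    total = sum(resp)
    return sum(r * p for r, p in zip(resp, preds)) / total

# Usage: uniform prior over a unit table, one step without contact.
rng = random.Random(0)
particles = [(rng.uniform(0, 1), rng.uniform(0, 1)) for _ in range(500)]
particles = pf_step(particles, (0.1, 0.0), False, (0.5, 0.5), rng)
# Hypothetical GMM parameters, not learned from any data.
components = [(0.5, 0.05, 0.02, 0.001, 0.0), (0.5, 0.4, 0.10, 0.01, 0.0)]
speed = gmr_speed(belief_spread(particles), components)
```

Conditioning the joint GMM on the belief feature (Gaussian mixture regression) keeps the policy multi-modal at the component level while returning a single smooth command. A richer belief summary or a full velocity vector would follow the same pattern with matrix-valued covariances.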

Official source

https://infoscience.epfl.ch/record/195169?ln=fr



Exploration-based model learning with self-attention for risk-sensitive robot control

Product of experts for robot learning from demonstration

Human-Human, Human-Robot and Robot-Robot Interaction While Walking: Data Analysis, Modelling and Control
