Imitation Learning in Discounted Linear MDPs without exploration assumptions
Related publications (34)
In this paper, we propose a method for modeling trajectory patterns with both regional and velocity observations through the probabilistic topic model. By embedding Gaussian models into the discrete topic model framework, our method uses continuous velocit ...
We examine the performance of stochastic-gradient learners over connected networks for global optimization problems involving risk functions that are not necessarily quadratic. We consider two well-studied classes of distributed schemes including consensus ...
We examine the problem of learning a set of parameters from a distributed dataset. We assume the datasets are collected by agents over a distributed ad-hoc network, and that the communication of the actual raw data is prohibitive due to either privacy cons ...
We present an algorithm enabling a humanoid robot to visually learn its body schema, knowing only the number of degrees of freedom in each limb. By “body schema” we mean the joint positions and orientations and thus the kinematic function. The learning is ...
The physical face-to-face classroom still represents the core educational setting in which everyday CSCL practice takes place. However, current classrooms are not limited anymore to books, blackboards and other physical artifacts: laptops, tablets, digital ...
International Society of the Learning Sciences, 2015
Neurological patients with impaired upper limbs often receive arm therapy to restore or relearn lost motor functions. In recent years, robotic devices have been developed to assist the patient during training. In daily life the diversity of movements i ...
Learning analytics (LA) is often considered as a means to improve learning and learning environments by measuring student behaviour, analysing the tracked data and acting upon the results. The use of LA tools implies recording and processing of student act ...
Open-ended learning is a dynamic process based on the continuous analysis of new data, guided by past experience. On one side it is helpful to take advantage of prior knowledge when only little information on a new task is available (transfer learning). On th ...
Teacher orchestration of technology-enhanced learning (TEL) processes plays a major role in students' outcomes, especially in face-to-face classrooms. However, few studies look into the fine-grained details of how such orchestration unfolds, the challenges ...
From e-commerce to social networking sites, recommender systems are gaining more and more interest. They provide connections, news, resources, or products of interest. This paper presents a federated recommender system, which exploits data from different o ...