Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
Related publications (105)
This paper proposes an online tree-based Bayesian approach for reinforcement learning. For inference, we employ a generalised context tree model. This defines a distribution on multivariate Gaussian piecewise-linear models, which can be updated in closed f ...
Mathematical modeling and simulation of a head stabilization platform. The stabilization platform is capable of moving on the pitch degree of freedom. The platform is modeled in two different approaches; considering only the mass of the platform ( ...
In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinforcement learning problems. While utility bounds are known to exist for this ...
The goal of this paper is to prove that a safe and efficient energy transfer is possible between an external transducer located on the patient's skin and a device deeply implanted in the abdomen. An ultrasound propagation model based on the Rayleigh-Sommer ...
Institute of Electrical and Electronics Engineers, 2012
We revisit a recently developed iterative learning algorithm that enables systems to learn from a repeated operation with the goal of achieving high tracking performance of a given trajectory. The learning scheme is based on a coarse dynamics model of the ...
Chemical incidents are typically caused by loss of control, resulting in runaway reactions or process deviations in different stages of the production. In the case of fed-batch reactors, the problem generally encountered is the accumulation of heat. This i ...
We propose a diffusion strategy to enable social learning over networks. Individual agents observe signals influenced by the state of the environment. The individual measurements are not sufficient to enable the agents to detect the true state of the envir ...
This paper introduces StarlETH, a compliant quadrupedal robot that is designed to study fast, efficient, and versatile locomotion. The platform is fully actuated with high compliant series elastic actuation, making the system torque controllable and at the ...
We present an operational framework for the calibration of demand models for dynamic traffic simulations, where calibration refers to the estimation of a structurally predefined model's parameters from real data. Our focus is on disaggregate simulators tha ...
With the prosperity of cloud computing, an increasing number of Small and Medium-sized Enterprises (SMEs) move their business to public clouds such as Amazon EC2. To help tenants deploy services in the cloud, researchers either conduct performance evaluati ...