Training Efficient Controllers via Analytic Policy Gradient
This thesis studies the automatic design and optimization of high-performing robust controllers for mobile robots using exclusively on-board resources. Due to the often large parameter space and noisy performance metrics, this constitutes an expensive opti ...
The desire to operate chemical processes in a safe and economically optimal way has motivated the development of so-called real-time optimization (RTO) methods [1]. For continuous processes, these methods aim to compute safe and optimal steady-state set ...
This paper reports on the number field sieve computation of a 768-bit prime field discrete logarithm, describes the different parameter optimizations and resulting algorithmic changes compared to the factorization of a 768-bit RSA modulus, and briefly disc ...
Converting a color image to a grayscale image, namely decolorization, is an important process for many real-world applications. Previous methods build contrast loss functions to minimize the contrast differences between the color images and the resultant g ...
A probabilistic interpretation of model predictive control is presented, enabling extensions to multiple coordinate systems. The resulting controller follows a minimal intervention principle, by learning and retrieving movements through the coordination of ...
This paper proposes a Model Predictive Control (MPC) scheme to solve the target estimation and tracking problem. The objective is to derive a feedback law that drives an autonomous robotic vehicle to follow a target vehicle using an on-line estimate of the ...
This paper presents a method to verify closed-loop properties of optimization-based controllers for deterministic and stochastic constrained polynomial discrete-time dynamical systems. The closed-loop properties amenable to the proposed technique include g ...
This paper proposes a stability verification method for systems controlled by an early terminated first-order method (e.g., an MPC problem approximately solved by a fixed number of iterations of the fast gradient method). The method is based on the observa ...
Learning motion control as a unified process of designing the reference trajectory and the controller is one of the most challenging problems in robotics. The complexity of the problem prevents most of the existing optimization algorithms from giving satis ...
High-speed applications impose a hard real-time constraint on the solution of a model predictive control (MPC) problem, which generally prevents the computation of the optimal control input. As a result, in most MPC implementations guarantees on feasibilit ...