Training Efficient Controllers via Analytic Policy Gradient
A probabilistic interpretation of model predictive control is presented, enabling extensions to multiple coordinate systems. The resulting controller follows a minimal intervention principle, by learning and retrieving movements through the coordination of ...
The desire to operate chemical processes in a safe and economically optimal way has motivated the development of so-called real-time optimization (RTO) methods [1]. For continuous processes, these methods aim to compute safe and optimal steady-state set ...
This thesis studies the automatic design and optimization of high-performing robust controllers for mobile robots using exclusively on-board resources. Due to the often large parameter space and noisy performance metrics, this constitutes an expensive opti ...
High-speed applications impose a hard real-time constraint on the solution of a model predictive control (MPC) problem, which generally prevents the computation of the optimal control input. As a result, in most MPC implementations guarantees on feasibilit ...
Converting a color image to a grayscale image, namely decolorization, is an important process for many real-world applications. Previous methods build contrast loss functions to minimize the contrast differences between the color images and the resultant g ...
This paper proposes a Model Predictive Control (MPC) scheme to solve the target estimation and tracking problem. The objective is to derive a feedback law that drives an autonomous robotic vehicle to follow a target vehicle using an on-line estimate of the ...
This paper reports on the number field sieve computation of a 768-bit prime field discrete logarithm, describes the different parameter optimizations and resulting algorithmic changes compared to the factorization of a 768-bit RSA modulus, and briefly disc ...
This paper proposes a stability verification method for systems controlled by an early terminated first-order method (e.g., an MPC problem approximately solved by a fixed number of iterations of the fast gradient method). The method is based on the observa ...
Learning motion control as a unified process of designing the reference trajectory and the controller is one of the most challenging problems in robotics. The complexity of the problem prevents most of the existing optimization algorithms from giving satis ...
This paper presents a method to verify closed-loop properties of optimization-based controllers for deterministic and stochastic constrained polynomial discrete-time dynamical systems. The closed-loop properties amenable to the proposed technique include g ...