Mean field for Markov Decision Processes: from Discrete to Continuous Optimization
Related publications (43)
Optimization arises naturally when process performance needs improvement. This is often the case in industry because of competition – the product has to be proposed at the lowest possible cost. From the point of view of control, optimization consists in de ...
Many practical chemical engineering processes involve a sequence of distinct transient operations, forming multistage systems in which each stage is described by mixed sets of differential and algebraic equations (DAEs). These models usually involve decisi ...
The mitogen-activated protein kinase (MAPK) cascades are ubiquitous in eukaryotic signal transduction, and these pathways are conserved in cells from yeast to mammals. They relay extracellular stimuli from the plasma membrane to targets in the cytoplasm an ...
Given three or four synchronized videos taken at eye level and from different angles, we show that we can effectively use dynamic programming to accurately follow up to six individuals across thousands of frames in spite of significant occlusions. In addit ...
For the optimization of dynamic systems, it is customary to use measurements to combat the effect of uncertainty. In this context, an approach that consists of tracking the necessary conditions of optimality is gaining in popularity. The approach relies st ...
We provide necessary optimality conditions for a general class of discounted infinite-horizon dynamic optimization problems. As part of the resulting maximum principle we obtain explicit bounds on the adjoint variable, stronger than the transversality cond ...
An analytical methodology for prediction of the platoon arrival profiles and queue length along signalized arterials is proposed. Traffic between successive traffic signals is modeled as a two-step Markov decision process (MDP). Traffic dynamics are modele ...
Some new domain decomposition methods (DDM) based on an optimal control approach are introduced for the coupling of first- and second-order equations on overlapping subdomains. Several cost functionals and control functions are proposed. Uniqueness and existe ...
We consider the problem of a sensor network tracking a moving target that exhibits a Markov model of mobility. The sensor nodes have adjustable power levels and the precision of the measurement of the target location depends on both the relative distance f ...