Publication

A prescriptive Dirichlet power allocation policy with deep reinforcement learning

Olga Fink
2022
Article

Abstract

Prescribing optimal operation based on the condition of the system, and thereby potentially prolonging its remaining useful lifetime, has tremendous potential for actively managing the availability, maintenance, and costs of complex systems. Reinforcement learning (RL) algorithms are particularly suitable for this type of problem given their learning capabilities. A special case of a prescriptive operation is the power allocation task, which can be considered a sequential allocation problem in which the action space is bounded by a simplex constraint. A general continuous action-space solution to such sequential allocation problems remains an open research question for RL algorithms. In continuous action spaces, the standard Gaussian policy applied in reinforcement learning does not support simplex constraints, while the Gaussian-softmax policy introduces a bias during training. In this work, we propose the Dirichlet policy for continuous allocation tasks and analyze the bias and variance of its policy gradients. We demonstrate that the Dirichlet policy is bias-free and provides significantly faster convergence, better performance, and better robustness to hyperparameter changes than the Gaussian-softmax policy. Moreover, we demonstrate the applicability of the proposed algorithm on a prescriptive operation case, in which we propose the Dirichlet power allocation policy and evaluate its performance in a case study of a set of multiple lithium-ion (Li-I) battery systems. The experimental results demonstrate the potential to prescribe optimal operation, improving the efficiency and sustainability of multi-power source systems.
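To make the simplex constraint concrete, the following is a minimal sketch of how a Dirichlet-distributed action could be sampled and scored. It is not the authors' implementation: the softplus-plus-offset mapping, the names `concentrations`, `sample_dirichlet`, and `dirichlet_log_prob`, and the numeric values for `logits` and `demand` are all illustrative assumptions. The sketch only shows that a Dirichlet sample is a valid allocation by construction (non-negative weights summing to one) and that its log-density, the quantity a policy-gradient method differentiates, has a closed form.

```python
import math
import random


def softplus(x):
    # Numerically stable log(1 + exp(x)).
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def concentrations(logits, offset=1.0):
    # Map unconstrained outputs to concentration parameters alpha_i > offset.
    # The offset of 1.0 is an assumed choice for this sketch.
    return [softplus(z) + offset for z in logits]


def sample_dirichlet(alpha, rng):
    # Independent Gamma(alpha_i, 1) draws, normalised by their sum,
    # follow a Dirichlet(alpha) distribution.
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def dirichlet_log_prob(action, alpha):
    # log p(a | alpha) = lgamma(sum(alpha)) - sum(lgamma(alpha_i))
    #                    + sum((alpha_i - 1) * log(a_i))
    log_norm = math.lgamma(sum(alpha)) - sum(math.lgamma(a) for a in alpha)
    return log_norm + sum((a - 1.0) * math.log(x) for a, x in zip(alpha, action))


rng = random.Random(0)
logits = [0.5, -1.0, 2.0]  # hypothetical policy-network outputs, one per power source
alpha = concentrations(logits)
action = sample_dirichlet(alpha, rng)  # allocation weights on the simplex
demand = 100.0  # hypothetical total load to be split across the sources
power = [demand * w for w in action]
log_prob = dirichlet_log_prob(action, alpha)
```

In this sketch the per-source values in `power` always add up to `demand`, so no projection or clipping step is needed after sampling.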

Official source

https://infoscience.epfl.ch/record/294726?ln=fr



Ontological proximity

Energy engineering

Energy storage


Related publications


Fusing Pre-existing Knowledge and Machine Learning for Enhanced Building Thermal Modeling and Control

Inverse design of metal-organic frameworks for direct air capture of CO2 via deep reinforcement learning

Predicting the long-term collective behaviour of fish pairs with deep learning
