Publications related to Reinforcement learning from human feedback

Fusing Pre-existing Knowledge and Machine Learning for Enhanced Building Thermal Modeling and Control

Buildings play a pivotal role in the ongoing worldwide energy transition, accounting for 30% of the global energy consumption. With traditional engineering solutions reaching their limits to tackle such large-scale problems, data-driven methods and Machine ...

EPFL, 2024

Inverse design of metal-organic frameworks for direct air capture of CO2 via deep reinforcement learning

Berend Smit, Xiaoqi Zhang, Sauradeep Majumdar, Hyunsoo Park

The combination of several interesting characteristics makes metal-organic frameworks (MOFs) a highly sought-after class of nanomaterials for a broad range of applications like gas storage and separation, catalysis, drug delivery, and so on. However, the e ...

Royal Soc Chemistry, 2024

Lessons Learned from Data-Driven Building Control Experiments: Contrasting Gaussian Process-based MPC, Bilevel DeePC, and Deep Reinforcement Learning

Colin Neil Jones, Loris Di Natale, Jicheng Shi, Emilio Maddalena, Yingzhao Lian

This manuscript offers the perspective of experimentalists on a number of modern data-driven techniques: model predictive control relying on Gaussian processes, adaptive data-driven control based on behavioral theory, and deep reinforcement learning. These ...

2023

Safe multi-agent deep reinforcement learning for joint bidding and maintenance scheduling of generation units

Olga Fink, Mina Montazeri

This paper proposes a safe reinforcement learning algorithm for generation bidding decisions and unit maintenance scheduling in a competitive electricity market environment. In this problem, each unit aims to find a bidding strategy that maximizes its reve ...

2023

Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes

Jesus Sanchez Rodriguez

Machine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does n ...

PUBLIC LIBRARY SCIENCE, 2023

DIET Controller: Dynamic Indoor Environment using Deep Reinforcement Learning

Arnab Chatterjee

Heating, Ventilation, and Air Conditioning (HVAC) Systems utilize much energy, accounting for 40% of total building energy use. The temperatures in buildings are commonly held within narrow limits, leading to higher energy use. Measurements from office bui ...

EPFL, 2023

Multi-agent reinforcement learning with graph convolutional neural networks for optimal bidding strategies of generation units in electricity markets

Olga Fink, Mina Montazeri

Finding optimal bidding strategies for generation units in electricity markets would result in higher profit. However, it is a challenging problem due to the system uncertainty which is due to the lack of knowledge of the strategies of other generation uni ...

PERGAMON-ELSEVIER SCIENCE LTD, 2023

Deep Learning for Localized-Haptic Feedback in Tactile Surfaces

Camilo Hernandez Mejia

Touchscreens are nowadays the preferred choice for user interfaces in consumer electronics. Significant technological advances have been made in terms of touch sensing and visual quality. However, the haptic feedback offered by commercial products is still ...

EPFL, 2023

Real-time model calibration with deep reinforcement learning

Olga Fink

The real-time and accurate inference of model parameters is of great importance in many scientific and engineering disciplines that use computational models (such as a digital twin) for the analysis and prediction of complex physical processes. However, f ...

2022

A prescriptive Dirichlet power allocation policy with deep reinforcement learning

Olga Fink

Prescribing optimal operation based on the condition of the system, and thereby potentially prolonging its remaining useful lifetime, has tremendous potential in terms of actively managing the availability, maintenance, and costs of complex systems. Reinfo ...

2022