Publication

Learning search polices from humans in a partially observable context

Related publications (38)

About
Privacy
Disclaimer

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Learning search polices from humans in a partially observable context

Graph Chatbot

Chat with Graph Search

Quantifying the Unknown: Data-Driven Approaches and Applications in Energy Systems

Multi-robot task allocation for safe planning against stochastic hazard dynamics

Ride-hail vehicle routing (RIVER) as a congestion game

Multi-robot task allocation for safe planning against stochastic hazard dynamics

Learning-Augmented Dynamic Power Management with Multiple States via New Ski Rental Bounds

Motivating Innovation: The Effect of Loss Aversion on the Willingness to Persist

Multi-armed Bandits in Action

Robust Adaptive Decision Making: Bayesian Optimization and Beyond

From Infinite to Finite Programs: Explicit Error Bounds with Applications to Approximate Dynamic Programming

Efficient Learning from Comparisons

Quantifying the Unknown: Data-Driven Approaches and Applications in Energy Systems

Multi-robot task allocation for safe planning against stochastic hazard dynamics

Motivating Innovation: The Effect of Loss Aversion on the Willingness to Persist

Multi-armed Bandits in Action

From Infinite to Finite Programs: Explicit Error Bounds with Applications to Approximate Dynamic Programming

Learning-Augmented Dynamic Power Management with Multiple States via New Ski Rental Bounds

Ride-hail vehicle routing (RIVER) as a congestion game

Multi-robot task allocation for safe planning against stochastic hazard dynamics

Robust Adaptive Decision Making: Bayesian Optimization and Beyond

Efficient Learning from Comparisons