Volkan Cevher, Jonathan Mark Scarlett, Ilija Bogunovic
In this paper, we consider the problem of sequentially optimizing a black-box function f based on noisy samples and bandit feedback. We assume that f is smooth in the sense of having a bounded norm in some reproducing kernel Hilbert space (RKHS), yield ...
2017