Publication

Asymptotically Optimal Contextual Bandit Algorithm Using Hierarchical Structures