Publications by Christos Dimitrakakis

Ensembles for sequence learning

This thesis explores the application of ensemble methods to sequential learning tasks. The focus is on the development and the critical examination of new methods or novel applications of existing methods, with emphasis on supervised and reinforcement lear ...

EPFL2007

Ensembles for Sequence Learning

Christos Dimitrakakis

This thesis explores the application of ensemble methods to sequential learning tasks. The focus is on the development and the critical examination of new methods or novel applications of existing methods, with emphasis on supervised and reinforcement lear ...

École Polytechnique Fédérale de Lausanne2006

Nearly optimal exploration-exploitation decision thresholds

Christos Dimitrakakis

While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds for the multi-armed bandit problem, one for the infinite horizon discounted ...

2006

Nearly optimal exploration-exploitation decision thresholds

Christos Dimitrakakis

While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds for the multi-armed bandit problem, one for the infinite horizon discounted ...

IDIAP2006

Nearly optimal exploration-exploitation decision thresholds

Christos Dimitrakakis

While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds for the multi-armed bandit problem, one for the infinite horizon discounted ...

2006

Online statistical estimation for vehicle control

Christos Dimitrakakis

This tutorial examines simple physical models of vehicle dynamics and overviews methods for parameter estimation and control. Firstly, techniques for the estimation of parameters that deal with constraints are detailed. Secondly, methods for controlling th ...

IDIAP2006

Boosting word error rates

Samy Bengio, Christos Dimitrakakis

We apply boosting techniques to the problem of word error rate minimisation in speech recognition. This is achieved through a new definition of sample error for boosting and a training procedure for hidden Markov models. For this purpose we define a sample ...

2005

Boosting word error rates

Samy Bengio, Christos Dimitrakakis

We apply boosting techniques to the problem of word error rate minimisation in speech recognition. This is achieved through a new definition of sample error for boosting and a training procedure for hidden Markov models. For this purpose we define a sample ...

2005

Online Policy Adaptation for Ensemble Classifiers

Samy Bengio, Christos Dimitrakakis

Ensemble algorithms can improve the performance of a given learning algorithm through the combination of multiple base classifiers into an ensemble. In this paper we attempt to train and combine the base classifiers using an adaptive policy. This policy is ...

2005

Gradient estimates of return

Samy Bengio, Christos Dimitrakakis

The exploration-exploitation trade-off that arises when one considers simple point estimates of expected returns no longer appears when full distributions are considered. This work develops a simple gradient-based approach for mainting such distributions a ...

IDIAP2005

Christos Dimitrakakis

Graph Chatbot

Chat with Graph Search

Ensembles for sequence learning

Ensembles for Sequence Learning

Nearly optimal exploration-exploitation decision thresholds

Nearly optimal exploration-exploitation decision thresholds

Nearly optimal exploration-exploitation decision thresholds

Online statistical estimation for vehicle control

Boosting word error rates

Boosting word error rates

Online Policy Adaptation for Ensemble Classifiers

Gradient estimates of return

Ensembles for Sequence Learning

Ensembles for sequence learning

Nearly optimal exploration-exploitation decision thresholds

Boosting word error rates

Nearly optimal exploration-exploitation decision thresholds

Online statistical estimation for vehicle control

Online Policy Adaptation for Ensemble Classifiers

Boosting word error rates

Gradient estimates of return

Nearly optimal exploration-exploitation decision thresholds