Ask any question about EPFL courses, lectures, exercises, research, news, etc., or try the example questions below.
DISCLAIMER: The Graph chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.
While Reinforcement Learning (RL) aims to train an agent from a reward function in a given environment, Inverse Reinforcement Learning (IRL) seeks to recover the reward function from observing an expert’s behavior. It is well known that, in general, variou ...
Using data on international equity portfolio allocations by U.S. mutual funds, we estimate a portfolio expression derived from a standard mean-variance portfolio model extended with portfolio frictions. The optimal portfolio depends on the previous month a ...
This thesis addresses the question of patent value from three different angles. It comprises three papers on patent valuation methods. Patent valuation issues are well known in the world of research and practice. However, the debates over what th ...
We analyze and implement the kernel ridge regression (KR) method developed in Filipovic et al. (Stripping the discount curve-a robust machine learning approach. Swiss Finance Institute Research Paper No. 22-24. SSRN. https://ssrn.com/abstract=4058150, 2022 ...
When humans or animals perform an action that led to a desired outcome, they show a tendency to repeat it. The mechanisms underlying learning from past experience and adapting future behavior are still not fully understood. In this thesis, I study how huma ...
Discount is the difference between the face value of a bond and its present value. We propose an arbitrage-free dynamic framework for discount models, which provides an alternative to the Heath-Jarrow-Morton framework for forward rates. We derive general c ...
Central to global agreement on carbon emissions are strategic interactions amongst regions over abatement policy and the benefits to be shared. These are re-examined in this paper, in which benefits from mitigation stem from a meta-analysis that links carb ...
Reward timing, that is, the delay after which reward is delivered following an action, is known to strongly influence reinforcement learning. Here, we asked if reward timing could also modulate how people learn and consolidate new motor skills. In 60 health ...
We present a non-parametric method to estimate the discount curve from market quotes based on the Moore-Penrose pseudoinverse. The discount curve reproduces the market quotes perfectly, has maximal smoothness, and is given in closed-form. The method is eas ...
We present a nonparametric method to estimate the discount curve from market quotes based on the Moore-Penrose pseudoinverse. The discount curve reproduces the market quotes perfectly, has maximal smoothness, and is given in closed-form. The method is easy ...
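To make the pseudoinverse idea in the last two entries concrete, here is a minimal sketch in Python. It is not the authors' implementation: it uses the plain Euclidean minimum-norm solution on a fixed grid of payment dates rather than the smoothness criterion the abstract mentions, and it measures the deviation from a flat prior of 1.0, which is an assumption made here for illustration. The function names, the cash flows, and the prices are all hypothetical. For a cash-flow matrix C with linearly independent rows, the Moore-Penrose pseudoinverse equals C^T (C C^T)^{-1}, which is what the code evaluates.

```python
def solve_linear(a, b):
    """Solve a x = b for a square matrix a (Gaussian elimination, partial pivoting)."""
    n = len(a)
    # Work on an augmented copy so the inputs are not modified.
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular matrix: cash-flow rows must be linearly independent")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def discount_factors(cash_flows, prices, prior=None):
    """Minimum-norm discount factors that reprice every instrument exactly.

    cash_flows[i][k] is the payment of instrument i on date k (hypothetical layout).
    The result is prior + C^+ (prices - C prior), with C^+ = C^T (C C^T)^{-1}.
    """
    n_dates = len(cash_flows[0])
    if prior is None:
        prior = [1.0] * n_dates  # assumption: flat prior, i.e. zero interest rates
    # Part of each price that the prior curve does not explain.
    residual = [p - sum(c * g for c, g in zip(row, prior))
                for row, p in zip(cash_flows, prices)]
    # Gram matrix C C^T, invertible when the rows of C are independent.
    gram = [[sum(x * y for x, y in zip(ri, rj)) for rj in cash_flows]
            for ri in cash_flows]
    weights = solve_linear(gram, residual)
    # Map the weights back to the date grid: prior + C^T weights.
    return [g + sum(weights[i] * cash_flows[i][k] for i in range(len(cash_flows)))
            for k, g in enumerate(prior)]


# Hypothetical quotes: three bonds paying on four annual dates.
cash_flows = [
    [103.0, 0.0, 0.0, 0.0],
    [4.0, 104.0, 0.0, 0.0],
    [5.0, 5.0, 5.0, 105.0],
]
prices = [100.5, 101.0, 102.0]

curve = discount_factors(cash_flows, prices)
repriced = [sum(c * d for c, d in zip(row, curve)) for row in cash_flows]
```

With fewer instruments than dates the system is underdetermined, so the pseudoinverse picks, among all curves that reproduce the quotes, the one closest to the prior. Replacing the Euclidean norm with a smoothness norm would change which curve is selected, but not the exact-repricing property checked by `repriced`.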