Relevance of Wasserstein Distance for Path-Dependent Option Pricing
Determine whether minimizing the Wasserstein distance between learned and target payoff distributions is an adequate evaluation or training criterion for distributional reinforcement learning models that price path-dependent options such as arithmetic Asian call options under the risk-neutral measure, specifically with respect to capturing financially critical aspects including tail behavior and extreme quantile accuracy of the conditional payoff distribution.
References
We conclude that minimizing the Wasserstein distance alone may not suffice to evaluate the practical adequacy of the learned value distributions for option pricing, as it does not directly reflect financially critical aspects such as tail behavior or extreme quantile accuracy. Given that no thorough study of this limitation exists to our knowledge, we retain this perspective in our modeling approach and leave a deeper investigation to future research; especially its relevance for path-dependent option pricing remains an open question.