Solving dynamic portfolio selection problems via score-based diffusion models

Abstract

In this paper, we tackle the dynamic mean-variance portfolio selection problem in a model-free manner, based on (generative) diffusion models. We propose using data sampled from the real model P (which is unknown) with limited size to train a generative model Q (from which we can easily and adequately sample). With adaptive training and sampling methods that are tailor-made for time series data, we obtain quantification bounds between P and Q in terms of the adapted Wasserstein metric A W2. Importantly, the proposed adapted sampling method also facilitates conditional sampling. In the second part of this paper, we provide the stability of the mean-variance portfolio optimization problems in A W 2. Then, combined with the error bounds and the stability result, we propose a policy gradient algorithm based on the generative environment, in which our innovative adapted sampling method provides approximate scenario generators. We illustrate the performance of our algorithm on both simulated and real data. For real data, the algorithm based on the generative environment produces portfolios that beat several important baselines, including the Markowitz portfolio, the equal weight (naive) portfolio, and S\&P 500.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…