Entropy Regularization under Bayesian Drift Uncertainty

Abstract

We solve the entropy-regularized mean-variance portfolio problem under Bayesian drift uncertainty. Combining continuous-time Bayesian filtering with stochastic policy optimization, the main finding is negative: the two mechanisms are orthogonal. Posterior dynamics are policy independent, so entropy regularization cannot accelerate learning about the unknown drift. The mean control is identical to the deterministic Bayesian Markowitz feedback, and entropy enters only through policy variance. On the technical side, the optimal policy is Gaussian, the value function is quadratic in wealth, and the belief-dependent coefficients close in exponential form. The framework recovers both parent models as limiting cases.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…