Health risk modelling by transforming a multi-dimensional unknown distribution to a multi-dimensional Gaussian
Abstract
The traditional approach of health risk modelling with multiple data sources proceeds via regression-based methods assuming a marginal distribution for the outcome variable. The data is collected for N subjects over a J time-period or from J data sources. The response obtained from ith subject is Yi=(Yi1,·s, YiJ). For N subjects we obtain a J dimensional joint distribution for the subjects. In this work we propose a novel approach of transforming any J dimensional joint distribution to that of a J dimensional Gaussian keeping the Shannon entropy constant. This is in stark contrast to the traditional approaches of assuming a marginal distribution for each Yij by treating the Yij's as independent observations. The said transformation is implemented in our computer package called ENTRA.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.