Modeling Website Visits

Abstract

We propose a multivariate model for the number of hits on a set of popular websites, and show that it accurately reflects the behavior recorded in a data set of Internet users in the United States. We assume that the random vector of visits follows a censored multivariate normal distribution whose marginals are transformed to be discrete Pareto IV. Following the ideas of Gaussian graphical models, we enforce sparsity on the inverse covariance matrix, both to reduce dimensionality and to visualize the dependence structure as a graph. The model allows for easy inclusion of covariates and is useful for understanding the behavior of Internet users as a function of their age and gender.
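
To make the construction concrete, the following is a minimal simulation sketch and not the authors' implementation. It draws a latent Gaussian vector from a sparse inverse covariance (precision) matrix, maps each coordinate to a uniform, and pushes it through a Pareto IV quantile function before rounding down to a count. Several choices here are assumptions that the abstract does not specify: the Pareto IV parameterization with location 0, discretization by taking the floor, zero counts arising from that floor rather than from an explicit censoring rule, and every parameter value. The names `pareto4_quantile` and `sample_visits` are hypothetical.

```python
import math
import numpy as np


def pareto4_quantile(u, sigma, gamma, alpha):
    """Quantile of a continuous Pareto IV with location 0 (assumed form).

    Survival function: S(x) = (1 + (x / sigma) ** (1 / gamma)) ** (-alpha), x >= 0.
    """
    return sigma * ((1.0 - u) ** (-1.0 / alpha) - 1.0) ** gamma


def sample_visits(precision, sigma, gamma, alpha, n, rng):
    """Draw n count vectors from a Gaussian copula with discretized Pareto IV marginals."""
    cov = np.linalg.inv(precision)
    d = np.sqrt(np.diag(cov))
    corr = cov / np.outer(d, d)  # rescale so each latent margin is standard normal
    z = rng.multivariate_normal(np.zeros(len(d)), corr, size=n)
    # Standard normal CDF applied elementwise, giving uniform margins.
    u = 0.5 * (1.0 + np.vectorize(math.erf)(z / math.sqrt(2.0)))
    u = np.clip(u, 0.0, 1.0 - 1e-12)  # keep the quantile finite in the far tail
    x = pareto4_quantile(u, sigma, gamma, alpha)
    return np.floor(x).astype(int)  # assumed discretization: round down to a count


# Hypothetical 3-site example. The zero in positions (0, 2) and (2, 0) of the
# precision matrix means there is no edge between sites 0 and 2 in the graph.
precision = np.array([
    [1.0, -0.4, 0.0],
    [-0.4, 1.0, -0.4],
    [0.0, -0.4, 1.0],
])
sigma = np.array([2.0, 1.0, 3.0])  # illustrative scale parameters
gamma = np.array([1.0, 1.2, 0.8])  # illustrative inequality parameters
alpha = np.array([2.0, 1.5, 2.5])  # illustrative shape parameters

rng = np.random.default_rng(0)
visits = sample_visits(precision, sigma, gamma, alpha, n=1000, rng=rng)
```

Each row of `visits` is one simulated user and each column one website. In a Gaussian graphical model, a zero entry in the precision matrix corresponds to conditional independence of the two latent coordinates given the rest, which is why sparsity in that matrix can be read as a graph.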
