Inferences in a Virtual Community: Demography, User Preferences, and Network Topology
Abstract
This paper presents a computational procedure for extracting demography data, mining patterns of human preferences, and measuring the topology of a virtual network. The network was created from the personal and relationships data of an online Internet-based community, where persons are considered nodes in the network, and relationships between persons are considered edges. A community of Friendster users whose listed hometown is Los Ba\~nos, Laguna was used as a test bed for the methodology. The method was able to provide the following demographic, preferential, and topological results about the test bed: (1) There are more female users (52.34\%) than male (47.66\%); (2) Homophily (i.e., birds-of-a-feather adage) is observed in the preferences of users with respect to age levels, such that they are strongly biased towards being friends with users of a similar age; (3) There is heterophily in gender preference such that friendship among users of the opposite gender occurs more often. (4) It exhibits a small-world characteristic with an average path length of 4.5 (maximum=12) among connected users, shorter than the well-known six degrees of separation~travers69; And (5) The network exhibits a scale-free characteristics with heavily-tailed power-law distribution (with the power λ = -1.02 and R2 = 0.84) suggesting the presence of many users acting as the network hubs. The methodology was successful in providing important data from a virtual community which can be used by several researchers in the fields of statistics, mathematics, physics, social sciences, and computer science.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.