Attribution Bias in Philosophical Knowledge Graphs: Corpus Frequency versus Temporal Sourcing
Abstract
Computational knowledge graphs assign philosophical concepts to traditions based on corpus frequency: the school that mentions a concept most becomes its attributed tradition. We argue this conflates three measurements: textual power, historical priority, and philosophical significance, demonstrated using the darshana-graph, a knowledge graph of 28,322 relationships across Hindu, Buddhist, and Jain traditions. Seven of the top 25 concepts by betweenness centrality predate their attributed school by 288 to 2,288 years. Moksha, attributed to Advaita Vedanta, appears first in Jain sources over 1,200 years earlier. The most reliable snapshot, at 300 BCE using only explicitly dated sources, shows a genuinely pluralistic structure: 59% Vedic, 24% Jain, 18% Buddhist. We also quantify a critical distortion in the temporal method: between 300 CE and 800 CE the network grows from 18 to 1,028 nodes, with 97.4% carrying Advaita proxy dates, revealing that apparent dominance reflects textual survival, not philosophical history. Beyond correcting attribution bias, the temporally grounded graph enables structural homology analysis across traditions. Ego-network feature vectors applied to 48 temporally labelled concepts across eight traditions identify cross-tradition concept pairs with high structural similarity. The method recovers known correspondences including purusha-jiva (Samkhya/Jain, sim 0.990) and prakriti-maya (Samkhya/Vedic, sim 0.972), and surfaces novel homologies. Nibbana and samsara score 0.954 despite being doctrinal opposites: both function as the ultimate reference concept in their tradition's soteriology. Cetana (Buddhist intention) and ajiva (Jain non-living matter) score 0.923, a pairing absent from the literature. These are not claims of doctrinal equivalence but of measurable structural homology: different philosophical vocabularies navigating a shared conceptual space.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.