A Computational Framework for Analysis of Dynamic Social Networks Tanya Berger-Wolf University of Illinois at Chicago Joint work with Jared Saia University of New Mexico Zebras Dan Rubenstein, Siva Sandaresan, Ilya Fischhoff (Princeton) Movie credit: “Champions of the Wild”, Omni-Film Productions. Ants Stephen Pratt (Princeton) People – Hidden Groups Baumes et al. (RPI) Context • disease modeling Eubank et.al.‘04, Keeling’99, Kretzschmar&Morris’96 • cultural and information transmission Baumes et.al.’04, Broido&Claffy’01, Carley’96, Chen&Carley’05, Kempe et.al.’03, Tsvetovat et.al.’03,Tyler et.al.’03, Wellman’97 • intelligence and surveillance Airoldi&Malin’04,Baumes et.al.’04, Kolata’05, Malin’04, MagdonIsmail et.al.’03 • business management Bernstein et.al.’02, Carley&Prietula’01, Papadimitriou’97, Papadimitriou&Servan-Schreiber’99 • conservation biology and behavioral ecology Croft et.al.’04, Cross et.al.’05, Lusseau&Newman’04 Social Networks: Static vs Dynamic a b a a c c a b b a Strength or probability of interaction over a period of time c b c c a 1/3 1/3 a b c b 1/3 c b Individuals Advantage of Dynamic Networks: •More accurate information •Time related questions: – How do processes spread through population? – Who are the individuals that change the dynamics of interaction (leaders, interaction facilitators, etc.)? How do they emerge? – How do social structures change with outside circumstances? – What is the average lifespan of a social structure and are there recurring structures? Input – Individual Information b f a e c d Individual Information Input – Problem: Objects within a cluster are closer to each other than to objects in other clusters Input – Pairwaise Information Baumes et al.(RPI) and Washington Post Jan-Dec 2001 2000 Aug-Sep Jan-Apr May-Jul 2001 Pentagon Pennsylvania WTC North WTC South file 4 1 3 9 1 4 4 4 4 8 1 9 8 9 4 4 2 1 t=3 t=4 3 2 t=1 t=2 3 1 1 Theseus’s Paradox • During a twelve month period 95% of all the atoms that make up your 50 trillion cells are replaced • FAA regulations: airplane = left rudder number • Ship of Theseus "The ship wherein Theseus and the youth of Athens returned [from Crete] had thirty oars, and was preserved by the Athenians down even to the time of Demetrius Phalereus, for they took away the old planks as they decayed, putting in new and stronger timber in their place, insomuch that this ship became a standing example among the philosophers, for the logical question of things that grow; one side holding that the ship remained the same, and the other contending that it was not the same." A group persist in time (is a metagroup) if some (big) fraction β of it exists some (big) fraction α of time •A time snapshot is a partition g1t…gmt •Similarity measure sim ( git1 , g jt2 ) 2 git1 g jt2 git1 g jt2 •Metagroup is a path of length ≥ α with edges of weight ≥ β 2/3 2/3 2/3 2/3 2/3 2 1 3 4 10 5 9 8 7 6 Time Time step step = =4 1 seconds second 1 2 3 4 3/4 0 5 2 6 3 7 4 8 5 3/4 0 9 3 10 4 1 5 2 6 3/4 0 3 4 4 5 5 6 6 7 3/4 0 7 5 8 6 9 7 10 8 4 1 9 8/9 3 1 1 t=1 1/2 4 1 4 1 4 8 1 1 8 8/9 9 3 3/4 4 1 4 2 11 2 1/2 1 β=.8 t=3 t=2 β=.5 t=4 Simple Stats: • • • • Metagroup = path length ≥ α Total #metagroups = #paths length ≥ α Maximal metagroup length = max path length Most persistent metagroup = longest path in a DAG • Let x be a member of MG is it appears in it at least γ times. Largest metagroup = dynamic programming on membership set. Group Connectivity Given groups g1,…,gl, are they in the same metagroup? … g1 g2 gl-1 gl Most persistent/largest/loudest/.. metagroup that contains these groups A metagroup that contains largest number of these groups – dynamic programming Individual Connectivity Given individuals S={s1,…,sl}, are they in the same metagroup? • Metagroup that contains max number of individuals in S • Most persistent/largest/shiniest.. metagroup that contains all individuals in S Critical Group Set The smallest set of groups whose absence leaves no metagroups (for given α and β) Formally: remove fewest vertices in a DAG so there are no paths of length > k-1 K-path Vertex Shattering Set K-path Vertex Shattering Set NP-hard: 2-path shattering set k=2 = independent set ? k=T Polynomial: T-path shattering set (T is the longest path length) – min vertex cut in a DAG Critical Individual Set The smallest set of individuals whose absence leaves no metagroups (for given α and β) a b a b cd ab cd a a b b ab cd c c c d d d … Other questions: • Close Group: individuals that appear together more than others. • Loyal Individuals: appear most frequently in any metagroup. • Individual Membership: metagroup which maximizes the cardinality of the set of groups in which a given individual occurs. Extra/Introvert: member of the largest/smallest number of metagroups. • Metagroup Representative: an individual who occurs more in a metagroup than any other individual and occurs in it more than in any other metagroup. • Demographic Distinction: given a coloring of individuals, is there a property that distinguishes one color from the others? • Critical Parameter Values: largest values of α, β for which there exists at least k metagroups. Largest γ for which each metagroup has at least k members. • Sampling Rate: largest time step such that the answer does not change if the time step is decreased but changes if it is increased. • Critical Time Moments: e.g., the time when the groups' membership changes most, i.e. minimal edge weight sum between time steps. • Data Augmented Solution Reconciliation: given partial sets of observations and a partial solution, find is the combined solution to the entire input. Conclusions • New data structure with explicit time component of social interactions • Generic – applicable in many contexts • Powerful – can ask meaningful questions (finding leaders of zebras) • But! (And?) many hard algorithmic questions – lots of work! Credits: Jared Saia Dan Rubenstein Siva Sundaresan Ilya Fischoff Simon Levin S. Muthu Muthukrishnan Martin Pal