A Stochastic Model for Directed Graphs with Transition Rates Determined... Reciprocity Stanley S. Wasserman Sociological Methodology

advertisement
A Stochastic Model for Directed Graphs with Transition Rates Determined by
Reciprocity
Stanley S. Wasserman
Sociological Methodology, Vol. 11. (1980), pp. 392-412.
Stable URL:
http://links.jstor.org/sici?sici=0081-1750%281980%2911%3C392%3AASMFDG%3E2.0.CO%3B2-T
Sociological Methodology is currently published by American Sociological Association.
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at
http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained
prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in
the JSTOR archive only for your personal, non-commercial use.
Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at
http://www.jstor.org/journals/asa.html.
Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed
page of such transmission.
The JSTOR Archive is a trusted digital repository providing for long-term preservation and access to leading academic
journals and scholarly literature from around the world. The Archive is supported by libraries, scholarly societies, publishers,
and foundations. It is an initiative of JSTOR, a not-for-profit organization with a mission to help the scholarly community take
advantage of advances in technology. For more information regarding JSTOR, please contact support@jstor.org.
http://www.jstor.org
Tue Oct 2 13:52:23 2007
A STOCHASTIC MODEL
FOR DIRECTED GRAPHS
WITH TRANSITION RATES
DETERMINED BY RECIPROCITY
Stanley S. Wasserman
UNIVERSITY OF MINNESOTA
This chapter addresses the problem of describing mathematically the evolution of a binary directed graph, or social network, over time. We develop a new statistical method for the
analysis of social networks and discuss its usefulness for sociologists
Partial support was provided by the National Science Foundation, Grant
SOC 73-05489 to Carnegie-Mellon University. Revisions were made while the
author was a Postdoctoral Training Fellow of the Social Science Research Council.
This chapter is an abbreviated version of Technical Report 305, University of
Minnesota, School of Statistics. I wish to thank Joseph Galaskiewicz for comments
on the manuscript.
STOCHASTIC MODEL FOR DIRECTED GRAPHS
and others studying networks of individuals, organizations, or
groups.
Interest in substantive sociological network theory is at an
all-time high level-more and more researchers have been using the
social network paradigm to study social structure. The number of
sophisticated mathematical models for directed graphs is increasing
at a pace equal to the rise in interest. Mathematicians and statisticians are seeing the application of their research to social networks.
But even with the growing popularity of networks, many of
the methods used by network analysts are very elementary, and
some are possibly obsolete. This situation has both positive and
negative features. O n the positive side, because most of the quantitative aspects of the field are relatively new, much of the data
analysis is exploratory in nature, and simple well-known methods
are often sufficient to provide both sociological insights and directions for future research. O n the negative side, the elementary
methods typically involve univariate statistics such as density
measures, correlation coefficients, and the like, and researchers have
not been able to benefit from carefully developed stochastic models
and modern methods of multivariate statistical analysis. To paraphrase Burt (1978), a discipline of applied network analysis needs to
be defined, emphasized, and used to bridge the gap between the
substantive sociological theorists and the mathematical and statistical modelers.
This chapter was presented at a conference on Stochastic
Process Models for Social Structure in December 1977. The current
gap between sociologists and mathematicians was apparent at the
conference-although everyone recognized the need for dynamic
stochastic analysis of networks, the new models described in the
papers given by statisticians were either so mathematical that they
were not accessible to most participants or they involved mathematical and statistical assumptions that were gross simplifications.
In order to tap the current interest in stochastic modeling, it is
necessary both to develop a new methodology based on a stochastic
modeling framework that is reasonably free of mathematical difficulties and to demonstrate how the methodology could be used to
answer fundamental sociological questions. This chapter is a timid
start in that direction.
STANLEY S. WASSERMAN
Social groups are acknowledged to change over time, an
evolution that alters the links that exist between group members.
However, there has been little effort directed at the construction of
realistic models for this evolution. Undirected graphs and multivalued graphs have been effectively modeled stochastically, but the
fitting of these models to binary directed graphs yields little insight
into the binary processes under investigation. Indeed, the most
important features of these processes are the on/off nature and the
directedness of the arcs in the structural graph. Ignoring these two
qualities in model construction would be a fundamental error.
We shall describe a simple stochastic model for the process
of change in a binary digraph and propose several strategies for
estimating the parameters of the model. The appropriate strategy
for a given situation depends on the number of times the group is
observed in its evolution. An application of this model to several
data sets (Wasserman, 1977) has yielded some interesting insights
into group processes.
This chapter uses a general mathematical language for
presentation of results, since the model has application in other
areas, such as communication, transportation, and the natural
sciences (see Wasserman, 1978). Sociological applications are discussed in the last section. The mathematics have been simplified,
and all proofs have been omitted. Those interested can consult
Wasserman (1977) or the technical report mentioned in the opening
footnote for details.
The stochastic model described here is quite simple. But it is
unfair to assess the merits of such a model by its complexity or lack
thereof. Much can be learned from simple models. The information
to be gained may yield valuable insights into the process in question. The ideas presented in this chapter may allow researchers to
postulate and analyze more complicated models.
I. INTRODUCTION TO THE PROBLEM
Consider a directed graph or digraph that structurally represents the state of some process, sociological or otherwise, at time t. A
digraph is a set V = {v,, u p , . . . , vg) of nodes and a set L =
{I,, I,, . . . , I,) of directed arcs connecting pairs of nodes. We let
STOCHASTIC MODEL FOR DIRECTED GRAPHS
395
li = vjvk be the directed line running from node vj to node vk and
further stipulate that the digraph be binary; that is, if two distinct
arcs exist such that li = vjvk and li, = ujvk, then li = li8.In addition,
arcs li = vjvi do not exist, ruling out the existence of loops in the
digraph. We commonly let D, represent a digraph with g nodes.
Let X(t) be the adjacency matrix representing the state of D,at
time t. Specifically, X(t) = [Xij(t)], where
Xii(t) =
if vivi E L at time t
otherwise
The time parameter t is assumed to be continuous, t 2 0.
The matrices x, w, y, z, . . . are single states of the continuous time stochastic process X(t). The process has a state space S of
all possible (g x g) binary-valued matrices with zero diagonal2~b-I)in number, making S quite large.
The problem that this chapter considers is the formulation
and evaluation of a stochastic model for X(t), where the transitions
between states of S depend on the state of the digraph at time t. We
accomplish this by assuming X(t) to be a continuous-time Markov
chain and by allowing the infinitesimal transition rates to be specific
functions of the elernents of X(t). This solution to the problem was
first suggested by Holland and Leinhardt (1977a) and has been
further elaborated on by Holland and Leinhardt (1977b) and by
Wasserman (1977, 1978). Several models using these assumptions
are discussed in depth by Wasserman (1977), including the model
incorporating only reciprocity, which is the subject of this discussion.
In Section 11, we discuss the Holland-Leinhardt modeling
framework and outline its assumptions. We present a simple model
for digraphs based on reciprocity and describe how this particular
parameterization substantially reduces the size of the state space. In
Section 111, we compute moments and equilibrium distribution for
the stochastic process arising from the reciprocity model.
The parameterization chosen for the reciprocity model has
four parameters that can vary from digraph to digraph. In the latter
sections of this chapter we estimate these parameters. In Section IV,
we consider whether an observed set of observations of a digraph is
representable as a continuous-time Markov chain and give a simple
396
STANLEY
S. WASSERMAN
procedure for computing reasonable parameter estimates. Maximum-likelihood estimation, a computationally more difficult procedure, is discussed in Section V. We conclude with some comments
for social networkers.
II. T H E M O D E L I N G F R A M E W O R K
The Holland-Leinhardt framework consists of two assumptions regarding the stochastic nature of the arcs X i j ( t ) . The first is
that X ( t ) is a Markov chain:
ASSUMPTION
1. Markov
X ( t ) is a standard Markov chain with jnite state space S and
Pxy(t,h) = P { X ( t
+ h ) = yl X ( t ) = x)
(1)
as the probability transition matrix.
+
Second, we assume that for small intervals of time ( t , t
h ) , the
changes in the arcs of a digraph are statistically independent:
ASSUMPTION
2. Conditional Change Independence
Assumption 2 is crucial. It implies that the probability of
any two arcs changing simultaneously is essentially zero. In a small
interval of time, only two changes can occur for a single arc: Arcs
present at time t may disappear at time t
h and vice versa. Thus
we represent the probability of arc changes as
+
Note that A, the change rate function, depends on the state of the
digraph at time t and on t itself.
Let q,,(t) be the injnitesimal transition rates of X ( t ) , the
STOCHASTIC MODEL FOR DIRECTED GRAPHS
digraph process. We have
Aij(x,t)
9xy(t) =
I0
I
if y and x differ only in the
(i,j)th element
if y and x differ by more than one
element
(4a)
and
as terms of Q(t), the matrix of infinitesimal transition rates. Wasserman (1977) discusses some characteristics of S and proves that if
a simple condition is satisfied, specifically a restriction to nonzero
change rates Aii(x,t), then the digraph process has an equilibrium
distribution.
III. THE RECIPROCITY MODEL AND THE DYAD PROCESS
One can postulate various functional forms for the change
rate function A, defined in Equation (3), and generate many models
incorporating Assumptions (1) and (2) of the modeling framework.
We propose one such model in this section with a parameterization
that produces a state space D of only four states. The model is for
reciprocity where the tendency over time for the arc vivi to exist
depends only on the presence or absence of arc upi.
We shall consider the (9,) dyads, or 2-subgraphs, of D,, which
by assumption are independent and identically distributed. After
computing moments and equilibrium distribution of the dyad
process, we briefly describe its probability transition matrix.
We dichotomize the change rate function (3) into two
functions to allow for both types of arc changes. Define
398
STANLEY S. WASSERMAN
for small h. Note that the rates of the process are now time-homogeneous and, consequently, the digraph process is stationary in time.
For the reciprocity model, we assume that
There are g(g - 1) pairs of the change intensities (6a) and (6b) such
that the pair for ( i , j ) depends only on the pair for (j,i).
T he
parameters ho and XI are measures of the "overall" rate of change
for an arc, and p0 and p, measure the "importance" of a reciprocated arc. Common sense suggests that empirically
and
since, in the absence of a reciprocated arc, there should be a greater
A,,). Moreover, we
tendency for arcs to appear than disappear (A,
assume that in the presence of a reciprocated arc (1) the tendency
for an arc to appear from vi to vj should increase (po 0) and
(2) the tendency for the arc from vi to vj to disappear should
decrease (pl
0). Positivity of hl
p1 requires that p,
-Al.
Inequalities (7a) and (7b) are more likely to be true in friendship
networks where the tendency over time is toward mutuality. These
parameters are discussed further in Section VI.
Let
<
<
+
>
>
be the dyad for the pair of nodes (i,j ) . The state space D of the dyad
Dij(t) contains four states as illustrated in Figure 1. The Q intensity
matrix for the dyad process is shown in Table 1.
The parameterization (Equations 6a and 6b) of the reciprocity model yields a set of dyad processes {Dij(t)) that are independent. The entire X(t) digraph process can be represented as ( 4 )
independent dyad processes consisting of the symmetrically positioned pairs of off-diagonal elements of X(t). Moreover, the {Dij(t))
STOCHASTIC MODEL FOR DIRECTED GRAPHS
Figure 1. States contained in D
w .
4
I
2
(a) Mutual Relation (M)
(b) Asymmetric Relation ( A l )
*
J
2
(c) Asymmetric Relation (AO)
.
2
(d) Null Relation (N)
are identically distributed, continuous-time, four-state Markov
chains with state space D and intensity matrix Q.
We now compute the moments of the dyad process-that is,
the probability, on average, that the arc vivj is present or absent and
that vivj and viai are present at time t. Let
TABLE 1 Q Matrix for the Dyad Process State
( 0 )
(1,0)
(0,l)
(1,l)
-2Ao
A,
A,
0
A0
-(A0
+ A1 + PO)
0
A1
+ PI
A@
0
-
0
A1
+ 1 + PO
+ PI
0
A0 + Po
A,
yo
-2@1 + P I )
+
400
STANLEY S. WASSERMAN
be the vector of first and second moments of the {Dij(t)).Also define
The moments m(t) are computed by solving the differential equation
The solution of ( l l ) , subject to a set of initial conditions at time
to, 1s
where n(t) is the solution of the homogeneous equation
Calculation of the integral (Equation 12) yields a very
complicated solution for m(t) that gives little insight into the nature
of the process. However, the equilibrium distribution of the dyad
process, the probabilities that a dyad is in one of the four states as
t + co, is simple to calculate and easy to comprehend. There are
three ways the equilibrium probabilities can be found: letting
t + co in Expression (12); setting the system of differential equations in (1 l ) to zero and solving for m(oo); or showing that the dyad
process is reversible, allowing the equilibrium probabilities to be
simply found from the reversibility equations. Mathematical details
are given in Wasserman (1977).
Let (q{Dij(t);Dij(t h ) ) ) be the elements of the Q matrix
for the dyad process given in Table 1. We define
+
rM(t)= P{Dij(t) = (1,l))
(mutual)
7iAl(t) = P{Dij(t) = (1,O))
(asymmetric)
7iAO(t)= P{Dij(t) = (0,l))
(asymmetric)
' i ~ ~ (=t )P{Dii(t) = (0,O))
(null)
(14)
STOCHASTIC MODEL FOR DIRECTED GRAPHS
40 1
as the elements of ~ ( t )the
, vector of probabilities of the four states of
the dyad process. The equilibrium probabilities are
These probabilities are simple functions of the model parameters
and can be estimated by computing them with parameter estimates.
Comparing Equation (15) with the empirical distributions gives
information on how close the network and the dyad process are to
equilibrium.
The probability transition matrix for the dyad process is a
very complicated expression. We find it by examining the eigenvalues and eigenvectors of the (4 x 4) infinitesimal generator Q and
then compute etQ. The reader is referred to Wasserman (1977) for
the details.
IV. EMBEDDABILITY
To estimate the four parameters of the reciprocity model
and determine whether the model provides a good description of the
evolution of a directed graph, we must use empirical observations
on the process. We use the general notion of embeddability, the
determination of whether an obserued probability transition matrix
could have arisen from a continuous-time Markov chain. We continue the discussion in the next section, where we estimate the
model parameters.
Suppose that we have observations on some process z(t)
denoted by r,(t), z2(t), . . . , zK(t).Assume that the process has finite
state space Z, with states labeled 1, 2 , . . . , N. For example, if
r(t) = Dii(t),the dyad process, we have K = (3), Z = D, and N = 4.
402
STANLEY S. WASSERMAN
These observations {z(t)) are collected at times t = to, tl, . . . , tn,
where the t, are distinct increasing positive numbers.
If n is knite and nonzero, we define the empirical probabili~
1 5 n, with elements [pii(tl - tk)]
transition matrix P(t, - tk),0 5 k
as follows (where t = tl - tk):
<
where Tj(t) is the number of 2's in state i at time t, and in state j
at time tl. These estimates are maximum-likelihood estimates of the
elements of the probability transition matrix of a stationary,
discrete-time Markov chain (Anderson and Goodman, 1957). For
moderate n, there is a substantial number of estimated transition
matrices that can be used to test the suitability of a continuous-time
Markov chain for the data.
Embeddability was first posed as a problem by Elfving
(1937) and discussed by Kingman (1962), but only recently has the
problem been rigorously solved by Singer and Spilerman (1974).
The embedding problem, as formulated by Singer and Spilerman
(1976), is:
Find simple test criteria on the elements of an observed
stochastic matrix @(t),0
t
cc,which will guarantee that it
car, be written in the form P(t) = etB for some Q w i t h elements
for i # j
qij 2 0
< <
A
50
for i = 1,2, . . . , N
C qii = 0
f o r i = 1 , 2 , . . . ,N
qii
(17)
i
The data-analytic problem is to find the subclass of all Q matrices
with structure (17) that could have given rise to the observed ^P
matrix or matrices.
This subclass of Q matrices is
Q = {all matrices Q with structure (17) such that (tl - t k ) Q
(18)
= log F(t, - t,) for 0
k
1 5 n)
< <
403
STOCHASTIC MODEL FOR DIRECTED GRAPHS
Note that Qmay contain more than one element even when n = 1
because the logarithm function of a matrix is a "one to many"
function. Moreover, when n
1, the collection of Q matrices
compatible with a subset of the empirical ^P matrices may not
coincide with the Q matrices compatible with a different subset.
This situation is caused by the negation of the Markov assumption
of time-homogeneous transition rates.
Although there are many necessary conditions for Q to be
nonempty, no simple sufficiency criteria exist. One must verify that
the observed ^P matrix satisfies the necesary conditions; and even
then, there may not be a Q matrix compatible with the observed
transition matrix.
1f ^P has positive, distinct, real eigenvalues, however, there is
a unique Q matrix for a given P. The scalar logarithm function is
multiple-valued only when it has complex arguments and is undefined with negative arguments. The following theorem, due to
Sylvester, and also given by Singer and Spilerman (1976), states the
result for this special instance.
>
A
THEOREM.
SYLVESTER'S
If P is an ( N X N ) matrix w i t h
is single-valued
distinct eigenvalues hl, h2, . . . , AN, and f f
in a neighborhood of each of the eigenvalues, then
( a )
For proof see either Gantmacher (1960) or Sylvester (1883).
The application of this theory to the reciprocity model is
straightforward. Suppose, for example, that we assume the model is
operating and we have observations on the digraph at time to and t,.
.,We reduce X(to) and X(tl) to two sets of (q) dyads and form
P(tl - to), a 4 x 4 matrix of dyad transitions. If this empirical probability transition matrix has four positive, real, distinct eigenvalues, we can calculate the unique empirical Q matrix for these
observations. Since we know the functional form of Q (Table I), we
can easily solve for the parameters A,, A,, po, p1 by setting the
theoretical elements of Q equal to the observed, calculated values.
Wasserman (1977) gives conditions on the elements of ^P
A
404
STANLEY S. WASSERMAN
which ensure that the eigenvalues of ^P are real, distinct, and
positive. The conditions generally hold if the diagonal elements of ^P
are large, relative to the off-diagonal elements. This "diagonaldominant" situation is likely to occur when the two observation
time points to and tl are close. Taking observations close in time
allows only a few transitions to other states so that the diagonal
elements remain near unity.
Occasionally log ^P yields a Q that does not have structure
(17); that is, the off-diagonal elements of
may be negative. If so,
then one may force the negative elements to zero, adjust the remaining terms to maintain the zero-row.,sum, and obtain approximate parameter estimates. O r a different P can be examined if there
are more than two observations on the process. This strategy of
estimating Q by computing logarithms of empirical transition
matrices is flexible, but it yields good estimates. We discuss the
usefulness of this technique in the next section.
V. PARAMETER ESTIMATION
In this section we examine estimation of the parameters of
the reciprocity model. There are three possible sampling schemes:
continuous record; single observation; and two or more observations.
Continuous Record
When one has a continuous record of all changes in each of
the ( 9 , ) dyads for a time interval (t,,,~),estimation is relatively easy
and need not be discussed. Billingsley (1961) discusses the theory for
general Markov processes. One assumes that the Q matrix is a
function of a vector of parameters 8 and reduces the continuous
record { D i j ( t ) ,to 5 t 5 T ) for all dyads to observations on the
discrete jumps of the process and the total waiting time in each
state. Billingsley estimates 8 by maximum likelihood and discusses
likelihood ratio tests.
Single Observation
When one has a finite number of observations on a continuous-time Markov chain, estimation is rather difficult. Darwin
(1956) recognizes this and states that the complex form of the
STOCHASTIC MODEL FOR DIRECTED GRAPHS
405
likelihood, with observations on the process taken once or at regular
intervals, almost prohibits the use of maximum-likelihood estimation. Keiding (1974, 1975) considers sampling of the process at
equidistant time points to, ti, tZj,. . . , tni, but only derives asymptotic results for n -+ c ~ .
We now consider estimation of the four parameters of the
reciprocity model, A,, A,, po, pl, assuming that we have a single
observation on the entire digraph. A single adjacency matrix is the
data set most frequently collected by sociologists studying social
networks. We discuss the likelihood function of the parameters,
given this single adjacency matrix, and estimate two meaningful
functions of the parameters.
Suppose we have a sole observation of the stationary X(t)
process, which we label x. The matrix x contains (8) independent
observations of the dyad process Dij(t) with state space D. We are
interested in the numbers of each type of dyad. Since the labeling of
the nodes in the directed graph is arbitrary, we cannot distinguish a
(1,O) asymmetric from a (0,l) asymmetric. Consequently, the information in x can be summarized by three statistics:
xiixji = number of mutuals
M(t) =
i>i
A(t) =
x
(1 - xir)xii + xii(l - xii)] = number of asymrnetrics i >i N(t) =
x
(1 - xii)(l - xfi) = number of nulls i>i +
+
where M(t)
A(t)
N(t) = (92).
Define L(0 I x) as the likelihood function of the parameters
given the single matrix x, where 0 = (Ao,A,, po, pl)'. L is merely a
multinomial likelihood:
L(O 1 X) =
7iM(t)xtlxl*~Al(t)"'l(l-~l,)
all dyads
i>l
x
(21)
~~~(t)(l-x~,)x~%7i~(t)(~-x,~)(~-x,,)
406
STANLEY S. WASSERMAN
where the T'S, defined in Equation (14), are the probabilities of the
four states of the dyad process. The log likelihood can be written
succinctly as
+
where aA(t) = aAl(t) aAO(t).Indeed, M(t), A(t), and N(t) are
sufficient statistics.
The expressions for a M ,a*, and aNare linear combinations
of the elements of the m(t) vector of moments given in (12) and
hence are quite complicated expressions. For computational ease,
we shall use the steady-state values of a M ,T*, and aN,defined in
(15). Let
so that
The log likelihood (22) is
log L(0 I x) = M(t)log(a/2b) - N(t)log(2b/c)
-
($)log(a/2b
+ 6/26 + 1)
(26)
where A(t) = (92) - M(t) - N(t).
The likelihood function (26) depends on four unknown
parameters 8, but it contains only two "pieces of information": M(t)
and N(t). Hence we can estimate only two functions of the parameters, which are the change ratios
8 , is the ratio of the probabilities of change, in a small time interval,
in the presence of a reciprocated arc and O 2 is the ratio in the absence
of a reciprocated arc. The maximum-likelihood estimates of
(see Wasserman, 1977) are
Q2
and
407
STOCHASTIC MODEL FOR DIRECTED GRAPHS
and are easily computed from the data.
There are two other ratios that are more interesting than fll
and f12. These are
These ratios, K~ and K,, directly emphasize the importance of a
reciprocated arc: K~ ( K ~is) the effect of a reciprocated arc on the
change from a nonchoice to a choice (choice to a nonchoice). We
1 and K~
1. But these "reciprocity" ratios
suspect that K~
cannot be estimated via maximum likelihood with only one observation on the digraph. (See Wasserman, 1977, for proof of this fact.)
Note that the ratio of the change ratios fll and 8, is an odds
ratio. We have
>
<
the increase in the odds of a new arc vivj coming into existence
during the interval (t, t
h ) due to the existence of ujui. With no
reciprocity effect, log (f11/02) = 0; if log (01/f12) 0, a positive
reciprocity effect is present. So even though fll and f12 considered
separately are not very informative, their ratio is quite interesting.
+
>
Two or More Observations
We now examine situations where one has a data set containing several observations on the directed graph. First consider
408
STANLEY S. WASSERMAN
>
n = 2, and let X(tl) and X(t2), t2
tl, be the two observations on
the process. We examine each of the pairs [Xii(tl), Xii(tl)] and
[Xij(tz),Xii(t2)] in turn and form a 4 x 4 contingency table with
rows corresponding to the dyad state at time tl and columns corresponding to the t2 state. There are (8) "counts" in this table. (See
Bishop, Fienberg, and Holland, 1975, for discussion of representation
of data from Markov models as contingency tables.)
We let T denote such a table, with entries (tkl),where
k = 1 = null (0,O)
k = 2 = asymmetric (1,O)
k = 3 = asymmetric (0,l)
(32)
k = 4 = mutual (1,l)
and similarly for the subscript I.
The likelihood function when n = 2 is
L[e I x(t1) = x, ~ ( t , )=
= [ ~ , ( t , ) ~ (TA(tl)A(tl)
~~)
TN(tl)~(tl)~
where the [pkl(tZ- tl)] are the elements of the probability transition matrix for the dyad process.
Suppose n
2. We now have observations on the digraph
process X(tl), X(t2), . . . , X(tn). We form (n - 1) matrices Tm,
m = 1, 2 , . . . , (n - I), with elements (tklm),where Tm gives the
transitions of the dyads at time tm to the dyads at time tm+,. The
process is stationary, so that the probability transition matrix for the
dyads depends only on tm+, - t,. The likelihood function is
>
When we have equidistant sampling, such that t, - t, =
t, - t2 = . . . - tn - tnPl = t, then the likelihood (34) simplifies to
STOCHASTIC MODEL FOR DIRECTED GRAPHS
We can "pool" transitions across time points if we are certain of
time homogeneity, so that we have an effective sample of ( n - 1) (9,)
dyads.
To estimate the four parameters 8, we can differentiate the
logarithm of ( 3 3 ) or ( 3 4 )with respect to each of the four parameters,
setting the resulting derivatives to zero, to obtain a system of four
equations in four unknowns. But the PkI'sare nonlinear, being sums
of exponentials. In this situation, maximum-likelihood estimation is
not only unreliable, but the solutions can be obtained only approximately.
We may study the embeddability of the data in hand,
however, and develop a new strategy. If n = 2, we compute the
(4 x 4 ) empirical probability transition matrix and, using the
., rules
outlined in the previous section, find an empirical Qmatrix, Q. The
elements of Q are simple functions of the parameters Ao, A,, po, pl;
consequently, reliable estimates of the parameters are easily obtained. If n
2, we have several Q matrices; we compute several
plausible estimates of 0 and study each. We can use these estimates
as starting values for a Newton-Raphson iterative solution to the
likelihood equations, or we can explore the likelihood function in
the vicinity of these points, as is done in Wasserman (1977).
>
A
VI. SOCIOLOGICAL CONSIDERATIONS
To conclude this chapter, we wish to mention two ways in
which the model can be utilized by social network researchers. First,
the reciprocity model is a device to study the "structural tendency"
toward reciprocity; second, it serves as a benchmark, a standard to
which empirical data can be contrasted to uncover higher levels of
structural tendencies that might be present.
The utility of the model is that by specifying the probabilities that dyads change state, it allows the theorist to postulate how
the current structure of a group with a fixed amount of reciprocity
influences the future structure. We can study tendencies toward
410
STANLEY S. WASSERMAN
integration or symmetry by examining whether null relations become asymmetric relations, asymmetric relations become symmetric, mutual relations, and whether mutual relations exhibit a large
degree of stability. We can also examine tendencies away from
symmetry by postulating that arcs will disappear over time, that
dyads are likely to go through transitions such as M + A + Nmutual to asymmetric to null. Whatever the overall direction of
movement, the four parameters of the model not only measure the
direction but also state the overall effect of reciprocated arcs on the
movement.
The parameters ho and A, allow us to assess whether asymmetric ties tend to be formed (A,) and whether asymmetric ties tend
to disappear (A,) over time, in the absence of the reciprocated arc
vjvi.The parameters must be positive, and their relative magnitude
reflects these two tendencies. With movement toward symmetry, ho
should exceed hl, since the number of linkages should increase over
time. In contrast, movements away from symmetry imply that Xo
should be less than A,.
The parameter sums Xo
p, and hl
p1 measure the
chance that linkages between actors i and j form, or disappear, in
the presence of the reciprocated linkage. Thus p, and p, directly
measure the effect of reciprocated arcs on tendencies toward and
away from symmetry. Large values of A,
po indicate movement
from asymmetric toward mutual relations, while small values of
A, + p1 imply that mutual relationships are unlikely to lose linkages.
The second major use of the reciprocity model is as a
benchmark-a null model against which networks with complicated
structure can be compared. A study of residuals from the model and
its fit will allow researchers to isolate more complicated structural
tendencies, such as popularity, isolation, and expansiveness. Because
of its simplicity, the reciprocity model will rarely fit a data set so
well that a sociologist can confidently conclude that reciprocity
effects are adequate to describe the structure of a given network.
However, the model is valuable in that it allows both the opportunity to rule out reciprocity as the sole structural tendency and the
chance to find other tendencies in the model's lack of fit.
There is a strong tradition of the model-as-benchmark in
sociometric analysis. The Davis-Holland-Leinhardt transitivity
+
+
+
STOCHASTIC MODEL FOR DIRECTED GRAPHS
41 1
studies are based on the same philosophy. After adjusting a triad
census for effects of mutuality and asymmetry (Holland and Leinhardt, 1975) and, more recently, distributions of indegrees and
outdegrees (Holland and Leinhardt, 1979), a researcher can study
how far the census is away from its expected value to determine how
much transitivity remains. The ideal state of no intransitivity is a
benchmark, with small values of T indicating adherence to the ideal.
This modeling strategy is likely to increase in future sociometric
research, as researchers realize that we are limited to these simple
models. The construction of realistic, "all-inclusive" models is just
too difficult mathematically.
This research is limited in other ways. The reciprocity
model incorporates Markov assumptions that may not be appropriate for specific types of networks. Moreover, the transition rates
are postulated as constants over time, and dyads are assumed to be
independent and subject to the same homogeneous rates. Even so,
the history of Markov modeling in sociology, beginning with
Kemeny and Snell (1960, 1962) and continuing with Coleman
(1965), is rich and has given researchers many insights into sociological processes. A Markov model of reciprocity in social networks
should prove to be a useful tool for social networkers.
REFERENCES
ANDERSON, T. W., AND GOODMAN, L. A.
1957 "Statistical inference about Markov chains." Annals of Mathematical Statistics 28:89-110.
BILLINGSLEY, P.
1961 Statistical Inferences for Markov Processes. Chicago: University of
Chicago Press.
BISHOP, Y. M. M., FIENBERG, S. E., AND HOLLAND, P. W.
1975 Discrete Multivariate Analysis: Theory and Practice. Cambridge,
Mass.: MIT Press.
BURT, R. S.
1978 "Applied network analysis: An overview." Sociological Methods
and Research 7:123-130.
COLEMAN, J. S.
1965 Introduction to Mathematical Sociology. Glencoe, Ill.: Free Press.
DARWIN, J. H.
1956 "The behaviour of an estimator for a simple birth and death
process." Biometrika 43:23-31.
412 STANLEY S. WASSERMAN
ELFVING, G.
1937 "Zur Theorie der Markoffschen Ketten." Acta Social Science
Fennicae n., Series A.2, 8: 1-1 7.
GANTMACHER, F. R.
1960 The Theory of Matrices. Vol. 1. New York: Chelsea.
HOLLAND, P. W., AND LEINHARDT, S.
1975 "Local structure in social networks." In D. R. Heise (Ed.), Sociological Methodology 1976. San Francisco: Jossey-Bass. 1977a "A dynamic model for social networks." Journal ofMathematica1 Sociologv 5:5-20.
1977b "Social structure as a network process." Zeitschrift fur Soziologie
6:386-402.
1979 "Structural sociometry." In P. W. Holland and S. Leinhardt
(Eds.), Social Networks: Surveys, Advances and Commentary. New
York: Academic Press.
KEIDING, N.
1974 "Estimation in the birth process." Biometrika 61:71-80.
1975 "Maximum likelihood estimation in the birth-and-death process model." Annals of Statistics 3:363-372.
KEMENY, J. G., AND SNELL, J. L.
1960 Finite Markov Chains. Princeton: Van Nostrand.
1962 Mathematical Models in the Social Sciences. Waltham, Mass.: Blaisdell.
KINGMAN, J. F. C.
1962 "The imbedding problem for finite Markov chains." Zeitschrift
fur Wahrscheinlichkeitstheorie 1: 14-24.
SINGER, B., AND SPILERMAN, S.
1974 "Social mobility models for heterogeneous populations." In
H. Costner (Ed.), Sociological Methodology, 1973- 1974. San
Francisco: Jossey-Bass.
1976 "The representation of social processes as Markov models."
American Journal of Sociology 82: 1-54.
SYLVESTER, J. J.
1883 "On the equation to the secular inequalities in the planetary
theory." Philosophical Magazine 16:267-269.
WASSERMAN, S. S.
1977 "Stochastic models for directed graphs." Unpublished doctoral
dissertation. Department of Statistics, ~HarvardUniversity.
1978 "Models for binary directed graphs and their applications."
Advances in Applied Probability 10:803-8 18.
Download