Networks in Biology Bioinformatics: Sequence Analysis COMP 571 - Spring 2015

advertisement
Networks in Biology
Bioinformatics: Sequence Analysis
COMP 571 - Spring 2015
Luay Nakhleh, Rice University
Transcriptional Networks
Systems of transcription factors (TFs) and their target genes
[Source: Wikimedia]
Signaling Networks
Systems of signaling proteins and their interactions
[Source: Wikimedia]
Metabolic Networks
Systems of chemical reactions that convert nutrients into cellular
materials or use them to provide energy
[Source: Wikimedia]
Protein Interaction Networks
Systems of proteins and their interactions
[Source: Barabasi et al., 2004]
Neuronal Networks
Systems of neurons and their interactions
[Source: neuralwiki.blogpost.com]
Ecological Networks
Systems of organisms and their interactions
[Source: Wikimedia]
Phylogenetic Networks
Reticulate evolutionary histories
[Source: Teshima and Innan, 2012]
Social Interaction Networks
Systems of individuals and their interactions
[Source: Christakis and Fowler, 2007]
Keep In Mind
✤
A network can represent any system of entities where pairwise
relationships can be established...
✤
The network representation lends the system to a wide array of
network-oriented analytical and visualization tools.
Networks as a Guiding Tool
✤
Networks have been used to
✤
guide GWAS
✤
predict protein function
✤
model epidemics
✤
...
Vol. 29 ISMB/ECCB 2013, pages i171–i179
doi:10.1093/bioinformatics/btt238
BIOINFORMATICS
Efficient network-guided multi-locus association mapping
with graph cuts
Chloé-Agathe Azencott1,*, Dominik Grimm1, Mahito Sugiyama1, Yoshinobu Kawahara2 and
Karsten M. Borgwardt1,3
1
Machine Learning and Computational Biology
Researchet
Group,
C.-A.Azencott
al. Max Planck Institute for Developmental Biology & Max
Planck Institute for Intelligent Systems Spemannstr. 38, 72076 Tübingen, Germany, 2The Institute of Scientific and
Industrial Research (ISIR) Osaka University 8-1 Mihogaoka, Ibaraki-shi, Osaka 567-0047, Japan and 3Zentrum für
Bioinformatik, Eberhard Karls Universität Tübingen, 72076 Tübingen, Germany
(a)
(b)
1 INTRODUCTION
Downloaded from http://bioinformatics.oxfordjournals.org/ at Rice University on November 21, 2013
sets of genetic variants to the phenotype has the potential to help
explain part of this missing heritability (Marchini et al., 2005).
Motivation: As an increasing number of genome-wide association
Although efficient multiple linear regression approaches (Cho
studies reveal the limitations of the attempt to explain phenotypic
et al., 2010; Rakitsch et al., 2012; Wang et al., 2011) make the
heritability by single genetic loci, there is a recent focus on
detection of such multivariate associations possible, they often
associating complex phenotypes with sets of genetic loci. Although
remain limited in power and hard to interpret. Incorporating
several methods for multi-locus mapping have been proposed, it is
biological knowledge into these approaches could help boosting
often unclear how to relate the detected loci to the growing knowledge
their power and interpretability. However, current methods are
about gene pathways and networks. The few methods that take
limited to predefining a reasonable number of candidate sets to
biological pathways or networks into account are either restricted to
investigating a limited number of predetermined sets of loci or do not
investigate (Cantor et al., 2010; Fridley and Biernacka, 2011; Wu
scale to genome-wide settings.
et al., 2011), for instance by relying on gene pathways. They
(c)
Results: We present SConES, a new efficient method to discover sets
consequently run the risk of missing biologically relevant loci
of genetic loci that are maximally associated with a phenotype while
that have not been included in the candidate sets. This risk is
being connected in an underlying network. Our approach is based on
made even likelier by the incomplete state of our current bioloa minimum cut reformulation of the problem of selecting features
gical knowledge.
under sparsity and connectivity constraints, which can be solved
For this reason, our goal here is to use prior knowledge in a
exactly and rapidly.
more flexible way. We propose to use a biological network,
SConES outperforms state-of-the-art competitors in terms of runtime,
defined between SNPs, to guide a multi-locus mapping approach
scales to hundreds of thousands of genetic loci and exhibits higher
that is both efficient to compute and biologically meaningful: We
power in detecting causal SNPs in simulation studies than other methaim to find a set of SNPs that (i) are maximally associated with a
ods. On flowering time phenotypes and genotypes from Arabidopsis
given phenotype and (ii) tend to be connected in a given biological
thaliana, SConES detects loci that enable accurate phenotype predicnetwork. In addition, this set must be computed efficiently on
tion and that are supported by the literature.
genome-wide data. In this article, we assume an additive model
Availability: Code is available at http://webdav.tuebingen.mpg.de/u/
to characterize multi-locus association. The network constraint
karsten/Forschung/scones/.
stems
fromthree
the types
assumption
that SNPs
influencing the same
Fig. 1. Small examples
of the
of networks
considered
Contact: chloe-agathe.azencott@tuebingen.mpg.de
phenotype are biologically linked. However, the diversity of the
Supplementary information: Supplementary data are available at
type of relationships that this can encompass, together with the
Bioinformatics online.
current incompleteness of biological knowledge, makes provid! GM (gene membership)
SNPs
are connected
ing a networknetwork:
in which all
the relevant
connections as
are in
present
unlikely. For
this reason,
although
to in
encourage
the sequence network
described
earlier
in we
thewant
text;
add- the
ABSTRACT
pairs of features connected by an e
a given size yields the so-called gra
taken by Huang et al. (2009): th
encourages selecting a small nu
blocks are sets of features define
of the problem. In the case of a g
are defined as small connected
shown in Mairal and Yu (2011)
aforementioned is a relaxation o
number of linear subgraphs or co
size grows exponentially with the
which can reach millions in the ca
only the edge-based version of the
our problem. It is however unclea
ture long-range connections betw
Li and Li (2008) propose a netw
Lasso that imposes the type of g
sirable. However, their approach
works of genes (rather than of SN
easily to the datasets we envisio
they propose relies on a singula
Laplacian of the network, whic
cannot be stored in memory.
Chuang et al. (2007) also sear
protein interaction networks that
Network-Assisted Investigation of Combined Causal
Signals from Genome-Wide Association Studies in
Schizophrenia
Peilin Jia1,2, Lily Wang3, Ayman H. Fanous4,5,6,7, Carlos N. Pato7, Todd L. Edwards8,9, The International
Schizophrenia Consortium", Zhongming Zhao1,2,10*
1 Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America, 2 Department of Psychiatry, Vanderbilt
University School of Medicine, Nashville, Tennessee, United States of America, 3 Department of Biostatistics, Vanderbilt University School of Medicine, Nashville,
Tennessee, United States of America, 4 Department of Psychiatry and Virginia Institute for Psychiatric and Behavior Genetics, Virginia Commonwealth University,
Richmond, Virginia, United States of America, 5 Washington VA Medical Center, Washington, D.C., United States of America, 6 Department of Psychiatry, Georgetown
University School of Medicine, Washington, D.C., United States of America, 7 Department of Psychiatry, Keck School of Medicine of the University of Southern California,
Los Angeles, California, United States of America, 8 Center for Human Genetics Research, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of
America, 9 Division of Epidemiolgy, Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America, 10 Department of
Cancer Biology, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
Abstract
With the recent success of genome-wide association studies (GWAS), a wealth of association data has been accomplished
for more than 200 complex diseases/traits, proposing a strong demand for data integration and interpretation. A
combinatory analysis of multiple GWAS datasets, or an integrative analysis of GWAS data and other high-throughput data,
has been particularly promising. In this study, we proposed an integrative analysis framework of multiple GWAS datasets by
overlaying association signals onto the protein-protein interaction network, and demonstrated it using schizophrenia
datasets. Building on a dense module search algorithm, we first searched for significantly enriched subnetworks for
schizophrenia in each single GWAS dataset and then implemented a discovery-evaluation strategy to identify module genes
with consistent association signals. We validated the module genes in an independent dataset, and also examined them
through meta-analysis of the related SNPs using multiple GWAS datasets. As a result, we identified 205 module genes with a
joint effect significantly associated with schizophrenia; these module genes included a number of well-studied candidate
genes such as DISC1, GNA12, GNA13, GNAI1, GPR17, and GRIN2B. Further functional analysis suggested these genes are
involved in neuronal related processes. Additionally, meta-analysis found that 18 SNPs in 9 module genes had
Pmeta,161024, including the gene HLA-DQA1 located in the MHC region on chromosome 6, which was reported in previous
studies using the largest cohort of schizophrenia patients to date. These results demonstrated our bi-directional networkbased strategy is efficient for identifying disease-associated genes with modest signals in GWAS datasets. This approach can
be applied to any other complex diseases/traits where multiple GWAS datasets are available.
Citation: Jia P, Wang L, Fanous AH, Pato CN, Edwards TL, et al. (2012) Network-Assisted Investigation of Combined Causal Signals from Genome-Wide Association
Studies in Schizophrenia. PLoS Comput Biol 8(7): e1002587. doi:10.1371/journal.pcbi.1002587
Editor: Frederick P. Roth, Harvard Medical School, United States of America
Received October 31, 2011; Accepted May 15, 2012; Published July 5, 2012
Network-based prediction of protein func
REVIEW
Network-based prediction of protein function
Network-based function prediction
R Sharan et al
1
Roded Sharan , Igor Ulitsky1 and Ron Shamir*
networks, with node
networks, with nodes representing
representing proteins
and ed
Roded Sharan1, Igor Ulitsky1 and Ron Shamir*
the detecte
representing the detected PPIs.
In this review, we su
School
of
Computer
Science,
Tel
Aviv
University,
Tel
Aviv,
Israel
In this review, we survey the growing body of works
School of Computer Science, Tel Aviv University, Tel Aviv, Israel
Corresponding
School of
Computer
Science, Tel Aviv
University,
** Corresponding
author. author.
School of Computer
Science,
Tel Aviv University,
functional
annotation offunctional
proteins via annotation
their network
Tel
69978,
Israel. Tel.:
þ 972
3 6405383;
Fax:3 þ
972 3 6405384;
TelAvivAviv
69978,
Israel.
Tel.:
þ 972
6405383;
Fax: þ 972interactions
3 6405384;(summarizedinteractions
in Table I). We(summariz
distinguish
E-mail: rshamir@tau.ac.il
types of approaches (Figure 2): direct annotation schem
1 E-mail: rshamir@tau.ac.il
These
authors
contributed
equally
to
this
work.
ofbased
approaches
(F
1
which infer the function oftypes
a protein
on its connecti
These authors contributed equally to this work.
in the network, and module-assisted
schemes,
which fi
which infer
the function
Received 20.9.06; accepted 9.1.07
identify modules of related
proteins
and then annotate
e
in
the
network,
and
m
Received 20.9.06; accepted 9.1.07
module based on the known functions of its memb
of rel
Naturally, the presentedidentify
methods modules
and the emphasis
particular ones reflect themodule
opinions of based
the authors.
on the
Functional annotation of proteins is a fundamental problem
in the post-genomic era. The recent availability of protein
Naturally, the present
interaction networks for many model species has spurred
particular ones reflect t
on
the development
of computational
methods
interFunctional
annotation
of proteins
is for
a fundamental
problem
Direct
methods
preting such data in order to elucidate protein function. In
in the post-genomic era. The recent availability
of protein
this review, we describe the current computational
The
common
principle underlying all direct methods
interaction
networks
for
many
model
species
has
spurred
approaches for the task, including direct methods, which
functional annotation is that proteins that lie closer to
propagate
functional
information
through
the
network,
and
on the development of computational methods
forin interanother
the PPI network are more likely to have sim
module-assisted methods, which infer functional modules
function.
As can
be seen in Figure 3, there is an evid
preting
such
data
in
order
to
elucidate
protein
function.
In
within the network and use those for the annotation task.
correlation between network distance and functional distan
this review,
we describe
current
Although
a broad variety
of interestingthe
approaches
has computational
that is, the closer the two proteins
are in the network
the m
The
common
principl
been
developed, further
progress
field will depend
on methods,
approaches
for the
task,in the
including
direct
similar arewhich
their functional annotations. The methods
functional annotation i
systematic evaluation of the methods and their disseminabelowand
differ in the way they capture and exploit
propagate
information
the scribed
network,
Figure
2 biological
Directfunctional
versus module-assisted
approaches for functionalthrough
annotation. The scheme
shows a network in which
the functions of some proteins are known (top),
tion
in the
community.
another
in isthe
neta
correlation.
In
the
following,
we denote
PPIPPI
network
where each function is indicated by a different color. Unannotated proteins are in white. In the direct methods (left), these proteins
are assigned
a color thatthe
unusually
Molecular
13 direction
Marchof 2007;
module-assisted
methods,
functional
modules
prevalent Systems
among their Biology
neighbors.
The
thewhich
edges indicatesinfer
the influence
of the annotated
proteins
on the unannotated
In the
module-assisted methods
graph
G¼(V,E)
(see Boxones.
1function.
for
graph-theoretic
definitions).
As can
be s
(right), modules are first identified based on their density. Then, within each module, unannotated proteins are assigned a function that is unusually prevalent in the
doi:10.1038/msb4100129
within the network and use those for the annotation task.
Direct methods
module. In both methods, proteins may be assigned with several functions.
Subject
Categories: computational methods; proteins
correlation between net
Epidemics and Networks
Diseases and the Networks that
Transmit Them
✤
The patterns by which epidemics spread through a population is
determined not just by the properties of the pathogen carrying it
(contagiousness, the length of the infection period, severity, etc.), but
also by network structures within the population it is affecting.
✤
The opportunities for a disease to spread are given by a contact
network: there is a node for each person, and an edge if two people
come into contact with each other in a way that makes it possible for
the disease to spread from one to the other.
Diseases and the Networks that
Transmit Them
✤
Accurately modeling the underlying network is crucial to
understanding the spread of an epidemic.
pling, a, of
e observed
lle equation
mT. We then
n, the change
ic field in
e also
T in static
orientation
MF to equal
xperiments.
...how travel patterns within a city affect the spread of disease
..............................................................
ARTICLE IN PRESS
Modelling disease outbreaks in
realistic urban social networks
1
2
1
1
Stephen Eubank
,
Hasan
Guclu
,
V.
S.
Anil
Kumar
,
Madhav
V.
Marathe
,
Journal of Theoretical Biology 232 (2005) 71–81
www.elsevier.com/locate/y
Aravind Srinivasan3, Zoltán Toroczkai4 & Nan Wang5
1
Basic and Applied Simulation Science Group, Los Alamos National Laboratory,
MS M997,
Los Alamos,
New
Mexico 87545,
USA
Network
theory
and
SARS:
predicting
outbreak diversity
2
Department of Physics, Applied Physics and Astronomy, Rensselaer Polytechnic
a,b,!Troy,
,1
c,1,2
b,d
Institute
110
8th Street,
New York
12180-3590, USA
Lauren
Ancel
Meyers
,
Babak
Pourbohloul
,
M.E.J.
Newman
,
–64 (1972).
3
Department of Computer Science andc,2
for Advanced Computer
5).
Danuta M. Skowronski Institute
, Robert
C. Brunhamc,2 Studies,
University of Maryland, College Park, Maryland 20742, USA
n animals.
4
ion of Integrative Biology
and for
Institute
for Cellular
and Molecular
Biology, University
Texas at
Austin,
1 University
Station C0930, Austin,
Centre
Nonlinear
Studies
and Complex
Systems of
Group,
Los
Alamos
National
78712, USA
c. Lond. B 263,
Laboratory,bSanta
MS B258,
Los
Alamos,
Mexico
87545,
USA
Fe Institute, 1399 HydeNew
Park Road,
Santa
Fe, NM
87501, USA
5
c
Department
of Computer
University
Maryland,
College
University of British
Columbia Centre
for DiseaseScience,
Control, 655
West 12th of
Avenue,
Vancouver,
British Park,
Columbia, Canada V5Z 4R4
agnetite-based
ter for the Study of Complex Systems, University of Michigan, Randall Laboratory, 500 E. University Ave., Ann Arbor, MI 48109-1120, U
...how travel patterns via the worldwide airline network affect the spread of
disease
The role of the airline transportation network in the
prediction and predictability of global epidemics
Vittoria Colizza*, Alain Barrat†, Marc Barthélemy*‡, and Alessandro Vespignani*§
*School of Informatics and Center for Biocomplexity, Indiana University, Bloomington, IN 47401; and †Centre National de la Recherche Scientifique,
Unité Mixte de Recherche 8627, Université Paris-Sud, Ba! timent 210, F-91405 Orsay, France
Communicated by Giorgio Parisi, University of Rome, Rome, Italy, December 8, 2005 (received for review June 18, 2005)
The systematic study of large-scale networks has unveiled the
ubiquitous presence of connectivity patterns characterized by
large-scale heterogeneities and unbounded statistical fluctuations.
These features affect dramatically the behavior of the diffusion
processes occurring on networks, determining the ensuing statistical properties of their evolution pattern and dynamics. In this
article, we present a stochastic computational framework for the
forecast of global epidemics that considers the complete worldwide air travel infrastructure complemented with census population data. We address two basic issues in global epidemic modeling: (i) we study the role of the large scale properties of the airline
transportation network in determining the global diffusion pat-
model including the full International Air Transport Association
(www.iata.org) database, aiming at a detailed study of the
interplay among the network structure and the stochastic features of the infection dynamics in defining the global spreading
of epidemics. In particular, whereas previous studies have generally been focused in the a posteriori analysis of real case studies
of global epidemics, the large-scale modeling presented here
allows us to address more basic theoretical issues such as the
statistical properties of the epidemic pattern and the effect on it
of the complex architecture of the underlying transportation
network. Finally, such a detailed level of description allows for
the quantitative assessment of the reliability of the obtained
Diseases and the Networks that
Transmit Them
✤
Contact networks are also important in understanding how diseases
spread through animal populations (e.g., the 2001 foot-and-mouth
outbreak in the UK) and plant populations.
✤
Similar models have been employed for studying the spread of
computer viruses...
Diseases and the Networks that
Transmit Them
✤
The pathogen and the network are closely intertwined: even within
the same population, the contact networks for two different diseases
can have very different structures, depending on the diseases’
respective modes of transmission.
✤
(Think of airborne transmission based on coughs and sneezes,
compared to a sexually transmitted disease, and think of the density
of the contact networks!)
Branching Processes
✤
The simplest model of contagion: every person is in contact with k
people
✤
First wave: a person carrying a new diseases enters a population
and transmits it to each of his contacts independently with
probability p.
✤
Second wave: each person in the first wave transmits to each of his
contacts independently with probability p (the contacts of people
are mutually exclusive)
✤
and so on..
Branching Processes
648
CHAPTER 21. EPIDEMICS
(a) The contact network for a branching process
Branching Processes
(a) The contact network for a branching process
(b) With high contagion probability, the infection spreads widely
Branching Processes
(b) With high contagion probability, the infection spreads widely
(c) With low contagion probability, the infection is likely to die out quickly
Figure 21.1: The branching process model is a simple framework for reasoning about the
spread of an epidemic as one varies both the amount of contact among individuals and the
The Basic Reproductive Number R0
✤
The basic reproductive number, denote R0, is the expected number of
new cases of the disease caused by a single individual.
✤
For the simple branching process we saw, we have R0=kp.
n the tree, no one in any future wave will be infected either.
So there are really only two possibilities for a disease in the branching process model:
ches a wave where it infects no one, thus dying out after a finite number of steps; or
tinues to infect people in every wave, proceeding infinitely through the contact netwo
0
d it turns out that there is a simple condition to tell these two possibilities apart, bas
a quantity called the basic reproductive number of the disease.
The basic reproductive number, denoted R0 , is the expected number of new cases of t
ase caused by a single individual. Since in our model everyone meets k new people a
cts each with probability p, the basic reproductive number here is given by R0 = p
outcome of the disease in a branching process model is determined by whether the ba
roductive number is smaller or larger than 1.
The Basic Reproductive Number R
Claim: If R0 < 1, then with probability 1, the disease dies out after a finite
number of waves. If R0 > 1, then with probability greater than 0 the disease
persists by infecting at least one person in each wave.
give a proof of this claim in Section 21.8. Even without the details of the proof, howev
can see that the basic condition expressed in the claim — comparing R0 to 1 — ha
ural intuitive basis. When R0 < 1, the disease isn’t able to replenish itself: each infect
son produces less than one new case in expectation, and so — even if it grows brie
to the outcome of random fluctuations — the size of the outbreak is constantly trendi
The Basic Reproductive Number R0
✤
Implication: It’s always good to reduce the value of R0!
✤
Quarantining people reduces k and encouraging behavioral measures
such as sanitary practices reduces p.
✤
Clearly, the branching process is too simplistic!
The SIR Epidemic Model
✤
An individual node goes three potential stages during the course of
the epidemic:
✤
Susceptible: Before the node has caught the disease, it is susceptible
to infection from its neighbors.
✤
Infectious: Once the node has caught the disease, it is infectious
and has some probability of infecting each of its susceptible
neighbors.
✤
Removed: After a particular node has experienced the full
infectious period, this node is removed from consideration.
The SIR Epidemic Model
✤
Given this three-stage “life cycle” for the disease at each node, a
model for epidemics on networks can be defined.
✤
The network structure: a directed graph representing the contact
network (edge u to v means that if u becomes infected, the disease has
the potential to spread to v).
✤
Two other quantities: p (the probability of contagion) and tI (the
length of infection)
between people, where either has the potential to directly infect the other, we can put in
directed edges pointing each way: both from v to w and also from w to v. Since contacts
between people are often symmetric, it is fine to use networks where most edges appear in
each direction, but it is sometimes convenient to be able to express asymmetric contacts as
well.
Now, each node has the potential to go through the Susceptible-Infectious-Removed
cycle, where we abbreviate these three states as S, I, and R. The progress of the epidemic
is controlled by the contact network structure and by two additional quantities: p (the
probability of contagion) and tI (the length of the infection).
The SIR Epidemic Model
• Initially, some nodes are in the I state and all others are in the S state.
• Each node v that enters the I state remains infectious for a fixed number of steps tI .
• During each of these tI steps, v has a probability p of passing the disease to each of its
susceptible neighbors.
• After tI steps, node v is no longer infectious or susceptible to further bouts of the
disease; we describe it as removed (R), since it is now an inert node in the contact
network that can no longer either catch or transmit the disease.
This describes the full model; we refer to it as the SIR model, after the three disease states
that nodes experience. Figure 21.2 shows an example of the SIR model unfolding on a
particular contact network through successive steps; in each step, shaded nodes with dark
The SIR Epidemic Model
652
CHAPTER 21. EPIDEMICS
t
t
s
u
s
u
y
y
z
x
r
z
x
v
r
v
w
w
(a)
(b)
t
t
s
u
s
u
y
y
r
z
x
z
x
v
r
v
w
w
(c)
(d)
The SIS Epidemic Model
✤
The SIR epidemic model is appropriate for epidemics in which each
individual contracts the disease at most once.
✤
To allow for nodes that can be reinfected multiple times, a model can
have only the S and I, but not R, states.
ividual contracts the disease at most once. However, a simple variation on these models
llows us to reason about epidemics where nodes can be reinfected multiple times.
To represent such epidemics, we have nodes that simply alternate between two possible
tates: Susceptible (S) and Infectious (I). There is no Removed state here; rather, after a
ode is done with the Infectious state, it cycles back to the Susceptible state and is ready to
atch the disease again. Because of this alternation between the S and I states, we refer to
he model as the SIS model.
Aside from the lack of an R state, the mechanics of the model follow the SIR process
ery closely.
The SIS Epidemic Model
• Initially, some nodes are in the I state and all others are in the S state.
• Each node v that enters the I state remains infectious for a fixed number of steps tI .
• During each of these tI steps, v has a probability p of passing the disease to each of its
susceptible neighbors.
• After tI steps, node v is no longer infectious, and it returns to the S state.
igure 21.5 shows an example of the SIS model unfolding on a three-node contact network
with tI = 1. Notice how node v starts out infected, recovers, and later becomes infected
gain — we can imagine this as the contact network within a three-person apartment, or a
hree-person family, where people pass a disease on to others they’re living with, and then
The SIS Epidemic Model
56
CHAPTER 21. EPIDEMIC
u
v
u
v
v
w
(a)
u
v
w
(b)
u
v
w
(c)
u
w
(d)
w
(e)
igure 21.5: In an SIS epidemic, nodes can be infected, recover, and then be infected aga
n each step, the nodes in the Infectious state are shaded.
A Connection Between SIR and SIS
658
CHAPTER 21. EPIDEMICS
u
v
u
v
u
v
u
v
u
v
w
w
w
w
w
step 0
step 1
step 2
step 3
step 4
(a) To represent the SIS epidemic using the SIR model, we use a “‘time-expanded” contact network
u
v
u
v
u
v
u
v
u
v
w
w
w
w
w
step 0
step 1
step 2
step 3
step 4
(b) The SIS epidemic can then be represented as an SIR epidemic on this time-expanded network.
The SIRS Epidemic Model
✤
Combines SIR and SIS:
✤
After an infected node recovers, it passes briefly through the R
state on its way back to the S state.
ti !t 1 1" ! 0
if ti !t" ! t0 .
NUMBER
130, until
PHYSICAL REVIEW LETTERS
26 MARCH 2001
VOLUME
sceptible element
stays86,
as such,
at t !
initial state is random with ninf !0" ! 0.1. The 400 time
infected. Once infected, it goes (deterministia cycle that lasts t0 time steps. During the first
steps shown are representative of the stationary state. We
canWorld
see clearly
a transition
from
an endemic situation toModel
an
s, it is infected and can potentially transmit the
Small
Effect
in
an
Epidemiological
oscillatory one. At p ! 0.01 (top), where the network is
susceptible neighbor. During the last tR time
nearly a regular lattice,
cycle, it remains in state R, immune but not
1, the stationary state is a fixed point,
1,2, †
Marcelo
Kuperman
and Guillermo
Abramson
with fluctuations. The*situation
corresponds
to that of an
After the cycle is complete, it returns to the
1
Centro Atómicoendemic
Bariloche
and Instituto
Balseiro,
8400 S.fraction
C. de Bariloche,
Argentina
infection,
with a low
and persistent
of instate.
2
Consejofected
Nacional
de Investigaciones
Científicas
y Técnicas,
Argentina
individuals.
At high values
of p — like
the case with
agion of a susceptible element by an infected
(Received
)
5 October
2000
p ! 0.9 shown
in the figure
(bottom)
— large
amplitude,
e subsequent excitation of the disease cycle in
oscillations
develop.
The situation
is almost
ected, occur stochastically atAa model
local level.
Say
for the
spreadself-sustained
of an infection
is analyzed
for different
population
structures. The interactions
periodic, with a very well defined period and small flucment i is susceptible, and that it has ki neighwithin the population are described by small world networks, ranging from ordered lattices to random
tuations in amplitude. The period is slightly longer than
ich kinf are infected. Then, i will become ingraphs. For the more ordered systems, there is a fluctuating endemic state of low infection. At a finite
t0 , since it includes the average time that a susceptible inprobability kinf $ki . Observe that i will become
value
of
the
disorder
of
the network, we find a transition to self-sustained oscillations in the size of the
h probability 1 if all its neighbors are infected.
dividual remains at state S, before being infected. Epideinfected
subpopulation.
parameter-free mechanism,
there may
be other
miologically, the situation resembles the periodic epidemic
choices. For example, if the susceptible had
patterns typical of large populations [3]. A mean field
DOI: 10.1103/PhysRevLett.86.2909
PACS numbers: 89.75.Hc, 05.65.+b, 87.19.Xx, 87.23.Ge
y q of contagion with each infected neighbor,
model of the system, expected to resemble the behavior at
ave a probability of infection %1 2 !1 2 q"kinf &.
p ! 1, can easily be shown to exhibit these oscillations.
transition
apparent
— in
ted that both these
criteria give qualitatively
the theThe
show thatisa sharp
transition
in the behavior of an infection
I. Introduction.—
How does
dynamics
of between
an infec-both behaviors
this relatively small system— at the intermediate value of
s for values of q & 0.2. For other values of q,
dynamics exists at a finite value of p.
tious disease depend on the structure of a population? A
rs are outlined at the end of Section III.
II. Epidemic model.—We analyze a simple model of the
great amount
of work has
been done on the phenomenoerical results.—We
have performed
extensive
spread of an infectious
disease. We want, mainly, to point
description
of particular
p = 0.01
imulations logical
of the described
model.
Networks epidemic0.4situations [1–4].
to the role played by the network structure on the temporal
classical
to these
problems deals
103 to 106 A
vertices
havemathematical
been explored,approach
with
0.2
0. A typicalwith
realization
starts with
the genera- where the subpopulations
dynamics of the epidemic. The disease has three stages:
well mixed
populations,
random network
and(typically
the initialization
of the infected, 0.0
susceptible (S),p =infected
(I), and refractory (R). An eleinvolved
susceptible,
and removed) in0.4
0.2
elements. teract
An initial
fraction
of
0.1
infected,
ment of the population is described by a single dynamical
in proportion to their sizes. With these zero dimensusceptible,
was
used
in
all
the
results
shown
0.2
variable adopting one of these three values. Susceptible
sional models it has been possible to study, among other
initial conditions have been explored as well,
0.0
elements can pass to the infected state through contagion
epidemic features, the existence of threshold
values for
nges have been observed in the behavior. After
0.4
= 0.9 Infected elements pass to the refracby an infected pone.
of anand
infection
[5], the asymptotic
solution for
a stationary the
statespread
is achieved,
the computa0.2
tory state after an infection time tI . Refractory elements
the density
of time
infected
the effect of
llowed for several
thousand
steps people
to per- [6–8], and
stochastic fluctuations on the modulation
of an epidemic
return to the susceptible state after a recovery time tR . This
ical averages.
0.0
5200
5300
5500 is usually
5600 called SIRS, for the cycle that a
in Fig. 1 part
of three
time
displaying
situation
[9].
A series
second
classical approach
describes
spa- 5400
kind of system
t
of infected tially
elements
in the system,
ninf !t". The such as elements in a
extended
subpopulations,
single
element goes over. The contagion is possible only
s correspond
to systems
with different
values spread of an epidemic
lattice.
In this,
the geographic
duringasthe
S phase,
and only by an I element. During the
FIG. 1. Fraction of infected elements
a function
of time.
der parameter: p ! 0.01 (top), 0.2 (middle),
Three time
series[10–13],
are shown, corresponding
values
can be analyzed as a reaction-diffusion
process
R phase, to
thedifferent
elements
are immune and do not infect. SIRS
ttom). The three systems have N ! 104 and
of the disorder parameter p, as shown in the legends. Other pabearing
such
systems, known to display relaxation
4
infection cycles
withclose
tI ! similarity
4 and tR !to9. paradigmatic
The
, K as
! 3, tI models
! 4, tR are
! 9,excitable
Ninf !0" ! 0.1.
rametersreactions
are N ! 10
The small-world model (p: probability of rewiring an edge)
ninf (t)
The number of infected people (ninf(t)) by SIRS epidemic:
Acknowledgments
✤
Slides on epidemics and networks are based on the book “Networks,
Crowds, and Markets” by Easley and Kleinberg.
Download