Bioinformatics: Network Analysis Networks as a Guiding Tool Luay Nakhleh, Rice University

Bioinformatics: Network Analysis Networks as a Guiding Tool COMP 572 (BIOS 572 / BIOE 564) - Fall 2013 Luay Nakhleh, Rice University 1 ✤ Networks have been used to ✤ guide GWAS ✤ predict protein function ✤ model epidemics ✤ ... 2 Vol. 29 ISMB/ECCB 2013, pages i171–i179 doi:10.1093/bioinformatics/btt238 BIOINFORMATICS Efficient network-guided multi-locus association mapping with graph cuts Chloé-Agathe Azencott1,*, Dominik Grimm1, Mahito Sugiyama1, Yoshinobu Kawahara2 and Karsten M. Borgwardt1,3 1 Machine Learning and Computational Biology Researchet Group, C.-A.Azencott al. Max Planck Institute for Developmental Biology & Max Planck Institute for Intelligent Systems Spemannstr. 38, 72076 Tübingen, Germany, 2The Institute of Scientific and Industrial Research (ISIR) Osaka University 8-1 Mihogaoka, Ibaraki-shi, Osaka 567-0047, Japan and 3Zentrum für Bioinformatik, Eberhard Karls Universität Tübingen, 72076 Tübingen, Germany (a) (b) Downloaded from http://bioinformatics.oxfordjournals.org/ at Rice University on November 21, 2013 sets of genetic variants to the phenotype has the potential to help explain part of this missing heritability (Marchini et al., 2005). Motivation: As an increasing number of genome-wide association Although efficient multiple linear regression approaches (Cho studies reveal the limitations of the attempt to explain phenotypic et al., 2010; Rakitsch et al., 2012; Wang et al., 2011) make the heritability by single genetic loci, there is a recent focus on detection of such multivariate associations possible, they often associating complex phenotypes with sets of genetic loci. Although remain limited in power and hard to interpret. Incorporating several methods for multi-locus mapping have been proposed, it is biological knowledge into these approaches could help boosting often unclear how to relate the detected loci to the growing knowledge their power and interpretability. However, current methods are about gene pathways and networks. The few methods that take limited to predefining a reasonable number of candidate sets to biological pathways or networks into account are either restricted to investigating a limited number of predetermined sets of loci or do not investigate (Cantor et al., 2010; Fridley and Biernacka, 2011; Wu scale to genome-wide settings. et al., 2011), for instance by relying on gene pathways. They (c) Results: We present SConES, a new efficient method to discover sets consequently run the risk of missing biologically relevant loci of genetic loci that are maximally associated with a phenotype while that have not been included in the candidate sets. This risk is being connected in an underlying network. Our approach is based on made even likelier by the incomplete state of our current bioloa minimum cut reformulation of the problem of selecting features gical knowledge. under sparsity and connectivity constraints, which can be solved For this reason, our goal here is to use prior knowledge in a exactly and rapidly. more flexible way. We propose to use a biological network, SConES outperforms state-of-the-art competitors in terms of runtime, defined between SNPs, to guide a multi-locus mapping approach scales to hundreds of thousands of genetic loci and exhibits higher that is both efficient to compute and biologically meaningful: We power in detecting causal SNPs in simulation studies than other methaim to find a set of SNPs that (i) are maximally associated with a ods. On flowering time phenotypes and genotypes from Arabidopsis given phenotype and (ii) tend to be connected in a given biological thaliana, SConES detects loci that enable accurate phenotype predicnetwork. In addition, this set must be computed efficiently on tion and that are supported by the literature. genome-wide data. In this article, we assume an additive model Availability: Code is available at http://webdav.tuebingen.mpg.de/u/ to characterize multi-locus association. The network constraint karsten/Forschung/scones/. stems from the assumption that SNPs influencing the same Fig. 1. Small examples of the three types of networks considered Contact: chloe-agathe.azencott@tuebingen.mpg.de phenotype are biologically linked. However, the diversity of the Supplementary information: Supplementary data are available at type of relationships that this can encompass, together with the Bioinformatics online. current incompleteness of biological knowledge, makes providing a networknetwork: in which all the relevant connections as are in present ! GM (gene membership) SNPs are connected unlikely. For this reason, although we want to encourage the 1 INTRODUCTION the sequence network described earlier in the text; in addSNPs to form a subnetwork of the network, we also do not Twin and family/pedigree studies make it possible to estimate the near ition, SNPs same that genethey aremust linked as well. wantthe to enforce form atogether single connected compoABSTRACT pairs of features connected by an e a given size yields the so-called gra taken by Huang et al. (2009): th encourages selecting a small nu blocks are sets of features define of the problem. In the case of a g are defined as small connected shown in Mairal and Yu (2011) aforementioned is a relaxation o number of linear subgraphs or co size grows exponentially with the which can reach millions in the ca only the edge-based version of the our problem. It is however unclea ture long-range connections betw Li and Li (2008) propose a netw Lasso that imposes the type of g sirable. However, their approach works of genes (rather than of SN easily to the datasets we envisio they propose relies on a singula Laplacian of the network, whic cannot be stored in memory. Chuang et al. (2007) also sear protein interaction networks that a phenotype; however, their greed 3 Network-Assisted Investigation of Combined Causal Signals from Genome-Wide Association Studies in Schizophrenia Peilin Jia1,2, Lily Wang3, Ayman H. Fanous4,5,6,7, Carlos N. Pato7, Todd L. Edwards8,9, The International Schizophrenia Consortium", Zhongming Zhao1,2,10* 1 Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America, 2 Department of Psychiatry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America, 3 Department of Biostatistics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America, 4 Department of Psychiatry and Virginia Institute for Psychiatric and Behavior Genetics, Virginia Commonwealth University, Richmond, Virginia, United States of America, 5 Washington VA Medical Center, Washington, D.C., United States of America, 6 Department of Psychiatry, Georgetown University School of Medicine, Washington, D.C., United States of America, 7 Department of Psychiatry, Keck School of Medicine of the University of Southern California, Los Angeles, California, United States of America, 8 Center for Human Genetics Research, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America, 9 Division of Epidemiolgy, Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America, 10 Department of Cancer Biology, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America Abstract With the recent success of genome-wide association studies (GWAS), a wealth of association data has been accomplished for more than 200 complex diseases/traits, proposing a strong demand for data integration and interpretation. A combinatory analysis of multiple GWAS datasets, or an integrative analysis of GWAS data and other high-throughput data, has been particularly promising. In this study, we proposed an integrative analysis framework of multiple GWAS datasets by overlaying association signals onto the protein-protein interaction network, and demonstrated it using schizophrenia datasets. Building on a dense module search algorithm, we first searched for significantly enriched subnetworks for schizophrenia in each single GWAS dataset and then implemented a discovery-evaluation strategy to identify module genes with consistent association signals. We validated the module genes in an independent dataset, and also examined them through meta-analysis of the related SNPs using multiple GWAS datasets. As a result, we identified 205 module genes with a joint effect significantly associated with schizophrenia; these module genes included a number of well-studied candidate genes such as DISC1, GNA12, GNA13, GNAI1, GPR17, and GRIN2B. Further functional analysis suggested these genes are involved in neuronal related processes. Additionally, meta-analysis found that 18 SNPs in 9 module genes had Pmeta,161024, including the gene HLA-DQA1 located in the MHC region on chromosome 6, which was reported in previous studies using the largest cohort of schizophrenia patients to date. These results demonstrated our bi-directional networkbased strategy is efficient for identifying disease-associated genes with modest signals in GWAS datasets. This approach can be applied to any other complex diseases/traits where multiple GWAS datasets are available. Citation: Jia P, Wang L, Fanous AH, Pato CN, Edwards TL, et al. (2012) Network-Assisted Investigation of Combined Causal Signals from Genome-Wide Association Studies in Schizophrenia. PLoS Comput Biol 8(7): e1002587. doi:10.1371/journal.pcbi.1002587 Editor: Frederick P. Roth, Harvard Medical School, United States of America Received October 31, 2011; Accepted May 15, 2012; Published July 5, 2012 Copyright: ! 2012 Jia et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted 4 Network-based prediction of protein func REVIEW Network-based prediction of protein function Network-based function prediction R Sharan et al 1 Roded Sharan , Igor Ulitsky1 and Ron Shamir* networks, with nodes networks, with nodes representing representing proteins and ed Roded Sharan1, Igor Ulitsky1 and Ron Shamir* the detecte representing the detected PPIs. In this review, we su School of Computer Science, Tel Aviv University, Tel Aviv, Israel In this review, we survey the growing body of works School of Computer Science, Tel Aviv University, Tel Aviv, Israel Corresponding School of Computer Science, Tel Aviv University, ** Corresponding author. author. School of Computer Science, Tel Aviv University, functional annotation offunctional proteins via annotation their network Tel 69978, Israel. Tel.: þ 972 3 6405383; Fax: 972 3 6405384; TelAvivAviv 69978, Israel. Tel.: þ 972 3þ 6405383; Fax: þ 972interactions 3 6405384;(summarizedinteractions in Table I). We(summariz distinguish t E-mail: rshamir@tau.ac.il types of approaches (Figure 2): direct annotation schem 1 E-mail: rshamir@tau.ac.il These authors contributed equally to this work. types ofbased approaches (F 1 which infer the function of a protein on its connecti These authors contributed equally to this work. in the network, and module-assisted which fi which inferschemes, the function Received 20.9.06; accepted 9.1.07 identify modules of related proteins and then annotate e in the network, and m Received 20.9.06; accepted 9.1.07 module based on the known functions of its memb of rel Naturally, the presentedidentify methods modules and the emphasis particular ones reflect themodule opinions of based the authors. on the Functional annotation of proteins is a fundamental problem in the post-genomic era. The recent availability of protein Naturally, the present interaction networks for many model species has spurred particular ones reflect th on the development of computational methods interFunctional annotation of proteins is for a fundamental problem Direct methods preting such data in order to elucidate protein function. In in the post-genomic era. The recent availability of protein this review, we describe the current computational common principle underlying all direct methods interaction networks for many modelwhich speciesThe has spurred approaches for the task, including direct methods, functional annotation is that proteins that lie closer to propagate information the network, and on the functional development of through computational methods forin interanother the PPI network are more likely to have sim module-assisted methods, which infer functional modules As can preting such data in order to elucidate proteinfunction. function. Inbe seen in Figure 3, there is an evid within the network and use those for the annotation task. correlation between network distance and functional distan this review, we describe current Although a broad variety of interestingthe approaches has computational that is, the closer the two proteins are in the network the m The common principle been developed, further progress in the field will depend on approaches for the task, including direct methods, similar arewhich their functional annotations. The methods functional annotation i systematic evaluation of the methods and their disseminascribed below differ in the way they capture and exploit propagate information the network, andthe functionsanother Figure 2 biological Directfunctional versus module-assisted approaches for functionalthrough annotation. The scheme shows a network in which of some proteins are known (top), tion in the community. in isthe netw correlation. In the following, we denote PPIPPI network a where each function is indicated by a different color. Unannotated proteins are in white. In the direct methods (left), these proteins are assigned a color thatthe unusually Molecular Systems Biology 13 March 2007; module-assisted methods, functional modules prevalent among their neighbors. The direction of thewhich edges indicatesinfer the influence of the annotated proteins on the unannotated In the module-assisted methods graph G¼(V,E) (see Boxones. 1function. for graph-theoretic definitions). As can be se (right), modules are first identified based on their density. Then, within each module, unannotated proteins are assigned a function that is unusually prevalent in the doi:10.1038/msb4100129 Direct methods within the network and use those for the annotation task. approaches has module. In both methods, proteins may bemethods; assigned withproteins several functions. Subject Categories: computational Although a broadfunction variety of protein interesting Keywords: data integration; prediction; interaction correlation between net 5 that is, the closer the tw Epidemics and Networks 6 Diseases and the Networks that Transmit Them ✤ The patterns by which epidemics spread through a population is determined not just by the properties of the pathogen carrying it (contagiousness, the length of the infection period, severity, etc.), but also by network structures within the population it is affecting. ✤ The opportunities for a disease to spread are given by a contact network: there is a node for each person, and an edge if two people come into contact with each other in a way that makes it possible for the disease to spread from one to the other. 7 Diseases and the Networks that Transmit Them ✤ Accurately modeling the underlying network is crucial to understanding the spread of an epidemic. 8 pling, a, of e observed ille equation mT. We then n, the change ic field in e also T in static orientation MF to equal xperiments. ...how travel patterns within a city affect the spread of disease .............................................................. ARTICLE IN PRESS Modelling disease outbreaks in realistic urban social networks 1 2 1 1 Stephen Eubank , Hasan Guclu , V. S. Anil Kumar , Madhav V. Marathe , Journal of Theoretical Biology 232 (2005) 71–81 Aravind Srinivasan3, Zoltán Toroczkai4 & Nan Wang5 www.elsevier.com/locate/y 1 Basic and Applied Simulation Science Group, Los Alamos National Laboratory, MS M997, Los Alamos, New Mexico 87545, USA Network theory and SARS: predicting outbreak diversity 2 Department of Physics, Applied Physics and Astronomy, Rensselaer Polytechnic a,b,!Troy, ,1 c,1,2 b,d Institute 110 8th Street, New York 12180-3590, USA Lauren Ancel Meyers , Babak Pourbohloul , M.E.J. Newman , –64 (1972). 3 c,2 Studies, Department of Computer Science andc,2 Institute for Advanced Computer 5). Danuta M. Skowronski , Robert C. Brunham University of Maryland, College Park, Maryland 20742, USA n animals. 4 ion of Integrative Biology and for Institute for Cellular and Molecular Biology, University Texas at Austin, 1 University Station C0930, Austin, Centre Nonlinear Studies and Complex Systems of Group, Los Alamos National 78712, USA b c. Lond. B 263, Laboratory, Santa MS B258, Los Alamos, Mexico 87545, USA Fe Institute, 1399 HydeNew Park Road, Santa Fe, NM 87501, USA 5 c University of British Columbia Centre for DiseaseScience, Control, 655 West 12th of Avenue, Vancouver, British Park, Columbia, Canada V5Z 4R4 Department of Computer University Maryland, College ter for the Study of Complex Systems, University of Michigan, Randall Laboratory, 500 E. University Ave., Ann Arbor, MI 48109-1120, U agnetite-based Maryland 20742, USA 9 ...how travel patterns via the worldwide airline network affect the spread of disease The role of the airline transportation network in the prediction and predictability of global epidemics Vittoria Colizza*, Alain Barrat†, Marc Barthélemy*‡, and Alessandro Vespignani*§ *School of Informatics and Center for Biocomplexity, Indiana University, Bloomington, IN 47401; and †Centre National de la Recherche Scientifique, Unité Mixte de Recherche 8627, Université Paris-Sud, Ba! timent 210, F-91405 Orsay, France Communicated by Giorgio Parisi, University of Rome, Rome, Italy, December 8, 2005 (received for review June 18, 2005) The systematic study of large-scale networks has unveiled the ubiquitous presence of connectivity patterns characterized by large-scale heterogeneities and unbounded statistical fluctuations. These features affect dramatically the behavior of the diffusion processes occurring on networks, determining the ensuing statistical properties of their evolution pattern and dynamics. In this article, we present a stochastic computational framework for the forecast of global epidemics that considers the complete worldwide air travel infrastructure complemented with census population data. We address two basic issues in global epidemic modeling: (i) we study the role of the large scale properties of the airline transportation network in determining the global diffusion pattern of emerging diseases; and (ii) we evaluate the reliability of forecasts and outbreak scenarios with respect to the intrinsic model including the full International Air Transport Association (www.iata.org) database, aiming at a detailed study of the interplay among the network structure and the stochastic features of the infection dynamics in defining the global spreading of epidemics. In particular, whereas previous studies have generally been focused in the a posteriori analysis of real case studies of global epidemics, the large-scale modeling presented here allows us to address more basic theoretical issues such as the statistical properties of the epidemic pattern and the effect on it of the complex architecture of the underlying transportation network. Finally, such a detailed level of description allows for the quantitative assessment of the reliability of the obtained forecast with respect to the stochastic nature of the disease 10 Diseases and the Networks that Transmit Them ✤ Contact networks are also important in understanding how diseases spread through animal populations (e.g., the 2001 foot-and-mouth outbreak in the UK) and plant populations. ✤ Similar models have been employed for studying the spread of computer viruses... 11 Diseases and the Networks that Transmit Them ✤ The pathogen and the network are closely intertwined: even within the same population, the contact networks for two different diseases can have very different structures, depending on the diseases’ respective modes of transmission. ✤ (Think of airborne transmission based on coughs and sneezes, compared to a sexually transmitted disease, and think of the density of the contact networks!) 12 Branching Processes ✤ The simplest model of contagion: every person is in contact with k people ✤ First wave: a person carrying a new diseases enters a population and transmits it to each of his contacts independently with probability p. ✤ Second wave: each person in the first wave transmits to each of his contacts independently with probability p (the contacts of people are mutually exclusive) ✤ and so on.. 13 Branching Processes 648 CHAPTER 21. EPIDEMICS (a) The contact network for a branching process 14 Branching Processes (a) The contact network for a branching process (b) With high contagion probability, the infection spreads widely 15 Branching Processes (b) With high contagion probability, the infection spreads widely (c) With low contagion probability, the infection is likely to die out quickly Figure 21.1: The branching process model is a simple framework for reasoning about the spread of an epidemic as one varies both the amount of contact among individuals and the level of contagion. 16 The Basic Reproductive Number R0 ✤ The basic reproductive number, denote R0, is the expected number of new cases of the disease caused by a single individual. ✤ For the simple branching process we saw, we have R0=kp. 17 n the tree, no one in any future wave will be infected either. So there are really only two possibilities for a disease in the branching process model: ches a wave where it infects no one, thus dying out after a finite number of steps; or tinues to infect people in every wave, proceeding infinitely through the contact netwo 0 d it turns out that there is a simple condition to tell these two possibilities apart, bas a quantity called the basic reproductive number of the disease. The basic reproductive number, denoted R0 , is the expected number of new cases of t ase caused by a single individual. Since in our model everyone meets k new people a cts each with probability p, the basic reproductive number here is given by R0 = p outcome of the disease in a branching process model is determined by whether the ba roductive number is smaller or larger than 1. The Basic Reproductive Number R Claim: If R0 < 1, then with probability 1, the disease dies out after a finite number of waves. If R0 > 1, then with probability greater than 0 the disease persists by infecting at least one person in each wave. give a proof of this claim in Section 21.8. Even without the details of the proof, howev can see that the basic condition expressed in the claim — comparing R0 to 1 — ha ural intuitive basis. When R0 < 1, the disease isn’t able to replenish itself: each infect son produces less than one new case in expectation, and so — even if it grows brie to the outcome of random fluctuations — the size of the outbreak is constantly trendi 18 The Basic Reproductive Number R0 ✤ Implication: It’s always good to reduce the value of R0! ✤ Quarantining people reduces k and encouraging behavioral measures such as sanitary practices reduces p. 19 ✤ Clearly, the branching process is too simplistic! 20 The SIR Epidemic Model ✤ An individual node goes three potential stages during the course of the epidemic: ✤ Susceptible: Before the node has caught the disease, it is susceptible to infection from its neighbors. ✤ Infectious: Once the node has caught the disease, it is infectious and has some probability of infecting each of its susceptible neighbors. ✤ Removed: After a particular node has experienced the full infectious period, this node is removed from consideration. 21 The SIR Epidemic Model ✤ Given this three-stage “life cycle” for the disease at each node, a model for epidemics on networks can be defined. ✤ The network structure: a directed graph representing the contact network (edge u to v means that if u becomes infected, the disease has the potential to spread to v). ✤ Two other quantities: p (the probability of contagion) and tI (the length of infection) 22 between people, where either has the potential to directly infect the other, we can put in directed edges pointing each way: both from v to w and also from w to v. Since contacts between people are often symmetric, it is fine to use networks where most edges appear in each direction, but it is sometimes convenient to be able to express asymmetric contacts as well. Now, each node has the potential to go through the Susceptible-Infectious-Removed cycle, where we abbreviate these three states as S, I, and R. The progress of the epidemic is controlled by the contact network structure and by two additional quantities: p (the probability of contagion) and tI (the length of the infection). The SIR Epidemic Model • Initially, some nodes are in the I state and all others are in the S state. • Each node v that enters the I state remains infectious for a fixed number of steps tI . • During each of these tI steps, v has a probability p of passing the disease to each of its susceptible neighbors. • After tI steps, node v is no longer infectious or susceptible to further bouts of the disease; we describe it as removed (R), since it is now an inert node in the contact network that can no longer either catch or transmit the disease. This describes the full model; we refer to it as the SIR model, after the three disease states that nodes experience. Figure 21.2 shows an example of the SIR model unfolding on a particular contact network through successive steps; in each step, shaded nodes with dark borders are in the I state and shaded nodes with thin borders are in the R state. 23 The SIR Epidemic Model 652 CHAPTER 21. EPIDEMICS t t s u s u y y z x r z x v r v w w (a) (b) t t s u s u y y r z x z x v r v w w (c) (d) 24 The SIS Epidemic Model ✤ The SIR epidemic model is appropriate for epidemics in which each individual contracts the disease at most once. ✤ To allow for nodes that can be reinfected multiple times, a model can have only the S and I, but not R, states. 25 ividual contracts the disease at most once. However, a simple variation on these models llows us to reason about epidemics where nodes can be reinfected multiple times. To represent such epidemics, we have nodes that simply alternate between two possible tates: Susceptible (S) and Infectious (I). There is no Removed state here; rather, after a ode is done with the Infectious state, it cycles back to the Susceptible state and is ready to atch the disease again. Because of this alternation between the S and I states, we refer to he model as the SIS model. Aside from the lack of an R state, the mechanics of the model follow the SIR process ery closely. The SIS Epidemic Model • Initially, some nodes are in the I state and all others are in the S state. • Each node v that enters the I state remains infectious for a fixed number of steps tI . • During each of these tI steps, v has a probability p of passing the disease to each of its susceptible neighbors. • After tI steps, node v is no longer infectious, and it returns to the S state. igure 21.5 shows an example of the SIS model unfolding on a three-node contact network with tI = 1. Notice how node v starts out infected, recovers, and later becomes infected gain — we can imagine this as the contact network within a three-person apartment, or a hree-person family, where people pass a disease on to others they’re living with, and then 26 The SIS Epidemic Model 56 CHAPTER 21. EPIDEMIC u v u v v w (a) u v w (b) u v w (c) u w (d) w (e) igure 21.5: In an SIS epidemic, nodes can be infected, recover, and then be infected aga n each step, the nodes in the Infectious state are shaded. 27 A Connection Between SIR and SIS 658 CHAPTER 21. EPIDEMICS u v u v u v u v u v w w w w w step 0 step 1 step 2 step 3 step 4 (a) To represent the SIS epidemic using the SIR model, we use a “‘time-expanded” contact network u v u v u v u v u v w w w w w step 0 step 1 step 2 step 3 step 4 (b) The SIS epidemic can then be represented as an SIR epidemic on this time-expanded network. 28 The SIRS Epidemic Model ✤ Combines SIR and SIS: ✤ After an infected node recovers, it passes briefly through the R state on its way back to the S state. 29 ti !t 1 1" ! 0 if ti !t" ! t0 . NUMBER 130, until PHYSICAL REVIEW LETTERS 26 MARCH 2001 VOLUME usceptible element stays86, as such, at t ! initial state is random with ninf !0" ! 0.1. The 400 time infected. Once infected, it goes (deterministia cycle that lasts t0 time steps. During the first steps shown are representative of the stationary state. We canWorld see clearly a transition from an endemic situation toModel an ps, it is infected and can potentially transmit the Small Effect in an Epidemiological oscillatory one. At p ! 0.01 (top), where the network is susceptible neighbor. During the last tR time nearly a regular lattice, e cycle, it remains in state R, immune but not 1, the stationary state is a fixed point, 1,2, † Marcelo Kuperman * and Guillermo Abramson with fluctuations. The situation corresponds to that of an After the cycle is complete, it returns to the 1 Centro Atómico Bariloche and Instituto Balseiro, 8400 S.fraction C. de Bariloche, Argentina endemic infection, with a low and persistent of instate. 2 Consejofected Nacional de Investigaciones Científicas y Técnicas, Argentina individuals. At high values of p — like the case with agion of a susceptible element by an infected (Received ) 5 October 2000 p ! 0.9 shown in the figure (bottom) — large amplitude, e subsequent excitation of the disease cycle in oscillations develop. The situation is almost ected, occur stochastically atAa model local level. Say for the spreadself-sustained of an infection is analyzed for different population structures. The interactions with a very well defined period and small flucment i is susceptible, andwithin that it the has population ki neigh- areperiodic, described by small world networks, ranging from ordered lattices to random tuations in amplitude. period is endemic slightly longer than ich kinf are infected. Then, i will become ingraphs. For the more ordered systems, there isThe a fluctuating state of low infection. At a finite t probability kinf $ki . Observe that i will become , since it includes the average time that a susceptible in0 value of the disorder of the network, we find a transition to self-sustained oscillations in the size of the th probability 1 if all its neighbors are infected. dividual remains at state S, before being infected. Epideinfected subpopulation. s parameter-free mechanism, there may be other miologically, the situation resembles the periodic epidemic choices. For example, ifDOI: the susceptible had patterns typical of large populations [3]. A mean field 10.1103/PhysRevLett.86.2909 PACS numbers: 89.75.Hc, 05.65.+b, 87.19.Xx, 87.23.Ge y q of contagion with each infected neighbor, model of the system, expected to resemble the behavior at ave a probability of infection %1 2 !1 2 q"kinf &. p ! 1, can easily be shown to exhibit these oscillations. transition apparent — in ted that both these criteria give qualitatively the the The show thatisa sharp transition in the behavior of an infection I. Introduction.— How does dynamics of between an infec-both behaviors this relatively small system— at the intermediate value of s for valuestious of q & 0.2. For other values of q, dynamics exists at a finite value of p. disease depend on the structure of a population? A rs are outlined at the end of Section III. II. Epidemic model.—We analyze a simple model of the great amount of work has been done on the phenomenoerical results.—We have performed extensive spread of an infectious disease. We want, mainly, to point description of particular p = 0.01 imulations logical of the described model. Networks epidemic0.4situations [1–4]. to the role played by the network structure on the temporal classical to these problems deals 103 to 106 A vertices havemathematical been explored,approach with 0.2 dynamics of the epidemic. The disease has three stages: well mixed populations, 0. A typicalwith realization starts with the genera- where the subpopulations susceptible (S), infected (I), and refractory (R). An eleinvolved susceptible, and removed) inrandom network and(typically the initialization of the infected, 0.0 0.4 p = 0.2 elements. teract An initial fraction of 0.1 infected, ment of the population is described by a single dynamical in proportion to their sizes. With these zero dimensusceptible, was used in allitthe 0.2 variable adopting one of these three values. Susceptible sional models hasresults beenshown possible to study, among other initial conditions havefeatures, been explored as well, of threshold values for elements can pass to the infected state through contagion epidemic the existence 0.0 nges have been observed in the behavior. After 0.4 = 0.9 Infected elements pass to the refracby an infected pone. the spread of an infection [5], the asymptotic solution for a stationary state is achieved, and the computatory state after an infection time tI . Refractory elements density of time infected the effect of 0.2 llowed for the several thousand steps people to per- [6–8], and stochastic fluctuations on the modulation of an epidemic return to the susceptible state after a recovery time tR . This ical averages. 0.0 situation [9].time A series second classical approach describes spa- 5400 kind of system 5200 5300 5500 is usually 5600 called SIRS, for the cycle that a in Fig. 1 part of three displaying t of infected tially elements in the system, ninf !t". The such as elements in a extended subpopulations, single element goes over. The contagion is possible only s correspond to systems with different values spread of an epidemic lattice. In this, the geographic duringasthe S phase, and only by an I element. During the FIG. 1. Fraction of infected elements a function of time. rder parameter: p !analyzed 0.01 (top), (middle), can be as 0.2 a reaction-diffusion process R phase, to thedifferent elements are immune and do not infect. SIRS Three time series[10–13], are shown, corresponding values 4 ottom). Thebearing three systems have N ! 10 and of the disorder parameter p, in theare legends. Other systems, paclose similarity to paradigmatic reactions such as as shown models excitable known to display relaxation 4 infection cycles with tI ! 4 and tR ! 9. The rameters are N ! 10 , K ! 3, tI ! 4, tR ! 9, Ninf !0" ! 0.1. Recall: the small-world model (p: probability of rewiring an edge) ninf (t) The number of infected people (ninf(t)) by SIRS epidemic: Belousov-Zhabotinskii’s. oscillations in mean field or well-mixed approaches. In 30 Acknowledgments ✤ Slides on epidemics and networks are based on the book “Networks, Crowds, and Markets” by Easley and Kleinberg. 31

Bioinformatics: Network Analysis Networks as a Guiding Tool Luay Nakhleh, Rice University

Related documents

Products

Support

Bioinformatics: Network Analysis Networks as a Guiding Tool Luay Nakhleh, Rice University

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib