miRNAs and its Target gene Predictions -Shruthi & Surajit Outline Introduction to Neurons miRNA miRNA- mRNA interaction Results Computational algorithms for predicting the miRNA targets Novel methods Conclusions & Future directions References NEURON • Basis of the nervous system • Send signals throughout the body • 3 main parts 1. Dendrites 2. Cell body 3. Axon Dendrite Morphology & Neurological Disease The key functional role dendrites play in the establishment and maintenance of proper neuronal circuitry is illustrated in a number of neuropathological disease states including: Down Syndrome, Rett Syndrome, Fragile-X Syndrome, Autism, Fronto-Temporal Dementia, Alzheimer’s, Parkinson’s, Huntington’s, Schizophrenia & Muscular Dystrophies. In each of these disease states there exists strong neuro-anatomical correlation between specific dendritic abnormalities and cognitive impairments. Dendritic arbor complexity reduced in aging brains, including branches, synapses, and spines Normal Fragile X Drosophila dendritic arborization (da) neurons provide an excellent model for investigating class specific dendrite morphogenesis Powerful genetic tool and relevance to higher organism da neurons display stereotypical dendrite branching patterns with well characterized neuronal morphology and invariant spatial distributions. (4) distinct morphological subclasses; ability to analyze how neuronal diversity arises; how target fields are innervated; dendritic tiling and self-avoidance MIRNAS – ESSENTIAL POST-TRANSCRIPTIONAL REGULATORS OF GENE EXPRESSION miRNAs are ~22 nucleotides in length The human genome encodes about 1000s miRNAs which may target 60% of mammalian genes and are abundant in human cell types Regulators of maintenance, development, cell proliferation, differentiation • While miRNAs have emerged as critical post-transcriptional modulators of gene expression in neuronal development, very little is known regarding the roles of miRNA-mediated regulation of dendritic morphogenesis Why is it Important? miRNAs regulate the target mRNA and manage the production of the final protein output. It causes various functions like cell differentiation, proliferation, growth, mobility or apoptosis. Deregulation of the miRNA, plays critical role in the pathogenesis of genetics and multifactorial diseases, and is responsible for most human cancers. HOW DO MIRNAS REGULATE CLASSSPECIFIC DENDRITIC PATTERNING? 1. Which miRNAs may contribute to the specification of unique dendritic morphologies of da neuron subclasses? 1. What are the downstream dendritic arborization? targets of these miRNAs that control miR expression profiling To facilitate functional analyses of miRNA regulation in Drosophila dendrite morphogenesis, we conducted wholegenome miRNA expression profiling in class I, III, and IV da neurons via magnetic bead-based cell sorting. These analyses revealed 75 significantly expressed miRs in da neuron subclasses and differential expression for many of these miRs. Presented at right, are the top 30 differentially expressed miRs Gain of Function Results miR-mediated promotion of dendritic branching complexity in Class I da neurons M-7002 M-7159 M-7160 M-7101 M-7201 miR-mediated decrease of dendritic branching complexity in Class I da neurons M-7247 WT M-7127 M-7251 M-7106 M-7053 Target Prediction Methods The identification of this targets for the miRNA can be identified by biochemical process or computational process. Although the computational method is in wide use for the identification of the targets, the biochemical process of identification is the most important as it would give us more real life result. Here we will discuss few of the Target Prediction Algorithms and also discuss a method which we are trying to implement. Target ScaN Based on Sequence Search Similarity Developed by Lewis et.al. 2003. TargetScan predicts biological targets of miRNAs by searching for the presence of conserved 8mer ( a region of exact match to position 2-8 of mature miRNA, followed by Adeonisine) sites in the UTR, (in mRNA;also called seed matches) that match the seed region ( positions 2-7 of a matured miRNA) of each miRNA. Then it extends each seed match with additional base pairs to the miRNA as far as possible in each direction, allowing G:U pairs, but stopping at mismatches Using the RNAfold program (Hofacker et al., 1994), optimizes base pairing of the remaining 3’ portion of the miRNA to the 35 bases of the UTR (in the mRNA) immediately 5’ of each seed match thus extending each seed match to a longer “target site” using RNAeval (Hofacker et al., 1994) assigns a folding free energy G to each such miRNA : target site interaction (ignoring initiation free energy) assigns a Z score to each UTR where Z=∑e-Gk /T ,from k=1 to n Then the algorithm sorts the UTRs by the z-score and give a rank (Ri), and repeats the process for all UTRs. The target prediction is done for targets having Z score greater than or equal to a threshold(4.5)and Ri less than threshold (200-350) Miranda a) b) c) Based on Sequence Search Similarity Developed by Enright et.al. (2003) In this method along 3 criterions are taken into consideration to predict the Targets: Sequence matching to assess whether 2 sequences are complimentary and bind. Free energy calculation to estimate the energetic of physical interaction. Evolutionary conservation as an informational filter. This can be verified by 3 methods: a specific miRNA independently matches orthologous UTRs in both species sequences of detected target sites in both species exhibit more than a specified threshold of nucleotide identity (ID) with each other (threshold >=80% in D.pseudoobscura and >=60% in A.Gambiae) the positions of both target sites are equivalent according to a crossspecies UTR alignment Miranda Scores Miranda scores are the addition of complementarity scores and Free Energy All miRNA sequences are scanned against the 3' UTR datasets of D.Melanogaster,D.Pseudoobscura and A.gambiae. The thresholds used for hit detection are: initial SmithWaterman hybridization alignments must have S ≥ 80, and the minimum energy of the duplex structure ∆G ≤ -14 kcal/mol. Each hit between a miRNA and a UTR sequence is then scored according to the total energy and total score of all hits between those two sequences (complementary scores). Then a scan is done to detect whether the sequence of the targets are conserved or not. Then the conserved target sites are sorted and ranked according to their scores. For each miRNA 10 highest ranking genes are considered as targets. PITA Algorithm based on RNA-RNA duplex considering free energy minimization. Devoloped by Kertescz et.al. 2007 In this paper the authors, experimentally proved that mutations reducing target accessibility, reduce miRNA mediated translational repression. They also deduced site accessibility was an important method in miRNA-mRNA interaction. They deduced a Computational method to predict the same, PITA( Possibility of interaction by Target Acessibility). In this method the difference between the free energy gained to form the miRNA-Target duplex and energy lost, to unpair target to make it accessible to miRNA is considered as a parameter for scoring. CONS a. b. Most of the software are not accurate due to significant fraction of false positivity, which are caused by Limited comprehension of molecular basis of miRNAtarget pairing Changes during of post transcriptional regulation Solution With advanced experimental evidence of miRNA mechanism of target degradation, the target prediction with miRNA and gene expression profiles (obtained from microarrays) have been proposed to predict functional mRNA-miRNA relationship. Also it is experimentally observed that miRNA tend to down regulate the activity or expression of the target mRNA, it can be very well proposed that mRNA and miRNA are anticorrelated in nature. Some of the tools Web tools like mirGator and Diana-micro T web server, helps in clarification of biological pathways, processes and functions, through integration of target predictions with information from different genes, functional and protein database. Another web tool MMIA , integrates miRNA and mRNA data using significantly up-regulated and down-regulated data only, not taking into consideration whole expression profile, making it inefficient in the calculation of the whole genome expression anti correlation degree. MAGIA A novel tool which integrates target prediction and gene expression profile using Statistical tests like correlations and Bayesian methods, for matched or un-matched expression profiles, using miRNA-mRNA bipartite network reconstruction, gene functional enrichment and pathway annotation for results browsing. Snapshot 1 Snapshot2 SnapShot3 SnAPsHOT 5 snapshot 6 MIAMI Our tool MIAMI, is based on miRNA-mRNA target prediction tool, MAGIA. The web tool can be broadly classified into two parts: Query Section and Analysis Section. The Query Section allows users to retrieve and browse different target prediction databases including PITA, Miranda and TargetScan. The second part of the tool is dedicated on the analysis part of the tool which is the statistical analysis of the expression data which is given as input. It uses different statistical methods, like correlation and Distance correlation methods to generate miRNA-mRNA networks. hOMe pAGE Statistical Method We use Intragenic genes as the first step in understanding the Target-miRNA relation. We choose all the microarray results for Drosophilla Melanogaster from Gene Expression Omnibus(GEO) and then divide it in lieu of the platforms used(Agilent, Affymetrix, etc.) Then we use 3 statistical tests to determine the targets: Pearson, Distance correlation and Brownian Covariance. Results Conclusion The statistical methods used in this cases helps to identify the linear relationship (Pearson) as well as the nonlinear relationship (Distance Correlation and Brownian Covariance) between the Host-Target relationship. Our tool uses expression value to understand the Target-Host relationship and then uses the various Target prediction software in understanding whether the Targets hold good in other algorithms too, making it a more robust approach. References 1.Principles of Biochemistry,Lehninger 2.Exploring complex miRNA-mRNA interactions with Bayesian networks by splittingaveraging strategy-Bing Liu,Jiuyong Li, Anna Tsykin,Lin Liu,Arti B Gaur and Gregory J Goodall 3.MAGIA, a web-based tool for miRNA and Genes Integrated Analysis-Gabriele Sales, Alessandro Coppe, Andrea Bisognin, Marta Biasiolo,Stefania Bortoluzzi and Chiara Romualdi 4.http://onlinestatbook.com/chapter4/pearson.html-for Pearson Defination 5.Brownian Distance Covariance- Gabor J. Szekely and Maria L. Rizzo 6. Nam,S., Kim,B., Shin,S. and Lee,S. (2008) miRGator: an integrated system for functional annotation of microRNAs. Nucleic Acids Res., 36, D159–D164. 7. Maragkakis,M., Reczko,M., Simossis,V.A., Alexiou,P.,Papadopoulos,G.L., Dalamagas,T., Giannopoulos,G., Goumas,G.,Koukis,E., Kourtis,K. et al. (2009) DIANAmicroT web server:elucidating microRNA functions through target prediction. 8.Nam,S., Li,M., Choi,K., Balch,C., Kim,S. and Nephew,K.P.(2009) MicroRNA and mRNA integrated analysis (MMIA): a web tool for examining biological functions of microRNAexpression. Nucleic Acids Res., 37, W356–W362 Nucleic Acids Res., 37, W273–W276. 9. R Development Core Team (2011). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-070, URL http://www.R-project.org/. 10.Maria L. Rizzo and Gabor J. Szekely (2011). energy: E-statistics (energy statistics). R package version 1.4-0. http://CRAN.R-project.org/package=energy 11.Vincenzo Alessandro Gennarino,Marco Sardiello,Raffaella Avellino,Nicola Meola,Vincenza Maselli,Santosh Anand,Luisa Cutillo,Andrea Ballabio,and Sandro Banfi:MicroRNA target prediction by expressionanalysis of host genes 12.J. Reimand, M. Kull, H. Peterson, J. Hansen, J. Vilo: g:Profiler -- a web-based toolset for functional profiling of gene lists from large-scale experiments (2007) 13.J. Reimand, T. Arak, J. Vilo: g:Profiler -- a web server for functional interpretation of gene lists (2011 update) Nucleic Acids Research 2011 14.Christopher Brown (2011). hash: Full feature implementation of hash/associated arrays/dictionaries. R package version 2.1.0. http://CRAN.Rproject.org/package=hash 15.https://statistics.laerd.com/statistical-guides/pearson-correlation-coefficientstatistical-guide.php 16.http://critical-numbers.group.shef.ac.uk/glossary/correlation.html 17. J. L. Rodgers and W. A. Nicewander. Thirteen ways to look at the correlation coefficient. The American Statistician, 42(1):59–66, February 1988. 18.http://docs.ggplot2.org/current/ 19.http://ggplot2.org/resources/2007-vanderbilt.pdf 20. Székely, G. J. Rizzo, M. L. and Bakirov, N. K. (2007). "Measuring and testing independence by correlation of distances" 21.Prediction of Mammalian MicroRNA Targets:Benjamin P. Lewis,I-hung Shih,Matthew W. JonesRhoades,David P. Bartel and Christopher B. Burge.2003.Cell 22.The role of site accessibility in microRNA target recognition-Michael Kertesz, Nicola Iovino, Ulrich Unnerstall, Ulrike Gaul and Eran Segal.2007. Nature genetics 23.MicroRNA targets in Drosophila-Anton J Enright, Bino John, Ulrike Gaul, Thomas Tuschl, Chris Sander and Debora S Marks