Bioinformatics Growth in the Asian Region 20 years of Development Tan Tin Wee Founding Secretariat Asia Pacific Bioinformatics Network 29th August 2009 HKU-Pasteur Research Centre - EMBO Bioinformatics and Comparative Genomics Course 1 20 years: European Success Story EMBO, EMBL, EMBnet, EBI European Union European Commission European Molecular Biology Organisation European Molecular Biology Laboratories European Molecular Biology Network European Bioinformatics Institute 2 Europe-Asia Comparison Funds+Time+Cooperation+Culture EU/EC funding More than 20 years EMBO – solid history EMBnet – 20yr network of national nodes in over 20 countries. Well interconnected by GEANT2 Long time of high volume data generation and bioinformation creation Know how to share and cooperate No such Asia governmental body; may be APEC or ASEAN GIW nearly 20yrs; A-IMBN 12 yrs; APBioNet 11yrs A-IMBN – recent start 1997 APBioNet – bioinformatics node formation still in progress. Not well connected network. Only national links; last mile problems. Recent growth in data volume and bioinformation creation is still new. Asia still trying to learn how to catch up 3 Organisational Comparison EMBL DataLibrary EBI ECCB conference EU/EC EMBO EMBL EMBnet EMBnet National Nodes in Europe and Other places InCoB conf - A-IMBN eIMBL APBioNet AASBi APBC and GIW conf Asia Bioinformation Institute ? National Nodes? Not ready ABC? 4 Parallel Growth of Biology and Internet/IT in 20years Mosaic Web browser World Wide Web started Web 2.0 Internet widespread Grid Computing Cloud Computing Internet2 ??? Synergy 2000 1990 Genome Project Bacterial Genomes Start emerging Human Genome completed IT 2020 2010 Human Genome completed New Gen Seq BIO 5 Bioinformatics in Singapore 90’s 1991 – Biocomputing Interest Group 1992 – Bioinformatics User Support – first Gopher, WAIS server in Asia; hypertext web interface to bioinformatics software 1993 – First biological World Wide Web site biomed.nus.sg 1995 – Planning the formation of the Bioinformatics Centre (BIC) National Univ of Singapore 1996 – BioKleisli, BioPortal, integration projects 1997 – Bioinformatics Resource 1998 – BioMirror1 Project; APBioNet Formed 1999 – Remote solving of Xray structure of Bucandin 6 Bioinformatics outreach for Asia in the ‘90s Limited awareness Limited recognition Limited resources Limited manpower Mainly in Japan – GIW genome informatics workshop; DDBJ Sporadic efforts to introduce bioinformatics Late 90s – emergence of bioinformatics groups in Korea, Malaysia, China, Taiwan Efforts by WHO (recently TDRs); ICGEBnet formation (Trieste, Delhi); Pasteur’s efforts 7 Formation of APBioNet Help ourselves 1997 APEC TEL WG survey – great disparity of bioinformatics competence; poor inter-Asian connectivity. 1997 EMBnet nodes in Asia – 2. 1998 APBioNet formation in PSB’98 Hawaii 1998 till today – network, education, training, conferences, software, services … awareness low, organisation weak… 8 APBioNet objectives: Fostering the Growth of Bioinformatics and Allied Disciplines in the Asia Pacific Members from Australia, Bangladesh, Brunei Cambodia, China, India, Indonesia, Japan, Korea, Laos, Malaysia, New Zealand, Pakistan, Philippines, Russia, Saudi Arabia, Singapore, Thailand, Vietnam and in North America and Europe. 9 Members attend InCoB http://incob.apbionet.org International Conference on Bioinformatics Annually since 2002: Bangkok, Thailand (200) 2003: Penang, Malaysia (300) 2004: Auckland, New Zealand (200) 2005: Busan, South Korea (300) 2006: New Delhi, India (1,000) 2007: HongKong, China and Hanoi, Vietnam (300) 2008: Taipei, Taiwan 2009: Singapore 10 APBioNet-ASEAN-China 1st ASEAN China – 2004 Jingchu Luo Beijing 2nd ASEAN-China – 2008 Singapore 3rd ASEAN-China – 2009 Shanghai (TCM Bioinformatics and Systems Biology) 11 APBioNet Partners – East Asia Bioinformation Network EABN Sponsored by Korean Bioinformation Network 1st EABN – Busan Korea 2005 2nd EABN – Thailand (NSTDA) 2007 3rd EABN – Singapore (NUS) 2008 12 APBioNet Partners ASEAN-India – 1st ASEAN-India Bioinformatics Workshop 2005 – 2nd ASEAN-India Workshop 2006 Supported by FAOBMB, IUBMB, UNESCO, local inst. – – – – – – APBioNet Workshop in Bogor, Indonesia 2000 APBioNet Workshop in Malaysia 2000 APBioNet Workshop in Brunei, 2004 APBioNet Workhsop in Riyadh, Saudi Arabia 2005 APBioNet Workshop in Lahore, Pakistan 2006 APBioNet Workshop in Hanoi, Vietnam 2007 13 Top InCoB Papers have been published since 2006 http://www.biomedcentral.com/bmcbioinformatics/ www.bioinformation.net 14 Steady Growth of Research output in Asia From Elsevier 15 Citations 16 Transformation in 2000s Build Network Build Database Resources Build Computational Services Develop training software Hold training courses Coordinate and organise conferences, workshops, meetings Inject bioinformatics into curriculum Policy meetings at governmental and intergovernmental level 17 APAN and national RENs 1997 Singapore Research & Education Network (SINGAREN) Today every country has NREN of some kind or plans for one 18 TEIN 2 and TEIN 3 10M euros TransEurasian Information Network 19 BioMirror since 1998 http://www.bio-mirror.net/ 20 More Asian Databases emerging Original successful ones: DDBJ, KEGG, wwPDB (PDBj) Many more emerging but they are not sustainable e.g. ANTIMIC, SPDmod, EXINT, Xdom, etc. Growth in CJK - China Japan, Korea, Taiwan. 21 BioGrid 22 BioGrid Grid computing installable in three steps Funded by International Development Research Centre IDRC PAN-Asia Grant 2002 Azereus P2P database distribution system – Grant funded by IDRC in 2007-2009 - Thailand, KOBIC, NUS collaboration. 23 APBioGrid Computing 24 BioWorldWideWorkFlow 25 BioWorldWideWorkFlow 26 BioWorldWideWorkflowGrid mtg 27 Introducing WizFolio.com for bibliographic sharing and management To manage and share all the scientific papers you have read, and more, 28 Just sign up and login! Import From Clipboard Manual Entry Get Match Direct PDF Upload Online Search Highly optimized to PubMed Colleagues Import from RIS WizAdd Direct Export from Webpages WizFolio Web 2.0 Organize Folder, Flags, Tags, Filters Locate Full-Text Articles (PDF) WizCite Seamless citation generation with live preview Sharing of References Word 2003 Word 2007 29 Bioinformatics on a Disk APBioKnoppix 1 and 2 Tim Littlejohn’s BioLateral disks LiveCD – Knoppix-based Extension of BioKnoppix (Puerto Rico) 30 BioSLAX virtual desktops BioSLax on CD BioSLax on DVD BioSlax on VMware BioSlax on USB stick (BioSlax on Youtube movie) BioSlax with MokaFive and VMPlayer 31 Bioinformatics and Databases in a Box: BioSLAX Server - 500 to 1Terabyte HDD Boxes - Plus BioSlax Server LiveCD - Tested in Institute of Biotechnology (IBT) Hanoi, 2007/2008 32 Institutional Support ASEAN ASEAN Dialogue Partners – China, India KOBIC ISCB FAOBMB IUBMB UNESCO Many universities Many national bioinformatics societies 33 S* Life Science Informatics Alliance A thousand trained online Online courses from 2001 to 2005 S-star council from Stanford, Sweden (Uppsala/Karolinska), Singapore, Sydney, South Africa, UCSanDiego Mirror sites in 10 places Secretariat in National Univ of Singapore. 34 LAMS Learning Activity Management System 35 CENTRA for online PBL 36 CanalAVIST and AVIST Asean Virtual Institute of Science and Technology CanalAVIST will utilize the Trans Eurasia Information Network (TEIN2) to deliver lectures and streaming VDO to members of TEIN2, ASEAN, Europe and Australia with H.263 to DVTS supported by an E-learning platform VCLASS (see http:// www.vclass.net) by the Internet Education and Research Lab, Asian Institute of Technology 37 WEBCB at InCoB2008 and 2009 Workshop on Education in Bioinformatics and Computational Biology 23rd Oct 2008 Taipei, Taiwan http://trg.apbionet.org/webcb/ Curriculum Development Accreditation Testing – India Bioinformatics certification examination system 38 APBioNet Historical Milestones 1997: Bioinformatics survey of the region 1998: First Meeting at PSB Hawaii 1998: Endorsement by APEC TEL 1998: APBioNet as first project of APAN 1999: APAN-APBioNet BioMirrors Project 2000: Involvement in S* Alliance; outreach start to Malaysia, Thailand, Brunei 2002: InCoB conference series 2003: IDRC Grant for BioGrid software 2004: APBioKnoppix; later BioSlax (2006) 2004: ASEAN-China Bioinformatics Workshop 2005: Affiliation with ISCB; ASEAN-India Cooperation; KOBIC cooperation 2006: InCoB Conference publishes in BMC Bioinformatics; outreach to Cambodia; Saudi Arabia; Pakistan; Vietnam 2007: IDRC grant for P2P dissemination of biomirror datasets; CanalAVIST project 2008: 2nd ASEAN-China; 3rd East Asia Bioinformation Network; 7th 39 InCoB; BioSlax Server version; Dialogue with EMBnet. What next? Conferences, Workshops, Meetings Training and Education Network Infrastructure Software Mirror Services … Future? 40 Challenges ahead Lack of curriculum transformation some Tier 1 universities and none in Tier 2 and lower. Lack of bioinformatics-enabled professors Lack of institutional resources (network, expertise) Lack of professional career framework Lack of recognition and organisational support Lack of recurrent or established financial support Lack of bandwidth of older people, and lack of young people and volunteers Stimulate high impact research Stimulate high impact resources and databases Stimulate high impact software and services Stimulate high quality education and graduates 41 What can we do strategically? Be more actively involved in ISCB Seek membership for our members in EMBnet or its enlarged entity E”xtended”MBnet Facilitation of Research Grant funding Support the ABC initiative Support our colleagues in AASBi, APBC, GIW etc. 42 Today’s Challenges for Asia and Personal Genomics BioInformation have and have-nots – big disparity in capacities in different institutions and cities of Asia Lack of interoperability and standardisation Lack of Asia repositories – over reliance on Western sources Low to high level of knowledge in Asian life scientists – big disparity 43 Personal Genomics In 10 to 20 years, it will be common in the West. Asia will lag by at least 10 years What will improve it? – Education – Positive Publicity – Information infrastructure – Standards and best practice 44 Role of Asian Bioinformation Centers Repository of public knowledge – Stimulate private or proprietary knowledge Promotion of standardisation and interoperability – Leverage on virtualisation, grid and cloud Champion of bioinformation in our institution, in our city, and in our country – Education and awareness – Scientific activism with our governments 45 New initiatives Minimum Bioinformatics Skillsets for Life science undergraduates/graduates Minimum Information about a Bioinformatics Investigation MIABi AuthorID disambiguation Deposition of datasets What new areas of Research can we compete in? 46 Origin of Species 200th Anniversary of Darwin’s birth 150th Anniversary of the “Origin of Species” Alfred Russel Wallace Malay Archipelago 47 Origin of Life 48 From Religion: Logos Old Testament of the Holy Bible: Genesis 1:1 In the beginning God created the heavens and the earth. New Testament of the Holy Bible: John 1: 1 In the beginning was the Word, and the Word was with God, and the Word was God. Logos (Greek, "word,""reason,""ratio"), in ancient and especially in medieval philosophy and theology, the divine reason that acts as the ordering principle of the universe. 49 Tao (道, pinyin: dào (help·info) ) is a concept found in Taoism, Confucianism, and more generally in ancient Chinese philosophy. While the character itself translates as 'way', 'path', or 'route', or sometimes more loosely as 'doctrine' or 'principle', it is used philosophically to signify the fundamental or true nature of the world. The concept of Tao differs from Western ontology, however; it is an active and holistic conception of the world, rather than a static, atomistic one. 50 From Physics: Mass-Energy Equivalence Mass–energy equivalence was proposed in Albert Einstein's 1905 paper, "Does the inertia of a body depend upon its energy-content?", one of his Annus Mirabilis ("Miraculous Year") Papers.[2] Einstein was not the first to propose a mass–energy relationship, and various similar formulas appeared before Einstein's theory with incorrect numerical coefficients and an incomplete interpretation. Einstein was the first to propose the simple formula and the first to interpret it correctly: as a general principle which follows from the relativistic symmetries of space and time. S. Rainville, J.K. Thompson, E.G. Myers, J.M. Brown, M.S. Dewey, E.G. Kessler Jr., R.D. Deslattes, H.G. Börner, M. Jentschel, P. Mutti, D.E. Pritchard. 2005. A direct test of E = mc2. Nature. Dec. 22, 2005 51 52 From Chemistry Dmitri Ivanovich Mendeleev On March 6, 1869, Mendeleev made a formal presentation to the Russian Chemical Society, entitled The Dependence between the Properties of the Atomic Weights of the Elements, which described elements according to both atomic weight and valence. Principles of Chemistry (1868-70) 53 Periodic Table of Elements 54 The Standard Model in particle physics At present, matter and energy are best understood in terms of the kinematics and interactions of elementary particles. To date, physics has reduced the laws governing the behavior and interaction of all known forms of matter and energy, to a small set of fundamental laws and theories. A major goal of physics is to find the "common ground" that would unite all of these theories into one integrated theory of everything, of which all the other known laws would be special cases, and from which the behavior of all matter and energy could be derived (at least in principle) (Wikipedia) 55 Two Dimensional Pattern of Basic Sound as the basis of Spoken Language Sanskrit grammarians of ancient India, who had created astonishingly sophisticated theories of language based on their discovery of the twodimensional patterns in basic sounds The tabular form of the Sanskrit letters is due to the two parameters (point of articulation and aspiration) as the basis of the spoken sounds 56 From Linnaeaus Taxonomy Binomial Nomenclature 57 Claude Bernard 1813-1878 Homeostasis Milieu interieur is the key process with which Bernard is associated. He wrote, "La fixité du milieu intérieur est la condition d'une vie libre et indépendante" ("The constancy of the internal environment is the condition for a free and independent life"). Compare with Traditional Chinese and Traditional Indian Ayuvedic Medicine - Balance 58 What new metaphors for “the New Biology”? In 1623 Galileo crafted a famous metaphor that is still often cited by scientists. Nature, he wrote, is a book written in "the language of mathematics". If we cannot understand that language, we will be doomed to wander about as if "in a dark labyrinth". (Crease, 2006) The most important lesson to be found in Galileo's image is the need to keep developing and revising the metaphors with which we speak about science 59 On “Bioinformatics is Dead”, “I think we are just at the beginning of something more dramatic” John Quackenbush 2009 Information Coded Information – Need to build a concept of Information Need to measure it Pierre-Henri Gouyon (2009) 60 Semantic Web or Semiotics? Semiotics, also called semiotic studies or semiology, is the study of sign processes (semiosis), or signification and communication, signs and symbols, both individually and grouped into sign systems. It includes the study of how meaning is constructed and understood three branches: Semantics: Relation between signs and the things they refer to, their denotata. Syntactics: Relation of signs to each other in formal structures. Pragmatics: Relation of signs to their impacts on those who use them. Semiotics is frequently seen as having important anthropological dimensions, for example Umberto Eco proposes that every cultural phenomenon can be studied as communication. 61 The Origin of Information From the origin of species to the origin of encoded information in living things From Observational Science to Experimental science to Informational Biological science Future of Bioinformatics – the information in “bio” logical organisms Looking at organisms as information perpetuating agents Is this the theoretical biology of the future? What is the language to be used? Mathematics? Ontologies? 62