Visualizing Science and Technology Data Katy Börner Victor H. Yngve Professor of Information Science Director, Cyberinfrastructure for Network Science Center School of Informatics and Computing and Indiana University Network Science Institute Indiana University, USA CyberBridges Workshop CEB Waterview Conference Center, Arlington, VA August 31, 2015 Language Communities of Twitter - Eric Fischer - 2012 Findyourway Descriptive& Predictive Models Findcollaborators,friends Terrabytesofdata Identifytrends 2 TypeofAnalysisvs.LevelofAnalysis Micro/Individual (1Ͳ100records) Meso/Local (101–100,000records) Macro/Global (100,000<records) Statistical Analysis/Profiling Individualpersonand theirexpertiseprofiles Largerlabs,centers, universities,research domains,orstates AllofNSF,allofUSA, allofscience. TemporalAnalysis (When?) Fundingportfolioof oneindividual Mappingtopicburstsin 113YearsofPhysics 20ͲyearsofPNAS Research GeospatialAnalysis Careertrajectoryofone Mappingastates (Where?) individual intellectuallandscape PNASpublications TopicalAnalysis (What?) Baseknowledgefrom whichonegrantdraws. Knowledgeflowsin Chemistryresearch VxOrd/Topicmapsof NIHfunding NetworkAnalysis (WithWhom?) NSFCoͲPInetworkof oneindividual CoͲauthornetwork NIH’score competency 3 TypeofAnalysisvs.LevelofAnalysis Micro/Individual (1Ͳ100records) Meso/Local (101–100,000records) Macro/Global (100,000<records) Statistical Analysis/Profiling Individualpersonand theirexpertiseprofiles Largerlabs,centers, universities,research domains,orstates AllofNSF,allofUSA, allofscience. TemporalAnalysis (When?) Fundingportfolioof oneindividual Mappingtopicburstsin 113YearsofPhysics 20ͲyearsofPNAS Research GeospatialAnalysis Careertrajectoryofone Mappingastates (Where?) individual intellectuallandscape PNASpublications TopicalAnalysis (What?) Baseknowledgefrom whichonegrantdraws. Knowledgeflowsin Chemistryresearch VxOrd/Topicmapsof NIHfunding NetworkAnalysis (WithWhom?) NSFCoͲPInetworkof oneindividual CoͲauthornetwork NIH’score competency 4 IndividualCoͲPINetwork Ke &Börner.2006. Temporal/NetworkAnalysis 2001Ͳ2006,US,InfoVisScope EvolvingprojectͲPInetworks 5 MappingIndiana’s IntellectualSpace Identify Geospatial/NetworkAnalysis ¾ Pocketsofinnovation 2001Ͳ2006,BioMed,INScope ¾ Pathwaysfromideastoproducts ¾ Interplayofindustryandacademia AcademicͲIndustrycollaborationsand knowledgediffusion 6 MappingtheEvolutionofCoͲAuthorshipNetworks Ke,Visvanath &Börner. 2004. Won1stprizeattheIEEEInfoVis Contest. 7 MappingtheEvolutionofCoͲAuthorshipNetworks Ke,Visvanath &Börner. 2004. Won1stprizeattheIEEEInfoVis Contest. Temporal/NetworkAnalysis 1986Ͳ2004,US,InfoVis Scope EvolvingcoͲauthornetworksandauthorimpact 8 StudyingtheEmergingGlobalBrain:Analyzingand VisualizingtheImpactofCoͲAuthorshipTeams Börner,Dall’Asta,Ke &Vespignani.2005.Complexity10(4):58Ͳ67. Researchquestion: • Issciencedrivenbyprolificsingle expertsorbyhighͲimpactcoͲ authorshipteams? Contributions: • Newapproachtoallocatecitational credit. • Novelweightedgraphrepresentation. Temporal/NetworkAnalysis • Visualizationofthegrowthofweighted coͲauthornetwork. 1986Ͳ2004,US,InfoVis Scope ImpactofcoͲauthorrelations • Centralitymeasurestoidentifyauthor impact. • Globalstatisticalanalysisofpaper productionandcitationsincorrelation withcoͲauthorshipteamsizeover time. • Local,authorͲcenteredentropy measure. 9 SpatioͲTemporalInformationProductionandConsumptionofMajorU.S. ResearchInstitutions Börner,Penumarthy,Meiss,&Ke. 2006.“MappingtheDiffusionofScholarlyKnowledgeAmongMajor U.S.ResearchInstitutions.”Scientometrics 68(3):415Ͳ426. Researchquestions: 1.Doesspacestillmatter intheInternetage? 2.Doesonestillhaveto studyandworkatmajorresearch institutionsinordertohaveaccessto Temporal/Geospatial Analysis highͲqualitydataandexpertiseandtoproduce 1982-2001, US, PNAS (BioMed) Scope highͲqualityresearch? Citation impact and knowledge diffusion 3.DoestheInternetleadtomoreglobalcitation patterns—i.e.,morecitationlinksbetween papersproducedatgeographicallydistant researchinstructions? Contributions: ¾ AnswertoQs1+2isYES. ¾ AnswertoQs3isNO. ¾ Novelapproachtoanalyzingthedualroleof institutionsasinformationproducersand consumersandtostudyandvisualizethe diffusionofinformationamongthem. 10 113YearsofPhysicalReview http://scimaps.org/dev/map_detail.php?map_id=171 HerrII,Duhon,Hardy,Penumarthy &Börner. Topical/NetworkAnalysis 1893Ͳ2006,World,PhysicsScope Topicevolution,knowledgediffusion, Nobelpredictions 11 MappingTransdisciplinary TobaccoUse ResearchCentersPublications CompareR01investigatorͲbasedfundingwithTTURC Centerawardsintermsofnumberofpublicationsand evolvingcoͲauthornetworks. Stipelman,Hall,Zoss,Okamoto,Stokols,Börner,2014. SupportedbyNIH/NCIContractHHSN261200800812 NetworkAnalysis 1998Ͳ2009,US,NIHDataonTobacco researchfunding ComparisonofcoͲauthornetworks 12 ResearchCollaborationsbytheChineseAcademyofSciences Huang,Duhon,Hardy&Börner GeospatialAnalysis World,ChineseAcademyofScience CollaborationandknowledgediffusionviacoͲ authornetworks 13 MappingTopicBursts CoͲwordspaceof thetopͲ50most frequentand bursty wordsused inthetopͲ10% mosthighlycited PNAS publications in1982Ͳ2001. Mane&Börner. 2004.PNAS 101(Suppl.1): 5287Ͳ5290. Temporal/Topical Analysis 1982-2001, US, PNAS (BioMed) Scope Topic evolution and bursts 14 OlivierH.Beauchesne,2011.MapofScientificCollaborationsfrom2005Ͳ2009. Bollen,Johan,HerbertVandeSompel,AricHagberg,LuisM.A.Bettencourt,RyanChute,MarkoA. Rodriquez,LyudmilaBalakireva.2008.AClickstreamMapofScience. 15 16 LanguageCommunitiesofTwitter Ͳ EricFischerͲ 2012 17 WardShelley.2011.HistoryofScienceFiction. 18 CouncilforChemicalResearch.2009.ChemicalR&DPowerstheU.S.InnovationEngine. Washington,DC.CourtesyoftheCouncilforChemicalResearch. Visualization Frameworks 19 21 IlluminatedDiagramDisplay ondisplayattheSmithsonianinDC. http://scimaps.org/exhibit_info/#ID 22 23 24 ScienceMapsin“ExpeditionZukunft”sciencetrainvisiting62citiesin7months12 coaches,300mlongOpeningwasonApril23rd,2009byGermanChancellorMerkel http://www.expeditionͲzukunft.de 25 Places&SpacesDigitalDisplayinNorthCarolinaState’sbrandnewImmersionTheater 26 Ingo Gunther's Worldprocessor globe design on display at the Giant Geo Cosmos OLED Display at the Museum of Emerging Science and Innovation in Tokyo, Japan 27 http://scimaps.org/call 28 Registerforfreeathttp://ivmooc.cns.iu.edu.ClassrestartsJanuary,2016. 30 TheInformationVisualizationMOOC ivmooc.cns.iu.edu Studentsfrom~100countries 370+facultymembers #ivmooc 31 CourseSchedule Part1:TheoryandHandsͲOn • Session1 – WorkflowDesignandVisualizationFramework • Session2 – “When:”TemporalData • Session3 – “Where:”GeospatialData • Session4 – “What:”TopicalData MidͲTerm • Session5 – “WithWhom:”Trees • Session6 – “WithWhom:”Networks • Session7 – DynamicVisualizationsandDeployment FinalExam Part2:Studentsworkinteamsonclientprojects. FinalgradeisbasedonClassParticipation(10%),Midterm(30%),FinalExam (30%),andClientProject(30%). 32 BooksUsedintheIVMOOC Teachestimely knowledge: Advancedalgorithms, tools,andhandsͲon workflows. Teachestimelessknowledge: Visualizationframework— exemplifiedusinggeneric visualizationexamplesand pioneeringvisualizations. 33 CourseSchedule Part1:TheoryandHandsͲOn • Session1 – WorkflowDesignandVisualizationFramework • Session2 – “When:”TemporalData • Session3 – “Where:”GeospatialData • Session4 – “What:”TopicalData MidͲTerm • Session5 – “WithWhom:”Trees • Session6 – “WithWhom:”Networks • Session7 – DynamicVisualizationsandDeployment FinalExam Part2:Studentsworkinteamsonclientprojects. FinalgradeisbasedonClassParticipation(10%),Midterm(30%),FinalExam (30%),andClientProject(30%). 34 CourseSchedule Part1:TheoryandHandsͲOn • Session1 – WorkflowDesignandVisualizationFramework • Session2 – “When:”TemporalData • Session3 – “Where:”GeospatialData • Session4 – “What:”TopicalData MidͲTerm • Session5 – “WithWhom:”Trees • Session6 – “WithWhom:”Networks • Session7 – DynamicVisualizationsandDeployment FinalExam Part2:Studentsworkinteamsonclientprojects. FinalgradeisbasedonClassParticipation(10%),Midterm(30%),FinalExam (30%),andClientProject(30%). 35 NeedsͲDrivenWorkflowDesign DEPLOY Validation Interpretation Stakeholders Visually encode data Typesandlevelsofanalysisdetermine data,algorithms&parameters,and deployment Overlay data Data Select visualiz. type READ ANALYZE VISUALIZE NeedsͲDrivenWorkflowDesign DEPLOY Validation Interpretation Stakeholders Visually encode data Typesandlevelsofanalysisdetermine data,algorithms&parameters,and deployment Overlay data Data Select visualiz. type READ ANALYZE VISUALIZE Load One File and Run Many Analyses and Visualizations Times Publication City of Cited Year Publisher 12 2011 NEW YORK 18 2010 13 2010 Country Journal Title (Full) USA COMMUNICA TIONS OF THE ACM MALDEN USA CTSCLINICAL AND TRANSLATIO NAL SCIENCE WASHINGTON USA SCIENCE TRANSLATIO NAL MEDICINE Title Plug-and-Play Macroscopes Subject Category Computer Science Authors Borner, K Advancing the Science of Research & Team Science Experimental Medicine Falk-Krzesinski, HJ|Borner, K|Contractor, N|Fiore, SM|Hall, KL|Keyton, J|Spring, B|Stokols, D|Trochim, W|Uzzi, B A Multi-Level Systems Cell Biology Perspective for the |Research & Science of Team Science Experimental Medicine Borner, K|Contractor, N|FalkKrzesinski, HJ|Fiore, SM|Hall, KL|Keyton, J|Spring, B|Stokols, D|Trochim, W|Uzzi, B 38 Load One File and Run Many Analyses and Visualizations Times Publication City of Cited Year Publisher 12 2011 NEW YORK 18 2010 13 2010 Country Journal Title (Full) USA COMMUNICA TIONS OF THE ACM MALDEN USA CTSCLINICAL AND TRANSLATIO NAL SCIENCE WASHINGTON USA SCIENCE TRANSLATIO NAL MEDICINE Title Plug-and-Play Macroscopes Subject Category Computer Science Authors Borner, K Advancing the Science of Research & Team Science Experimental Medicine Falk-Krzesinski, HJ|Borner, K|Contractor, N|Fiore, SM|Hall, KL|Keyton, J|Spring, B|Stokols, D|Trochim, W|Uzzi, B A Multi-Level Systems Cell Biology Perspective for the |Research & Science of Team Science Experimental Medicine Borner, K|Contractor, N|FalkKrzesinski, HJ|Fiore, SM|Hall, KL|Keyton, J|Spring, B|Stokols, D|Trochim, W|Uzzi, B CoͲauthorand manyother biͲmodalnetworks. 39 CourseSchedule Part1:TheoryandHandsͲOn • Session1 – WorkflowDesignandVisualizationFramework • Session2 – “When:”TemporalData • Session3 – “Where:”GeospatialData • Session4 – “What:”TopicalData MidͲTerm • Session5 – “WithWhom:”Trees • Session6 – “WithWhom:”Networks • Session7 – DynamicVisualizationsandDeployment FinalExam Part2:Studentsworkinteamsonclientprojects. FinalgradeisbasedonClassParticipation(10%),Midterm(30%),FinalExam (30%),andClientProject(30%). 40 VisualizationFrameworks HowtoClassifyDifferentVisualizations? By • Userinsightneeds? • Usertasktypes? • Datatobevisualized? • Datatransformation? • Visualizationtechnique? • Visualmappingtransformation? • Interactiontechniques? • Or? Tasks Seepage5 Seepage24 43 Seepage26 Seepage24 VisualizationTypes(ReferenceSystems) 1. Charts:Noreferencesystem—e.g.,Wordle.com,piecharts 2. Tables:Categoricalaxesthatcanbeselected,reordered;cells canbecolorcodedandmightcontainproportionalsymbols. Specialkindofgraph. 3. Graphs:Quantitativeorqualitative(categorical)axes. Timelines,bargraphs,scatterplots. 4. Geospatialmaps:Uselatitudeandlongitudereference system.Worldorcitymaps. 5. Networklayouts:Nodepositionmightdependsonnode attributesornodesimilarity.Trees:hierarchies,taxonomies, genealogies.Networks:socialnetworks,migrationflows. Types Seepage25 48 Seepage36 49 Seepages36Ͳ39 50 References Börner,Katy,Chen,Chaomei,andBoyack,Kevin.(2003). VisualizingKnowledgeDomains. InBlaise Cronin(Ed.), ARIST,Medford,NJ:InformationToday,Volume37,Chapter 5,pp.179Ͳ255.http://ivl.slis.indiana.edu/km/pub/2003Ͳ bornerͲarist.pdf Shiffrin,RichardM.andBörner,Katy(Eds.)(2004).Mapping KnowledgeDomains.ProceedingsoftheNationalAcademy ofSciencesoftheUnitedStatesofAmerica,101(Suppl_1). http://www.pnas.org/content/vol101/suppl_1/ Börner,Katy(2010)AtlasofScience:VisualizingWhatWe Know.TheMITPress.http://scimaps.org/atlas Scharnhorst,Andrea,Börner,Katy,vandenBesselaar,Peter (2012)ModelsofScienceDynamics.SpringerVerlag. KatyBörner,MichaelConlon,JonCorsonͲRikert,Cornell, YingDing(2012)VIVO:ASemanticApproachtoScholarly NetworkingandDiscovery.Morgan&Claypool. KatyBörner andDavidEPolley (2014)VisualInsights:A PracticalGuidetoMakingSenseofData.TheMITPress. Börner,Katy(2015)AtlasofKnowledge:AnyoneCanMap. TheMITPress.http://scimaps.org/atlas2 51 Allpapers,maps,tools,talks,pressarelinkedfromhttp://cns.iu.edu Theseslidesareathttp://cns.iu.edu/docs/presentations CNSFacebook:http://www.facebook.com/cnscenter MappingScienceExhibitFacebook:http://www.facebook.com/mappingscience 52