Visualizing Science and Technology Data

advertisement
Visualizing Science and Technology Data
Katy Börner
Victor H. Yngve Professor of Information Science
Director, Cyberinfrastructure for Network Science Center
School of Informatics and Computing and Indiana University Network Science Institute
Indiana University, USA
CyberBridges Workshop
CEB Waterview Conference Center, Arlington, VA
August 31, 2015
Language Communities of Twitter - Eric Fischer - 2012
Findyourway
Descriptive&
Predictive
Models
Findcollaborators,friends
Terrabytesofdata
Identifytrends
2
TypeofAnalysisvs.LevelofAnalysis
Micro/Individual
(1Ͳ100records)
Meso/Local
(101–100,000records)
Macro/Global
(100,000<records)
Statistical
Analysis/Profiling
Individualpersonand
theirexpertiseprofiles
Largerlabs,centers,
universities,research
domains,orstates
AllofNSF,allofUSA,
allofscience.
TemporalAnalysis
(When?)
Fundingportfolioof
oneindividual
Mappingtopicburstsin 113YearsofPhysics
20ͲyearsofPNAS
Research
GeospatialAnalysis Careertrajectoryofone Mappingastates
(Where?)
individual
intellectuallandscape
PNASpublications
TopicalAnalysis
(What?)
Baseknowledgefrom
whichonegrantdraws.
Knowledgeflowsin
Chemistryresearch
VxOrd/Topicmapsof
NIHfunding
NetworkAnalysis
(WithWhom?)
NSFCoͲPInetworkof
oneindividual
CoͲauthornetwork
NIH’score
competency
3
TypeofAnalysisvs.LevelofAnalysis
Micro/Individual
(1Ͳ100records)
Meso/Local
(101–100,000records)
Macro/Global
(100,000<records)
Statistical
Analysis/Profiling
Individualpersonand
theirexpertiseprofiles
Largerlabs,centers,
universities,research
domains,orstates
AllofNSF,allofUSA,
allofscience.
TemporalAnalysis
(When?)
Fundingportfolioof
oneindividual
Mappingtopicburstsin 113YearsofPhysics
20ͲyearsofPNAS
Research
GeospatialAnalysis Careertrajectoryofone Mappingastates
(Where?)
individual
intellectuallandscape
PNASpublications
TopicalAnalysis
(What?)
Baseknowledgefrom
whichonegrantdraws.
Knowledgeflowsin
Chemistryresearch
VxOrd/Topicmapsof
NIHfunding
NetworkAnalysis
(WithWhom?)
NSFCoͲPInetworkof
oneindividual
CoͲauthornetwork
NIH’score
competency
4
IndividualCoͲPINetwork
Ke &Börner.2006.
Temporal/NetworkAnalysis
2001Ͳ2006,US,InfoVisScope
EvolvingprojectͲPInetworks
5
MappingIndiana’s
IntellectualSpace
Identify
Geospatial/NetworkAnalysis
¾ Pocketsofinnovation
2001Ͳ2006,BioMed,INScope
¾ Pathwaysfromideastoproducts
¾ Interplayofindustryandacademia
AcademicͲIndustrycollaborationsand
knowledgediffusion
6
MappingtheEvolutionofCoͲAuthorshipNetworks
Ke,Visvanath &Börner. 2004. Won1stprizeattheIEEEInfoVis Contest.
7
MappingtheEvolutionofCoͲAuthorshipNetworks
Ke,Visvanath &Börner. 2004. Won1stprizeattheIEEEInfoVis Contest.
Temporal/NetworkAnalysis
1986Ͳ2004,US,InfoVis Scope
EvolvingcoͲauthornetworksandauthorimpact
8
StudyingtheEmergingGlobalBrain:Analyzingand
VisualizingtheImpactofCoͲAuthorshipTeams
Börner,Dall’Asta,Ke &Vespignani.2005.Complexity10(4):58Ͳ67.
Researchquestion:
• Issciencedrivenbyprolificsingle
expertsorbyhighͲimpactcoͲ
authorshipteams?
Contributions:
• Newapproachtoallocatecitational
credit.
• Novelweightedgraphrepresentation.
Temporal/NetworkAnalysis
• Visualizationofthegrowthofweighted
coͲauthornetwork. 1986Ͳ2004,US,InfoVis Scope
ImpactofcoͲauthorrelations
• Centralitymeasurestoidentifyauthor
impact.
• Globalstatisticalanalysisofpaper
productionandcitationsincorrelation
withcoͲauthorshipteamsizeover
time.
• Local,authorͲcenteredentropy
measure.
9
SpatioͲTemporalInformationProductionandConsumptionofMajorU.S.
ResearchInstitutions
Börner,Penumarthy,Meiss,&Ke.
2006.“MappingtheDiffusionofScholarlyKnowledgeAmongMajor
U.S.ResearchInstitutions.”Scientometrics 68(3):415Ͳ426.
Researchquestions:
1.Doesspacestillmatter
intheInternetage?
2.Doesonestillhaveto
studyandworkatmajorresearch
institutionsinordertohaveaccessto
Temporal/Geospatial Analysis
highͲqualitydataandexpertiseandtoproduce
1982-2001, US, PNAS (BioMed) Scope
highͲqualityresearch?
Citation impact and knowledge diffusion
3.DoestheInternetleadtomoreglobalcitation
patterns—i.e.,morecitationlinksbetween
papersproducedatgeographicallydistant
researchinstructions?
Contributions:
¾ AnswertoQs1+2isYES.
¾ AnswertoQs3isNO.
¾ Novelapproachtoanalyzingthedualroleof
institutionsasinformationproducersand
consumersandtostudyandvisualizethe
diffusionofinformationamongthem.
10
113YearsofPhysicalReview
http://scimaps.org/dev/map_detail.php?map_id=171
HerrII,Duhon,Hardy,Penumarthy &Börner.
Topical/NetworkAnalysis
1893Ͳ2006,World,PhysicsScope
Topicevolution,knowledgediffusion,
Nobelpredictions
11
MappingTransdisciplinary TobaccoUse
ResearchCentersPublications
CompareR01investigatorͲbasedfundingwithTTURC
Centerawardsintermsofnumberofpublicationsand
evolvingcoͲauthornetworks.
Stipelman,Hall,Zoss,Okamoto,Stokols,Börner,2014.
SupportedbyNIH/NCIContractHHSN261200800812
NetworkAnalysis
1998Ͳ2009,US,NIHDataonTobacco
researchfunding
ComparisonofcoͲauthornetworks
12
ResearchCollaborationsbytheChineseAcademyofSciences
Huang,Duhon,Hardy&Börner
GeospatialAnalysis
World,ChineseAcademyofScience
CollaborationandknowledgediffusionviacoͲ
authornetworks
13
MappingTopicBursts
CoͲwordspaceof
thetopͲ50most
frequentand
bursty wordsused
inthetopͲ10%
mosthighlycited
PNAS publications
in1982Ͳ2001.
Mane&Börner.
2004.PNAS
101(Suppl.1):
5287Ͳ5290.
Temporal/Topical Analysis
1982-2001, US, PNAS (BioMed) Scope
Topic evolution and bursts
14
OlivierH.Beauchesne,2011.MapofScientificCollaborationsfrom2005Ͳ2009.
Bollen,Johan,HerbertVandeSompel,AricHagberg,LuisM.A.Bettencourt,RyanChute,MarkoA.
Rodriquez,LyudmilaBalakireva.2008.AClickstreamMapofScience.
15
16
LanguageCommunitiesofTwitter Ͳ EricFischerͲ 2012
17
WardShelley.2011.HistoryofScienceFiction.
18
CouncilforChemicalResearch.2009.ChemicalR&DPowerstheU.S.InnovationEngine.
Washington,DC.CourtesyoftheCouncilforChemicalResearch.
Visualization Frameworks
19
21
IlluminatedDiagramDisplay
ondisplayattheSmithsonianinDC.
http://scimaps.org/exhibit_info/#ID
22
23
24
ScienceMapsin“ExpeditionZukunft”sciencetrainvisiting62citiesin7months12
coaches,300mlongOpeningwasonApril23rd,2009byGermanChancellorMerkel
http://www.expeditionͲzukunft.de
25
Places&SpacesDigitalDisplayinNorthCarolinaState’sbrandnewImmersionTheater
26
Ingo Gunther's Worldprocessor globe design on display at the Giant Geo Cosmos OLED
Display at the Museum of Emerging Science and Innovation in Tokyo, Japan
27
http://scimaps.org/call
28
Registerforfreeathttp://ivmooc.cns.iu.edu.ClassrestartsJanuary,2016.
30
TheInformationVisualizationMOOC
ivmooc.cns.iu.edu
Studentsfrom~100countries
370+facultymembers
#ivmooc
31
CourseSchedule
Part1:TheoryandHandsͲOn
• Session1 – WorkflowDesignandVisualizationFramework
• Session2 – “When:”TemporalData
• Session3 – “Where:”GeospatialData
• Session4 – “What:”TopicalData
MidͲTerm
• Session5 – “WithWhom:”Trees
• Session6 – “WithWhom:”Networks
• Session7 – DynamicVisualizationsandDeployment
FinalExam
Part2:Studentsworkinteamsonclientprojects.
FinalgradeisbasedonClassParticipation(10%),Midterm(30%),FinalExam
(30%),andClientProject(30%).
32
BooksUsedintheIVMOOC
Teachestimely
knowledge:
Advancedalgorithms,
tools,andhandsͲon
workflows.
Teachestimelessknowledge:
Visualizationframework—
exemplifiedusinggeneric
visualizationexamplesand
pioneeringvisualizations.
33
CourseSchedule
Part1:TheoryandHandsͲOn
• Session1 – WorkflowDesignandVisualizationFramework
• Session2 – “When:”TemporalData
• Session3 – “Where:”GeospatialData
• Session4 – “What:”TopicalData
MidͲTerm
• Session5 – “WithWhom:”Trees
• Session6 – “WithWhom:”Networks
• Session7 – DynamicVisualizationsandDeployment
FinalExam
Part2:Studentsworkinteamsonclientprojects.
FinalgradeisbasedonClassParticipation(10%),Midterm(30%),FinalExam
(30%),andClientProject(30%).
34
CourseSchedule
Part1:TheoryandHandsͲOn
• Session1 – WorkflowDesignandVisualizationFramework
• Session2 – “When:”TemporalData
• Session3 – “Where:”GeospatialData
• Session4 – “What:”TopicalData
MidͲTerm
• Session5 – “WithWhom:”Trees
• Session6 – “WithWhom:”Networks
• Session7 – DynamicVisualizationsandDeployment
FinalExam
Part2:Studentsworkinteamsonclientprojects.
FinalgradeisbasedonClassParticipation(10%),Midterm(30%),FinalExam
(30%),andClientProject(30%).
35
NeedsͲDrivenWorkflowDesign
DEPLOY
Validation
Interpretation
Stakeholders
Visually
encode
data
Typesandlevelsofanalysisdetermine
data,algorithms&parameters,and
deployment
Overlay
data
Data
Select
visualiz.
type
READ
ANALYZE
VISUALIZE
NeedsͲDrivenWorkflowDesign
DEPLOY
Validation
Interpretation
Stakeholders
Visually
encode
data
Typesandlevelsofanalysisdetermine
data,algorithms&parameters,and
deployment
Overlay
data
Data
Select
visualiz.
type
READ
ANALYZE
VISUALIZE
Load One File and Run Many Analyses and Visualizations
Times Publication City of
Cited Year
Publisher
12
2011
NEW YORK
18
2010
13
2010
Country Journal Title
(Full)
USA
COMMUNICA
TIONS OF
THE ACM
MALDEN
USA
CTSCLINICAL
AND
TRANSLATIO
NAL
SCIENCE
WASHINGTON USA
SCIENCE
TRANSLATIO
NAL
MEDICINE
Title
Plug-and-Play
Macroscopes
Subject
Category
Computer
Science
Authors
Borner, K
Advancing the Science of Research &
Team Science
Experimental
Medicine
Falk-Krzesinski, HJ|Borner,
K|Contractor, N|Fiore, SM|Hall,
KL|Keyton, J|Spring, B|Stokols,
D|Trochim, W|Uzzi, B
A Multi-Level Systems
Cell Biology
Perspective for the
|Research &
Science of Team Science Experimental
Medicine
Borner, K|Contractor, N|FalkKrzesinski, HJ|Fiore, SM|Hall,
KL|Keyton, J|Spring, B|Stokols,
D|Trochim, W|Uzzi, B
38
Load One File and Run Many Analyses and Visualizations
Times Publication City of
Cited Year
Publisher
12
2011
NEW YORK
18
2010
13
2010
Country Journal Title
(Full)
USA
COMMUNICA
TIONS OF
THE ACM
MALDEN
USA
CTSCLINICAL
AND
TRANSLATIO
NAL
SCIENCE
WASHINGTON USA
SCIENCE
TRANSLATIO
NAL
MEDICINE
Title
Plug-and-Play
Macroscopes
Subject
Category
Computer
Science
Authors
Borner, K
Advancing the Science of Research &
Team Science
Experimental
Medicine
Falk-Krzesinski, HJ|Borner,
K|Contractor, N|Fiore, SM|Hall,
KL|Keyton, J|Spring, B|Stokols,
D|Trochim, W|Uzzi, B
A Multi-Level Systems
Cell Biology
Perspective for the
|Research &
Science of Team Science Experimental
Medicine
Borner, K|Contractor, N|FalkKrzesinski, HJ|Fiore, SM|Hall,
KL|Keyton, J|Spring, B|Stokols,
D|Trochim, W|Uzzi, B
CoͲauthorand
manyother
biͲmodalnetworks.
39
CourseSchedule
Part1:TheoryandHandsͲOn
• Session1 – WorkflowDesignandVisualizationFramework
• Session2 – “When:”TemporalData
• Session3 – “Where:”GeospatialData
• Session4 – “What:”TopicalData
MidͲTerm
• Session5 – “WithWhom:”Trees
• Session6 – “WithWhom:”Networks
• Session7 – DynamicVisualizationsandDeployment
FinalExam
Part2:Studentsworkinteamsonclientprojects.
FinalgradeisbasedonClassParticipation(10%),Midterm(30%),FinalExam
(30%),andClientProject(30%).
40
VisualizationFrameworks
HowtoClassifyDifferentVisualizations?
By
• Userinsightneeds?
• Usertasktypes?
• Datatobevisualized?
• Datatransformation?
• Visualizationtechnique?
• Visualmappingtransformation?
• Interactiontechniques?
• Or?
Tasks
Seepage5
Seepage24
43
Seepage26
Seepage24
VisualizationTypes(ReferenceSystems)
1. Charts:Noreferencesystem—e.g.,Wordle.com,piecharts
2. Tables:Categoricalaxesthatcanbeselected,reordered;cells
canbecolorcodedandmightcontainproportionalsymbols.
Specialkindofgraph.
3. Graphs:Quantitativeorqualitative(categorical)axes.
Timelines,bargraphs,scatterplots.
4. Geospatialmaps:Uselatitudeandlongitudereference
system.Worldorcitymaps.
5. Networklayouts:Nodepositionmightdependsonnode
attributesornodesimilarity.Trees:hierarchies,taxonomies,
genealogies.Networks:socialnetworks,migrationflows.
Types
Seepage25
48
Seepage36
49
Seepages36Ͳ39
50
References
Börner,Katy,Chen,Chaomei,andBoyack,Kevin.(2003).
VisualizingKnowledgeDomains. InBlaise Cronin(Ed.),
ARIST,Medford,NJ:InformationToday,Volume37,Chapter
5,pp.179Ͳ255.http://ivl.slis.indiana.edu/km/pub/2003Ͳ
bornerͲarist.pdf
Shiffrin,RichardM.andBörner,Katy(Eds.)(2004).Mapping
KnowledgeDomains.ProceedingsoftheNationalAcademy
ofSciencesoftheUnitedStatesofAmerica,101(Suppl_1).
http://www.pnas.org/content/vol101/suppl_1/
Börner,Katy(2010)AtlasofScience:VisualizingWhatWe
Know.TheMITPress.http://scimaps.org/atlas
Scharnhorst,Andrea,Börner,Katy,vandenBesselaar,Peter
(2012)ModelsofScienceDynamics.SpringerVerlag.
KatyBörner,MichaelConlon,JonCorsonͲRikert,Cornell,
YingDing(2012)VIVO:ASemanticApproachtoScholarly
NetworkingandDiscovery.Morgan&Claypool.
KatyBörner andDavidEPolley (2014)VisualInsights:A
PracticalGuidetoMakingSenseofData.TheMITPress.
Börner,Katy(2015)AtlasofKnowledge:AnyoneCanMap.
TheMITPress.http://scimaps.org/atlas2
51
Allpapers,maps,tools,talks,pressarelinkedfromhttp://cns.iu.edu
Theseslidesareathttp://cns.iu.edu/docs/presentations
CNSFacebook:http://www.facebook.com/cnscenter
MappingScienceExhibitFacebook:http://www.facebook.com/mappingscience
52
Download