Slides - 1am:London

advertisement
Valuing Software and Other
Research Outputs
Daniel S. Katz
dkatz@nsf.gov & d.katz@ieee.org
@danielskatz
Program Director, Division of
Advanced Cyberinfrastructure
1am:London – The Altmetrics Conference
26 September 2014
National Science Foundation
• Federal agency created in 1950 "to promote the
progress of science; to advance the national
health, prosperity, and welfare; to secure the
national defense…”
• Annual budget of $7.2 billion (FY 2014)
• Funds 24 percent of all federally supported
basic research at US colleges and universities
• In many fields such as mathematics, computer
science and the social sciences, NSF is the
major source of federal funds
NSF
OFFICE OF DIVERSITY &
INCLUSION (ODI)
NATIONAL SCIENCE FOUNDATION
Claudia J. Postell, Head
703.292.8020
OFFICE OF THE DIRECTOR
NATIONAL SCIENCE BOARD (NSB)
703.292.8000
Dan E. Arvizu
Chair
Kelvin K. Droegemeier
Vice Chair
OFFICE OF THE GENERAL
COUNSEL (OGC)
Lawrence Rudolph,
General Counsel
Peggy Hoyle, Deputy GC
France A. Córdova
703.292.7000
703.292.8060
Director
OFFICE OF INTERNATIONAL &
INTEGRATIVE ACTIVITIES (OIIA)
NATIONAL SCIENCE BOARD
OFFICE
Michael Van Woert
Wanda Ward, Head
Executive Officr
703.292.7000
Vacant
703.292.8040
Deputy Director
OFFICE OF INSPECTOR
GENERAL (OIG)
OFFICE OF LEGISLATIVE & PUBLIC
AFFAIRS (OLPA)
Allison C. Lerner, Inspector General
e
703.292.7100
DIRECTORATE FOR
BIOLOGICAL
SCIENCES
(BIO)
John C.l Wingfied,
Assistant Director
Jane Silverthorne,
Deputy AD
703.292.8400
DIVISION OF BIOLOGICAL
INFRASTRUCTURE (DBI)
Scott Edwards,
Division Director
703.292.8470
DIVISION OF ENVIRONMENT AL
BIOLOGY (DEB)
Penelope Firth,
Division Director
703.292.8480
DIVISION OF INTEGRA TIVE
ORGANISMAL SYSTEMS (IOS)
Michelle Elekonich,
Acting Division Director
703.292.8420
DIVISION OF MOLECULAR &
CELLULAR BIOSCIENCES (MCB)
Gregory Warr,
Acting Division Director
703.292.8440
Judy Gan, Head
703.292.8070
DIRECTORATE FOR
COMPUTER &
INFORMATION SCIENCE &
ENGINEERING (CISE)
DIRECTORATE FOR
EDUCATION & HUMAN
RESOURCES
(EHR)
Suzane Iacono,
Acting Assistant Director
Joan Ferrini-Mundy,
Assistant Director
Vacant,
Deputy AD
Jermelina L. Tupas,
Acting Deputy AD
DIRECTORATE FOR
ENGINEERING
(ENG)
Pramod P.
Khargonekar,
Assistant Director
Grace Wang,
Deputy AD
703.292.8600
703.292.8900
DIVISION OF COMPUTER & e
NETWORK SYSTEMS (CNS)
Keith Marzullo,
Division Director
703.292.8950
DIVISION OF COMPUTING &
COMMUNICATION
FOUNDATIONS (CCF)
Rao Kosaraju,
Division Director
703.292.8910
DIVISION OF ADVANCED
CYBERINFRASTRUCTURE (ACI)
Irene Qualters,
Division Director
703.292.8970
DIVISION OF INFORMATION &
INTELLIGENT SYSTEMS (IIS)
Deborah Lockhart,
Acting Division Director
703.292.8930
DIRECTORATE FOR
GEOSCIENCES
(GEO)
DIVISION OF HUMAN RESOURCE
DEVELOPMENT (HRD)
Sylvia James,
Division Director
703.292.8640
DIVISION OF RESEARCH ON
LEARNING IN FORMAL &
INFORMAL SETTINGS (DRL)
Sarah McDonald,
Acting Division Director
703.292.8620
DIVISION OF UNDERGRADUATE
EDUCATION (DUE)
Susan Singer,
Division Director
703.292.8670
OFFICE OF EMERGING
FRONTIERS (EF)
Charles Liarakos,
Acting Division Director
703.292.8508
National Science Foundation
4201 Wilson Boulevard
Arlington, Virginia 22230
TEL: 703.292.5111 | FIRS: 800.877.8339 | TDD: 800.281.8749
e
DIVISION OF CHEMICAL,
BIOENGINEERING, ENVIRONMENTAL &
TRANSPORT SYSTEMS (CBET)
JoAnn Lighty,
Division Director
703.292.8320
DIVISION OF CIVIL,
MECHANICAL & MANUFACTURING
INNOVATION (CMMI)
George Hazelrigg,
Acting Division Director
703.292.8360
DIVISION OF ELECTRICAL,
COMMUNICATIONS & CYBER
SYSTEMS (ECCS)
Samir El-Ghazaly,
Division Director
703.292.8339
DIVISION OF ENGINEERING
EDUCATION & CENTERS (EEC)
Theresa Maldonado,
Division Director
703.292.8380
DIVISION OF INDUSTRIAL
INNOVATION & PARTNERSHIPS (IIP)
Vacant,
Division Director
703.292.8050
OFFICE OF EMERGING
FRONTIERS IN RESEARCH &
INNOVATION (EFRI)
Sohi Rastegar,
Senior Advisor
703.292.8301
DIRECTORATE FOR
SOCIAL, BEHAVIORAL, &
ECONOMIC SCIENCES
(SBE)
Roger Wakimoto,
Assistant Director
Fleming Crim,
Assistant Director g
Fay Cook,
Assistant Director
Margaret Cavanaugh,
Deputy AD
Celeste M. Rohlfin,
Deputy AD
Deborah Olster,
Acting Deputy AD
703.292.8500
703.292.8800
703.292.8700
703.292.8300
DIVISION OF GRADUATE
EDUCATION (DGE)
Valerie Wilson,
Acting Division Director
703.292.8630
DIRECTORATE FOR
MATHEMATICAL &
PHYSICAL SCIENCES
(MPS)
OFFICE OF BUDGET,
FINANCE, & AWARD
MANAGEMENT
(BFA)
Martha A. Rubenstein,
Head e
/ Chief Financial
Officr
Joanna E. Rom,
Deputy Head
OFFICE OF INFORMATION
& RESOURCE
MANAGEMENT
(OIRM)
Clifford Gabriel,
Acting Head
Amy Northcutt,
Chief Information Officr
703.292.8100
703.292.8200
DIVISION OF ATMOSPHERIC &
GEOSPACE SCIENCES (AGS)
Scott Borg,
Division Director
703.292.8520
DIVISION OF ASTRONOMICAL
SCIENCES (AST)
James Ulvestad,
Division Director
703.292.8820
DIVISION OF BEHAVIORAL &
COGNITIVE SCIENCES (BCS)
Mark Weiss,
Division Director
703.292.8740
BUDGET DIVISION (BUD)
Michael Sieverts,
Division Director
703.292.8260
DIVISION OF ADMINISTRATIVE
SERVICES (DAS)
Mercedes Eugenia,
Division Director
703.292.8190
DIVISION OF EARTH
SCIENCES (EAR)
Sonia Esperança
Acting Division Director
703.292.8550
DIVISION OF CHEMISTRY (CHE)
Jacquelyn Gervay-Hague,
Division Director
703.292.8840
DIVISION OF SOCIAL &
ECONOMIC SCIENCES (SES)
Jeryl Mumpower,
Division Director
703.292.8760
DIVISION OF ACQUISITION AND
COOPERATIVE SUPPORT (DACS)
Jeffery Lupis,
Division Director
703.292.8240
DIVISION OF INFORMATION
SYSTEMS (DIS)
Dorothy Aronson,
Division Director
703.292.8150
DIVISION OF OCEAN
SCIENCES (OCE)
Deborah Bronk,
Acting Division Director
703.292.8580
DIVISION OF MATERIALS
RESEARCH (DMR)
Mary Galvin-Donoghue,
Division Director
703.292.8810
NATIONAL CENTER FOR
SCIENCE AND ENGINEERING
STATISTICS (NCSES)
John Gawalt,
Division Director
703.292.8780
DIVISION OF FINANCIAL
MANAGEMENT (DFM)
,
Shirl Ruffin
Division Director / Deputy CFO
703.292.8280
DIVISION OF HUMAN RESOURCE
MANAGEMENT (HRM)
Judy Sunley,
Division Director /
Acting Chief Human Capital Officr
703.292.8180
DIVISION OF
POLAR PROGRAMS (PLR)
Kelly Falkner,
Division Director
703.292.8030
DIVISION OF MATHEMATICAL
SCIENCES (DMS)
Michael Vogelius,
Division Director
703.292.8870
DIVISION OF GRANTS &
AGREEMENTS (DGA)
Karen Tiplady,
Division Director
703.292.8210
DIVISION OF PHYSICS (PHY)
Denise Caldwell,
Division Director
703.292.8890
DIVISION OF INSTITUTION &
AWARD SUPPORT (DIAS)
Mary Santonastasso,
Division Director
703.292.8230
OFFICE OF MULTIDISCIPLINARY
ACTIVITIES (OMA)
ClarkeCooper,
Offic He ad
703.292.8800
LARGE FACILITIES OFFICE
Matthew Hawkins,
Acting Deputy Director
703.292.4416
September 2014
Advanced Cyberinfrastructure
(ACI) Division
• Supports and coordinates the development,
acquisition, and provision of state-of-the-art
cyberinfrastructure resources, tools, and
services
• Supports forward-looking research and
education to expand the future capabilities of
cyberinfrastructure
• Serves the growing community of scientists and
engineers, across all disciplines, whose work
relies on the power of advanced computation,
data-handling, and networking
Cyberinfrastructure
“Cyberinfrastructure consists of
computing systems,
data storage systems,
advanced instruments and
data repositories,
visualization environments, and
people,
all linked together by
software and
high performance networks,
to improve research productivity and
enable breakthroughs not otherwise possible.”
-- Craig Stewart
Software as Infrastructure
Science
• Software (including services) essential for
the bulk of science
-
•
About half the papers in recent issues of
Software
Science were software-intensive projects
Research becoming dependent upon
advances in software
Computing
Significant software development being
Infrastructure
conducted across NSF: NEON, OOI,
NEES, NCN, iPlant, etc
Wide range of software types: system, applications, modeling,
gateways, analysis, algorithms, middleware, libraries
• Software is not a one-time effort, it must be sustained
•
•
•
Development, production, and maintenance are people intensive
Software life-times are long vs hardware
Software has under-appreciated value
For software to be sustainable,
it must become infrastructure
NSF Software Infrastructure Projects
5 rounds of funding,
65 SSEs
4 rounds of funding,
35 SSIs
2 rounds of funding,
14 S2I2
conceptualizations
See http://bit.ly/sw-ci for current projects
SSE & SSI – NSF 14-520: Cross-NSF, all Directorates participating
Next SSEs due Feb 2015; Next SSIs due June 2015
SI2 Solicitation and Decision Process
• Proposal reviews well -> my role becomes
matchmaking
– I want to find program officers with funds, and convince them
that they should spend their funds on the proposal
• Unidisciplinary project (e.g. bioinformatics app)
– Work with single program officer, either likes the proposal or
not
• Multidisciplinary project (e.g., molecular
dynamics)
– Work with multiple program officers, ...
• Omnidisciplinary project (e.g. http, math library)
– Try to work with all program officers, often am told “it’s your
responsibility”
To judge software, need to
understand/forecast impact
Measuring Impact – Scenarios
1. Developer of open source physics simulation
– Possible metrics
•
•
•
•
•
How many downloads? (easiest to measure, least value)
How many contributors?
How many uses?
How many papers cite it?
How many papers that cite it are cited? (hardest to measure,
most value)
2. Developer of open source math library
– Possible metrics are similar, but citations are less
likely
– What if users don’t download it?
•
•
•
•
It’s part of a distro
It’s pre-installed (and optimized) on an HPC system
It’s part of a cloud image
It’s a service
• Future impacts – let proposers suggest
ACI Software Cluster Programs
• In these programs, ACI works with other NSF
units to support projects that lead to software
as an element of infrastructure
• Issue: amount of software that is
infrastructure grows over time, and grows
faster than NSF funding
Q: How can NSF ensure that software as
infrastructure continues to appear, without
funding all of it?
A: Incentives
• The devil is in the details
Other Software Discussions
• Working Towards Sustainable Software for
Science: Practice and Experience (WSSSPE)
– Google: WSSSPE
– 2 previous workshops, one upcoming
– WSSSPE2 @ SC14 (Nov 16, New Orleans)
• Lessons:
Many of the issues in developing
sustainable software are social, not
technical
Software work is inadequately visible in
ways that “count” within the reputation
system underlying science
Where We Are
• To judge software, need to understand/forecast impact
• Q: How can NSF ensure that software as infrastructure
continues to appear, without funding all of it?
• A: Incentives
• Many of the issues in developing sustainable software are
social, not technical
• Software work is inadequately visible in ways that “count”
within the reputation system underlying science
Hypothesis: better measurement of
contributions can lead to rewards
(incentives), leading to career paths,
willingness to join communities, leading to
more sustainable software
Moving Forward - NSF
• Recent CISE/ACI & SBE/SES Dear Colleague
Letter: Supporting Scientific Discovery through
Norms and Practices for Software and Data
Citation and Attribution (NSF 14-059,
http://www.nsf.gov/pubs/2014/nsf14059/nsf14059
.jsp)
– Need well-developed metrics to assess the
impact and quality of scientific software and
data
– Explore new norms and practices for software
and data citation and attribution, so that data
producers, software and tool developers, and
data curators are credited
• 6 projects and 3 collaborative workshops funded
Moving Forward - Dan
• Products (software, paper, data set) are
registered
– Credit map (weighted list of contributors—
people, products, etc.) is an input
– DOI is an output
– Leads to transitive credit1
• E.g., paper 1 provides 25% credit to software A, and
software A provides 10% credit to library X -> library
X gets 2.5% credit for paper 1
• Helps developer show: “my tools are important”
– Issues:
• Social: Trust in person who registers a product
• Technological: How2, Registration system
1D.
S. Katz, "Transitive Credit as a Means to Address Social and Technological Concerns Stemming from Citation
and Attribution of Digital Products," Journal of Open Research Software, v.2(1): e20, 2014. DOI: 10.5334/jors.be
2D. S. Katz, A. M. Smith, "Implementing Transitive Credit with JSON-LD," 2nd Workshop on Sustainable Software
for Science: Practice and Experiences (WSSSPE2), 2014. URL: http://arxiv.org/abs/1407.5117
Moving Forward - Community
• Career paths – Is there a role for non-tenuretrack researchers who produce software, data,
etc. in universities?
– Assuming yes, do universities recognize and support
this? If not, how to get them to?
• What is needed to support reproducibility of
science, in terms of data and software?
• Lots of entities with similar interests in both
software and data, e.g. JISC, RCUK, NIH, DOE,
Sloan & Moore, Mozilla, Apache, etc.
• Participate in WSSSPE
• Other ideas and questions are welcome, now or
later
– dkatz@nsf.gov or d.katz@ieee.org
Resources
•
•
•
NSF Software as Infrastructure Vision:
http://www.nsf.gov/publications/pub_summ.jsp?ods_key=nsf12113
Implementation of NSF Software Vision:
http://www.nsf.gov/funding/pgm_summ.jsp?pims_id=504817
Software Infrastructure for Sustained Innovation (SI2) Program
–
–
–
–
•
Working towards Sustainable Software for Science: Practice and Experiences
(WSSSPE)
–
–
–
•
Home: http://wssspe.researchcomputing.org.uk (includes links to all slides & papers)
1st workshop paper: http://arxiv.org/abs/1404.7414
2nd workshop site: http://wssspe.researchcomputing.org.uk/wssspe2/
NSF 14-059: “Dear Colleague Letter - Supporting Scientific Discovery through
Norms and Practices for Software and Data Citation and Attribution”
–
•
Scientific Software Elements (SSE) & Scientific Software Integration (SSI) solicitation:
http://www.nsf.gov/publications/pub_summ.jsp?ods_key=nsf14520
2013 PI meeting: https://sites.google.com/site/si2pimeeting/
2014 PI meeting: https://sites.google.com/site/si2pimeeting2014/
Awards: http://bit.ly/sw-ci
http://www.nsf.gov/pubs/2014/nsf14059/nsf14059.jsp
Transitive Credit Papers
–
–
http://dx.doi.org/10.5334/jors.be
http://arxiv.org/abs/1407.5117
Credits:
•
SI2 Program:
– Current program officers: Daniel S. Katz, Rudolf Eigenmann, William
Y. B. Chang, John C. Cherniavsky, Almadena Y. Chtchelkanova,
Cheryl L. Eavey, Evelyn Goldfield, Sol Greenspan, Daryl W. Hess,
Peter H. McCartney, Bogdan Mihaila, Dimitrios V. Papavassiliou,
Andrew D. Pollington, Barbara Ransom, Thomas Russell, Massimo
Ruzzene, Nigel A. Sharp, Paul Werbos, Eva Zanzerkia
– Formerly-involved program officers: Manish Parashar, Gabrielle
Allen, Sumanta Acharya, Eduardo Misawa, Jean Cottam-Allen,
Thomas Siegmund
•
WSSSPE:
– Organizers: Daniel S. Katz, Gabrielle Allen, Neil Chue Hong, Karen
Cranston, Manish Parashar, David Proctor, Matthew Turk, Colin C.
Venters, Nancy Wilkins-Diehr
– WSSSPE1 Summary paper authors: Daniel S. Katz, Sou-Cheng T.
Choi, Hilmar Lapp, Ketan Maheshwari, Frank Löffler, Matthew Turk,
Marcus D. Hanwell, Nancy Wilkins-Diehr, James Hetherington,
James Howison, Shel Swenson, Gabrielle D. Allen, Anne C. Elster,
Bruce Berriman, Colin Venters
– WSSSPE1 Keynote speakers: Phil Bourne, Arfon Smith
Download