Open Access and Open Data Requirements

advertisement
AASHTO RAC/TRB State Representatives 2014 Annual
Meeting
Madison, July 23
Open Access and Open
Data Requirements
Outlook and Implications
Kenda K. Levine, UC Berkeley
Mary Moulton, National Transportation Library
#TransHack
Who We Are
Leighton Christiansen, Iowa DOT
Kenda K. Levine, UC Berkeley
Mary Moulton, National Transportation Library
Amanda J. Wilson, National Transportation Library
#TransHack
Who Are We?
all images used under
claim of educational fair use
#TransHack
http://libraryconnectivity.org/datamgt/index.php/Main_Page
http://libraryconnectivity.org/datamgt/index.php/Main_Page
#TransHack
What Is Open Data?
“Open data is data that can be freely used,
reused and redistributed by anyone - subject
only, at most, to the requirement to attribute
and sharealike.”
Open Data Handbook
#TransHack
What Is Open Data?
Availability and Access
● no barriers to use, APIs
● no need to ask/sign up to use data
● usable and convenient formats (not PDF)
#TransHack
What Is Open Data?
Reuse and Redistribution
● TOS permits reuse and products to be made
using the data
● License does not restrict usage
● No proprietary formats, conform to standards
#TransHack
What Is Open Data?
Universal Participation
● Everyone has access (no firewalls)
● No industry based caveats
● Think of it as potential Public-PrivatePartnership
#TransHack
What Is Open Data?
Interoperability
● Ability to mix multiple sources
● Crosswalks
● APIs
● No barriers - format or license
#TransHack
What Is Open Access?
“Open Access is the free, immediate, online
availability of research articles, coupled with the
rights to use these articles fully in the digital
environment.”
SPARC
#TransHack
What Is Open Access?
Free Access
● No cost barriers for users
● Creative Commons for attribution
● Enables data-mining and text-mining
● Embargo periods
#TransHack
What Is Open Access?
Different Colors of Open Access
● Green: Author self-archives at the time of
submission
● Gold: Author/Institution pays a fee to
publisher to publisher access “free”
#TransHack
White House OSTP Memo
•February 22, 2013 – Office of Science & Technology Policy issues
memorandum entitled “Increasing Access to the Results of Federally Funded
Scientific Research.”
•Memorandum addresses new requirements for both intramural and extramural
publications and digital data sets resulting from federally-funded scientific
research.
•Expand on Data.gov, Open Government requirements already in place
•RITA assigned responsibility for preparing response.
#TransHack
USDOT Draft Implementation Plan
Scope
1.
“DOT-managed” = DOT contracts/grants/cooperative agreements and DOT-funded (independent of
Federal funding source, “R&D” label).
2.
Excludes funding flowed to states from Federal Aid programs (SP&R, NCHRP).
3.
Includes shared funded where Feds manage: pooled fund, etc.
#TransHack
USDOT Draft Implementation Plan
Highlights
1. Ensure publications and technical reports are deposited in the National Transportation Library – MAP21 requires NTL to “serve as a central depository for research results and technical publications of the
Department”.
2. Ensure metadata describing research digital data sets and the terms of access and use are made
accessible via the DOT Public Data Listing required under OMB Circular M-13-13.
3. Propose a simple Research Project Record process using a persistent identifier or similar method for
identifying and connecting publications and data sets, enabling NTL to:
a. Locate data related to publication to refer requestors/researchers.
b. Serve as compliance and reporting mechanism
#TransHack
USDOT Draft Implementation Plan
Publications
1. Deliver accepted, final manuscript to NTL under non-exclusive license agreement.
2. All manuscripts will be embargoed for a period of 18 months post-publication.
a. OSTP requires that embargoes may be challenged.
b. Will create “open docket” for challenges, and ongoing public feedback on Plan.
3. All publications, data sets and authors will have unique permanent identifiers for correlation of articles
with authors and relevant underlying data.
#TransHack
USDOT Draft Implementation Plan
Data
1. Plan excludes data which has confidentiality, privacy, proprietary, IP, security, and other exemptions
and protections
2. Extramural research will require submission and DOT approval of data management plan from all
awardees [blanket, with flow-down to PIs for compliance]:
a.
b.
c.
d.
Is the data worth keeping?
For how long?
In what format(s)?
How is cost recovery allowed? [DOT generally not take ownership; may charge]
3.
Awardees will determine repository for depositing data; must be accessible by NTL.
4.
Metadata will be included in DOT Enterprise Data Inventory.
#TransHack
Data Available from USDOT
NTL Data Catalog http://ntlsearch.bts.gov/repository/ntlc/btsdd/index.shtm
● Statistical data sets
● Geospatial data
● Sensor data
● Administrative (e.g. bridge inventories)
● Naturalistic study data
● Simulations and models
#TransHack
Roadblocks
Administrative Issues
● Many funding sources = many headaches
● Many funding sources = many terms of
deliverables
● Lack of coordination for tracking compliance
#TransHack
Roadblocks
Legal Issues
● Privacy and NDA issues
o
o
Human subject testing
Industry secrets
● Rights and ownership
● Ability to license, re-use, and buy/sell
data?
#TransHack
Roadblocks
Open Data Issues
● Interoperability
● Data formats
● Metadata schema
● Data sets are scattered by funder (if at all)
● When is “final” dataset made available?
#TransHack
Roadblocks
Open Access Issues
● Competing copyright
● Academics need to “publish or perish”
● Which repository?
● Limited only to research from specific
projects?
#TransHack
Roadblocks
Funding Issues
● Unfunded Mandates
● Long-term vs.Short-term funding
● Recharge model and pricing
● Cost of infrastructure and overhead
#TransHack
Thanks!
#TransHack
Download