Data citation at Geoscience Australia: The journey so far

Data citation at Geoscience Australia
Policy
Amanda Steen (Systems and Data Librarian)
Infrastructure to support data citation
Dr Sue Fyfe (Director, Data Governance & Services Section)
Introduction
Geoscience Australia is Australia’s national geoscience agency,
developing and providing geographical and geological data and
knowledge for the Australian Government to support it in
delivering its priorities.
We also provide advice and information to industry and other
stakeholders such as emergency services, minerals and
petroleum industries, primary producers, telecommunications
agencies, and more.
Vision: Geoscience Australia is custodian of the geographic and
geological data and knowledge of the nation. We create,
maintain and disseminate geographic and geological knowledge
for the future wellbeing of all Australians.
Data citation at GA 2014
Why does GA need data citation?
• Enables validity of GA data
• Enables validation of research (for example in the peer review
process)
• Scientists are able to be more aware of existing research data
and can reuse it, which could prevent costly research being
done again (for example re-exploration of mineral resources
in areas already investigated)
• Can be used to evaluate, acknowledge and reward the work
done by GA’s scientists
What is GA doing in its move toward data citation?
• Data citation standard
  • An overarching citation standard
  • Dynamic datasets
• Persistent identifiers
  • DOIs
    • Began minting DOIs for publications via CrossRef in
      January 2014.
    • Will soon begin minting for datasets with ANDS once
      M2M is established.
DOI Policy snapshot
• Can be applied to any type of digital object
• Must be openly accessible, stable, available on the internet,
quality assured and held in a GA-approved repository
• Can be applied at different levels of granularity….
• Will not be applied to restricted or embargoed objects….
• ….metadata record will be used as the landing page…
• ….does not replace other identifiers.
• To encourage its use in data citation the DOI will be displayed
in repository metadata.
• To enable tracking of citations, DOIs must be applied to new
versions of objects.
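The eligibility criteria in this snapshot can be read as a simple checklist. The following is a minimal sketch of that reading, not GA's actual implementation: the `DigitalObject` record and its field names are hypothetical, chosen only to mirror the bullet points above.

```python
from dataclasses import dataclass


@dataclass
class DigitalObject:
    """Hypothetical record for an object held in a repository."""
    title: str
    openly_accessible: bool
    stable: bool
    online: bool
    quality_assured: bool
    in_approved_repository: bool
    restricted_or_embargoed: bool = False


def doi_eligibility_problems(obj: DigitalObject) -> list[str]:
    """Return the criteria the object fails; an empty list means eligible."""
    checks = [
        (obj.openly_accessible, "not openly accessible"),
        (obj.stable, "not stable"),
        (obj.online, "not available on the internet"),
        (obj.quality_assured, "not quality assured"),
        (obj.in_approved_repository, "not held in an approved repository"),
        (not obj.restricted_or_embargoed, "restricted or embargoed"),
    ]
    return [reason for passed, reason in checks if not passed]


# Illustrative objects only; these are not real GA holdings.
good = DigitalObject("Example report", True, True, True, True, True)
embargoed = DigitalObject("Example survey", True, True, True, True, True,
                          restricted_or_embargoed=True)
print(doi_eligibility_problems(good))       # no failed criteria
print(doi_eligibility_problems(embargoed))  # one failed criterion
```

Returning the list of failed criteria, rather than a bare yes/no, makes it easier to tell a data producer what needs fixing before a DOI can be requested.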
DOI Policy snapshot cont.
• Where there is a collaborating institution, the institution
that is responsible for publishing the object and is able to
provide a persistent link is responsible for minting the DOI.
Where a DOI has been minted by the collaborating institution,
it will be manually entered into the metadata.
• Where a publisher has already minted a DOI, there is no need
to mint another. Therefore, if it is known that a DOI has
already been created, that DOI will be entered into the
metadata.
• The decision to apply a DOI will be built into the research
review process.
Procedures for data citation (in Draft) snapshot
For the data producer:
• Provide really good metadata
• Allow secure access through a GA-approved, quality, stable,
long-term data storage facility
For the data reuser:
• Include GA data citations in your publication/research
bibliography
• Include version numbers where required
• Include type and version of software where available
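As an illustration of what a reuser's bibliography entry might look like, the sketch below assembles a citation string with an optional version number and software note. The ordering (creator, year, title, version, publisher, identifier) loosely follows a common DataCite-style pattern and is an assumption here, not GA's draft procedure; the function name and all example values, including the DOI, are hypothetical.

```python
from typing import Optional


def format_data_citation(creator: str, year: int, title: str,
                         publisher: str, doi: str,
                         version: Optional[str] = None,
                         software: Optional[str] = None) -> str:
    """Build a one-line data citation; version and software are optional."""
    parts = [f"{creator} ({year}): {title}."]
    if version:
        parts.append(f"Version {version}.")
    parts.append(f"{publisher}.")
    if software:
        parts.append(f"Processed with {software}.")
    # Express the DOI as a resolvable link.
    parts.append(f"https://doi.org/{doi}")
    return " ".join(parts)


# Hypothetical dataset and DOI, for illustration only.
print(format_data_citation(
    creator="Example, A.",
    year=2014,
    title="Example gravity grid",
    publisher="Geoscience Australia",
    doi="10.1234/example",
    version="2.1",
))
# prints: Example, A. (2014): Example gravity grid. Version 2.1. Geoscience Australia. https://doi.org/10.1234/example
```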
What’s next at GA?
• IGSN – International GeoSample Number
• 9-character alphanumeric code that uniquely identifies samples
• Exploring legalities of involvement
• ORCID – Open Researcher and Contributor ID
• 16-digit number which can appear as a URI
• Many at GA have an ORCID, but are not yet realising its full
benefit
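To make the "16-digit number which can appear as a URI" concrete: the final character of an ORCID iD is a check character computed with ISO 7064 MOD 11-2, so it may be a digit or "X". The sketch below validates an iD and renders it in URI form. The function names are illustrative, and the two iDs used are the sample identifiers that appear in ORCID's own documentation.

```python
def orcid_check_character(base_digits: str) -> str:
    """Compute the ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid: str) -> bool:
    """Check the length, digit content and check character of an ORCID iD."""
    compact = orcid.replace("-", "")
    if len(compact) != 16 or not compact[:15].isdigit():
        return False
    return orcid_check_character(compact[:15]) == compact[15].upper()


def orcid_uri(orcid: str) -> str:
    """Return the URI form of a valid ORCID iD."""
    if not is_valid_orcid(orcid):
        raise ValueError(f"not a valid ORCID iD: {orcid!r}")
    return f"https://orcid.org/{orcid}"


print(is_valid_orcid("0000-0002-1825-0097"))  # True
print(is_valid_orcid("0000-0002-1694-233X"))  # True (check character is X)
print(is_valid_orcid("0000-0002-1825-0098"))  # False (wrong check digit)
print(orcid_uri("0000-0002-1825-0097"))       # https://orcid.org/0000-0002-1825-0097
```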
Amanda’s wishlist
• Storage of previous instances of dynamic datasets
• DOI minting on the fly for dynamic dataset snapshots
• One-stop data management shop for researchers in the form
of a LibGuide
What we said we need to do in 2012…
[Diagram: data management. Labels on the slide: ICT; infrastructure;
services; systems; tools; connectivity; security; support; standards;
vocabularies; thesauri; procedures; classification; custodianship;
data models; formats; archive; processes; cite data; protocols;
publish; policy & governance; policy; investment; management plans;
data specifications; QA; approval; audit; legislation; compliance;
licensing; support; culture change]
Data stewardship essentials: governance, technology, culture
The building blocks... or cogs... of data stewardship
technology
• Information/business architecture
• Data & services architecture
• Enterprise platforms
• Enterprise systems & tools
• HPC/HPD
people (culture change)
• Data custodians
• Data managers
• Data scientists
• Geoscientists
• Projects
• Product managers
• Executives & SLT
governance
• Policy
• Standards
• Procedures
• Compliance
• Alignment & interoperability
• Value
Data & Services Architecture
[Architecture diagram. Labels recovered from the slide, grouped by
layer where the grouping is evident:]
• User tasks: Find…, View &/or Read…, Download &/or Link to…;
Find…, Access…, Use & Analyse…, Integrate &/or Mash-up…
• Users: human web users; machine web users; external & internal
users who discover, access & analyse directly from user tools &
systems, e.g. GIS, HPC, modelling
• GA website: mobile-enabled web components; refer to exemplars of
great design in Govt websites
• NaviGAtor: search, discover, limited visualisation, endpoint to
access products
• Social media: fast 1-way info push, stakeholder collaboration?
• Channels: search & discover; map & visualise; specialist
applications
• Portal search: Data.gov.au/FIND (GA), AODN, ANDS,
geoscience.gov.au (GA); Google search
• Technology platform delivers standard channels: catalogue services
(CS-W, eCAT); backend simple data services (public API, Google Maps
Engine); OGC services (WMS, WFS, WCS)
• Transform layer: standardised, controlled & managed data, tools
infrastructure, extraction & rendition
• Content management system: static content; landing pages;
downloads; web content files & objects; unstructured (linked) data
• Data services, analytics, information, decision support: apps e.g.
NOPIMS, NEDF, CIAP, NFRIP; workflow engines & other apps, e.g. Qld
Globe, MetEye; view & mashup 2D & 3D data, graph & facts; dashboard?
• Toolkit delivers standard extract-transfer-load channels
• Integrated data platform:
products: packaged data, publications, images, object files stored
in CDS, TRIM
metadata: structured meta-databases eCAT plus EODs, NOPIMS, library,
GADDS, via eCAT
data: structured databases, unstructured data files, high
performance data
[Diagram: DOI minting process. Components: eCat application; eCat
database; ANDS/DataCite; discovery and delivery system; web service;
ANDS; Data.gov.au. Flow labels: DOI request (XML); DOI; creation;
publishing (XML, HTML).]
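The "DOI request (XML)" step in the minting diagram could look roughly like the sketch below, which builds a small metadata document from a catalogue record. It is an assumption-laden illustration, not the eCat implementation: the element names follow the mandatory properties of the DataCite Metadata Schema (identifier, creators, titles, publisher, publicationYear), the namespace and schema version are deliberately omitted, and the function name and example values are hypothetical.

```python
import xml.etree.ElementTree as ET


def build_doi_request_xml(doi: str, creators: list[str], title: str,
                          publisher: str, year: int) -> str:
    """Serialise a minimal DataCite-style metadata record as an XML string."""
    resource = ET.Element("resource")
    identifier = ET.SubElement(resource, "identifier", identifierType="DOI")
    identifier.text = doi
    creators_el = ET.SubElement(resource, "creators")
    for name in creators:
        creator = ET.SubElement(creators_el, "creator")
        ET.SubElement(creator, "creatorName").text = name
    titles_el = ET.SubElement(resource, "titles")
    ET.SubElement(titles_el, "title").text = title
    ET.SubElement(resource, "publisher").text = publisher
    ET.SubElement(resource, "publicationYear").text = str(year)
    return ET.tostring(resource, encoding="unicode")


# Hypothetical record; the DOI prefix is a placeholder, not a real GA prefix.
xml_text = build_doi_request_xml(
    doi="10.1234/example",
    creators=["Example, A."],
    title="Example gravity grid",
    publisher="Geoscience Australia",
    year=2014,
)
print(xml_text)
```

A real request would also need the schema namespace, any further properties the minting service requires, and authentication, none of which are shown on the slide.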
Any questions?
Thank you
Phone: +61 2 6249 9302
Web: www.ga.gov.au
Email: amanda.steen@ga.gov.au
Address: Cnr Jerrabomberra Avenue and Hindmarsh Drive, Symonston ACT 2609
Postal Address: GPO Box 378, Canberra ACT 2601