Data citation at Geoscience Australia Policy Amanda Steen (Systems and Data Librarian) Infrastructure to support data citation Dr Sue Fyfe (Director, Data Governance & Services Section) Introduction Geoscience Australia is Australia’s national geoscience agency developing and providing geographical and geological data and knowledge for the Australian Government to support it in delivering its priorities. We also provide advice and information to industry and other stakeholders such as emergency services, minerals and petroleum industries, primary producers, telecommunications agencies, and more. Vision: Geoscience Australia is custodian of the geographic and geological data and knowledge of the nation. We create, maintain and disseminate geographic and geological knowledge for the future well being of all Australians. Data citation at GA 2014 Why does GA need data citation? • Enables validity of GA data • Enables validation of research (for example in the peer review process) • Scientists are able to be more aware of existing research data and can reuse it, which could prevent costly research being done again (for example re-exploration of mineral resources in areas already investigated) • Can be used to evaluate, acknowledge and reward the work done by GA’s scientists Data citation at GA 2014 What is GA doing in its move toward data citation • Data citation standard • An overarching citation standard • Dynamic datasets • Persistent identifiers • DOIs • Began minting DOIs for publications via CrossRef in January 2014. • Will soon begin minting for datasets with ANDS once M2M is established. Data citation at GA 2014 > DOI Policy snapshot • Can be applied to any type of digital object • Must be openly accessible, stable, available on net, quality assured and held in a GA approved repository • Can be applied at different levels of granularity…. • Will not be applied to restricted or embargoed objects…. • ….metadata record will be used as the landing page… • ….does not replace other identifiers. • To encourage its use in data citation the DOI will be displayed in repository metadata. • To enable tracking of citations, DOIs must be applied to new versions of objects. Data citation at GA 2014 > DOI Policy snapshot cont. • Where there is a collaborating institution, the institution responsible for publishing the object and is able to provide a persistent link is responsible for minting the DOI. Where a DOI has been minted by the collaborating institution, it will be manually entered into the metadata. • Where a publisher has already minted a DOI, there is no need to mint another. Therefore, if it is known that a DOI has already been created, that DOI will be entered into the metadata. • The decision to apply a DOI will be built into the research review process. Data citation at GA 2014 > Procedures for data citation (in Draft) snapshot For the data producer: • Provide really good metadata • Allow secure access through a GA approved quality, stable, long-term data storage facility For the data reuser: • Include GA data citations in your publication/research bibliography • Include version numbers where required • Include type and version of software where available Data citation at GA 2014 > What’s next at GA? • IGSN – International GeoSample Number • 9 digit alphanumeric that uniquely identifies samples • Exploring legalities of involvement • ORCID – Open Researcher and Contributor ID • 16 digit number which can appear as a URI • Many at GA have an ORCID, but not yet utilising full benefit Data citation at GA 2014 > Amanda’s wishlist • Storage of dynamic datasets previous instances • DOI minting on the fly for dynamic dataset snapshots • One-stop data management shop for researchers in the form of a LibGuide Data citation at GA 2014> What we said we need to do in 2012… data management ICT infrastructure services systems tools connectivity security support standards vocabularies thesauri procedures classification custodianship data models formats archive processes cite data protocols publish policy & governance policy investment management plans data specifications QA approval audit legislation compliance licensing support culture change Data stewardship essentials: governance, The building blocks.. or cogs... of data stewardship • • • • • Information/business architecture Data & services architecture Enterprise platforms Enterprise systems & tools HPC/HPD • • • • • • • Data custodians Data managers Data scientists Geoscientists Projects Product managers Executives & SLT governance • • • • • • Policy Standards Procedures Compliance Alignment & interoperability Value technology people (culture change) Data stewardship essentials: governance, Data & Services Architecture mobile-enabled web components Find… View &/or Read… Download &/or Link to… GA WEBSITE refer to exemplars of great design in Govt websites NaviGAtor – search, discover, limited visualisation, endpoint to access products Fast 1-way info push, Stakeholder collaboration? Find… Access… Use & Analyse… Integrate &/or Mash-up… social media search & discover map & visualise specialist applications Portal search Data.gov.au/FIND (GA), AODN, ANDS, geoscence.gov.au (GA) technology platform delivers standard channels catalogue services CS-W (eCAT) Backend simple data services Public API, Google Maps Engine OGC services WMS, WFS, WCS external & internal user discover, access & analyse directly from user tools & systems e.g. GIS, HPC, modelling transform layer Standardised, controlled & managed data, tools infrastructure, extraction & rendition web content files & objects; unstructured (linked) data Google search data services analytics information decision support apps e.g. NOPIMS, NEDF, CIAP, NFRIP, workflow engines & other apps e.g. Qld Globe, MetEye downloads Machine Web Users content management system static content view & mashup 2D & 3D data, graph & facts dashboard? landing pages Human Web Users toolkit delivers standard extract-transfer-load channels products metadata data packaged data, publications, images, object files stored in CDS, TRIM structured meta-databases eCAT plus EODs, NOPIMS, library, GADDS, via eCAT structured databases, unstructured data files, high performance data integrated data platform Data stewardship Insert essentials: Title Here governance, <view/master/slidemaster> technology, culture Data stewardship essentials: governance, ANDS/ DataCite eCat application DOI minting process DOI eCat database DOI request XML Creation Publishing XML HTML Discovery and delivery system Web service ANDS Data.gov.au Any questions? Thank you Phone: +61 2 6249 9302 Web: www.ga.gov.au Email: amanda.steen@ga.gov.au Address: Cnr Jerrabomberra Avenue and Hindmarsh Drive, Symonston ACT 2609 Postal Address: GPO Box 378, Canberra ACT 2601