Data Archiving and Networked Services

Certification and Dutch data management services
Marjan Grootveld
LIBER workshop, Vienna, May 19-20, 2014
DANS is an institute of KNAW and NWO

Contents
• DANS
• Research funders
• Federated data infrastructure
• Data and Trust
• European framework for certification
  – DSA, DIN, ISO
• Other approaches
• Ongoing work
• Horizon 2020
• Perspective

What is DANS?
• Institute of the Dutch Academy and Research Funding Organisation (KNAW & NWO) since 2005
• Mission: promote and provide permanent access to digital research information
• First predecessor dates back to 1964 (Steinmetz Foundation); Historical Data Archive 1989

Our services
• EASY: Electronic Archiving System for self-deposit
• Support for Data Management
• Persistent Identifier URN:NBN resolver
• NARCIS: Gateway to scholarly information in NL

http://zonmw.nl/
• Maximum access to data
• Sustainably stored in a digital repository with a Seal of Approval.

http://www.nwo.nl/
• Compliance with codes of conduct

Slogan DANS: ‘Open if possible, protected if necessary’

NL Code of Conduct for Scientific Practice
III. Verifiability
Principle: Presented information is verifiable.
Whenever research results are publicized, it is made clear what the data and the conclusions are based on, where they were derived from and how they can be verified.

DMP clues:
• accurately documented;
• quality of data collection;
• all steps taken must be properly reported;
• raw data;
• retention period;
• archiving.

Best Practice
III.1 Research must be replicable in order to verify its accuracy. The choice of research question, the research set-up, the choice of method and the reference to sources studied is accurately documented.
III.2 The quality of data collection, data input, data storage and data processing is guarded closely. All steps taken must be properly reported and their execution must be properly monitored (lab journals, progress reports, documentation of arrangements and decisions, etc.).
III.3 Raw research data are stored for at least five years. These data are made available to other scientific practitioners at request.
III.4 Raw research data are archived in such a way that they can be consulted at a minimum expense of time and effort.
III.5 The source of all educational material, including oral information transfer, is stated.
http://www.vsnu.nl/

Plan or planning?
• Data management is a part of professional research.
• Data awareness and planning are more important than the actual DMP. Yet:
  – Make your plan explicit and keep it up to date;
  – Try not to reinvent the wheel: lots of DMP templates available;
  – Involve other stakeholders.

Data management: who?
• Funders (€$£)
• Institution
• Research data policy

Position within the federated data infrastructure: a collaborative framework
• Data Curation, Trust
• Data Generators, Data Users
• User functions: data capture and transfer
• Front offices:
  – Local Data Facilities ((University) Libraries)
  – Domain-Specific Research Infrastructures
  – Community Support Services
• Back offices: DANS, 3TU.Datacenter, …
  – Common Data Services: Archiving, Access, …
• Basic Technical Infrastructure: SURFsara, Target, …
  – Common Data Services: Storage, Backups, …

What is trust built on?
• Dedicate yourself (mission statement)
• Do what you promise (stable, sincere and competent reputation)
• Be transparent (peer review, get certified)

Certification of digital repositories
• OAIS (ISO 14721)
• Trusted Digital Repositories: Attributes and Responsibilities
• TRAC
• Audit and Certification of Trustworthy Digital Repositories (ISO 16363), concerning:
  – Organizational Infrastructure, e.g. "The repository shall have a documented history of the changes to its operations, procedures, software, and hardware."
  – Digital Object Management, e.g. "The repository shall have access to necessary tools and resources to provide authoritative Representation Information for all of the digital objects it contains."
  – Infrastructure and Security Risk Management, e.g. "The repository shall have procedures in place to evaluate when changes are needed to current software."
• Requirements for Bodies Providing Audit and Certification (ISO 16919): "… be applied to the evaluation of the trustworthiness of digital repositories using ISO 16363. It covers principles needed to inspire confidence that third party certification of the management of the digital repository has been performed with impartiality, competence, responsibility, openness, confidentiality, and responsiveness to complaints."
• …ki.digitalrepositoryauditandcertification.org and …lliancepermanentaccess.org/membership/member-resources/audit-and-certification
• … will be available free from http://www.ccsds.org

European framework for audit and certification of digital repositories (to be promoted by the EU)
International framework: 3 standards, 3 levels
• Basic Certification: monitored self-audit using DSA metrics
• Extended Certification: Data Seal of Approval plus monitored self-audit using ISO 16363 (or DIN 31644 in Germany)
• Formal Certification: audit by external auditors
http://www.trusteddigitalrepository.eu

DSA: basic certification
• DANS initiative (2005/6)
• International Board (2009)
• 16 guidelines
• Self assessment & review
• Transparency
• Almost 30 seals awarded
Data producers are responsible for the quality of research data, repositories for storage and long-term access, and users for correct use of data.
http://datasealofapproval.org/

The research data:
• can be found on the Internet
• are accessible (clear rights and licenses)
• are in a usable format
• are reliable
• can be referred to (persistent identifier)

DIN 31644: extended certification
• 34 criteria written by the German nestor group and adopted as DIN 31644
• Self assessment & review
• Self assessment and evidence on website
• No seals acquired yet…
http://www.langzeitarchivierung.de/Subsites/nestor/

ISO 16363: formal certification
• Based on Open Archival Information System (OAIS) and Trustworthy Repositories Audit and Certification (TRAC)
• Over 100 metrics
• Test audits 2011 by Primary Trustworthy Digital Repository Authorisation Body (PTAB)
• Full external auditing process
• ISO 16919: Requirements for bodies providing audit and certification of candidate trustworthy digital repositories
• No ISO certifications yet…
http://www.iso16363.org/

ESFRI Research Infrastructures and Trust

Requirements for CLARIN Centres
“Centres need to have a proper and clearly specified repository system and participate in a quality assessment procedure as proposed by the Data Seal of Approval or MOIMS-RAC approaches”

Building Trust: CESSDA Self-Assessment Project
Participants from fifteen CESSDA member organisations discussed the CESSDA-ERIC requirements
and agreed upon using the Data Seal of Approval (DSA) guidelines as a tool to gain information on the level of their conformance with the DSA and the CESSDA-ERIC requirements.

H2020 Open data pilot
• Current Guidelines Open Research Data Pilot do not mention “trusted digital repositories”; Annex 2 mentions “certified repositories” once.
• Understandable: there are few certified TDRs yet.
• E-infrastructure call H2020: Services to ensure the quality and reliability of the e-infrastructure, including certification mechanisms for repositories and certification services to test and benchmark capabilities in terms of resilience and service continuity of e-infrastructures.

Netherlands Coalition for Digital Preservation
Mission: To establish an infrastructure for long-term access to all public-sector digital objects in the Netherlands within five years
[Diagram: coalition network grouped by sector. Sector labels: Media, Cultural heritage, Government/archives, Scholarly communities. Names shown: Local TV, Provincial archives, Sound and Vision, National Archives, RHC, Film, Rotterdam, Museum, RCE, Ministries, KB, KNAW, DANS, VP, DEN, SURF, Max Planck, NWO, Amsterdam.]

Perspective
• Institution
• Research data policy
• Funders (€$£)
• Data Archiving and Networked Services
… and how does this fit?

For more information:
marjan.grootveld@dans.knaw.nl
www.dans.knaw.nl