Building Blocks for Open Science: Opportunities for Library Engagement at EU Level Susan Reilly @skreilly Executive Director Ljubljana, October 2015 I’ll be talking about… • LIBER & Open Science • Open Science in the EU • Definition of Open Science • Building blocks of Open Science • What can libraries do to support Open Science? LIBER: reinventing the library of the future • Largest network of European research libraries: 410 in over 40 countries Mission: “To provide an information infrastructure to enable research in LIBER institutions to be world class” LIBER: Information Infrastructure for World Class Research? • Collaborative • Growth in collaboration from 13% (2003)- 17% (2011) • International • 40% of French & German research outputs a result of international collaboration • Rate of citation grows as geographic extent of collaboration increases • Interdisciplinary • Foundation of frontiers research • Data intensive • supports interdisciplinary exploration • … and open Higgs boson 2012 Journal Physics B paper: 6235 citations Libraries enabling Open Science “We believe that the move towards openness will lead to increased transparency, better quality research, a higher level of citizen engagement, and will accelerate the pace of scientific discovery through the facilitation of data-driven innovation.” http://libereurope.eu/wpcontent/uploads/2014/09/LIBER_Statement-on-openscience-final.pdf European Commission Council of Europe • Launch of Open Science Cloud • H2020 Open Data Pilot • Development of open science agenda • Importance of skills, infra Open Science Definition “The conduction of science in a way that others can collaborate and contribute, where research data, lab notes and other research processes are freely available, with terms that allow reuse, redistribution and reproduction of the research” https://www.fosteropenscience.eu/foster-taxonomy/open-science-definition The problem with defining Open Science • Means is often confused with the end • Ultimate goal is to be a transient term i.e. Open Science = Science • Aims to bring coherence and vision to a range Science of different Open open activities e.g. open access, = Diversity open data, open software • Key is changing practice and culture, which is different for every stakeholder Open Science Goals • Transparency in experimental methodology, observation, • • • • and collection of data Public availability and reusability of scientific data Public accessibility and transparency of scientific communication Citizen engagement* Using web-based tools to facilitate scientific collaboration Dan Gezelter, http://www.openscience.org/blog/?p=269 *EU Move away from a 300 year old model! Work. Finish. Publish! (Faraday) To an Open Science Landscape Collaboration Open infrastructure New forms of peer review Research data management Open access publishing Policy Massive Open Online Courses (MOOCs) Alternative Metrics Open data Open science Advocacy & training Coyright & licencing Open educational resources Building block: open access “We write to communicate an untenable situation facing the Harvard Library. Many large journal publishers have made the scholarly communication environment fiscally unsustainable and academically restrictive. ” Harvard University Library, 2012 Moved beyond the tipping point: http://www.sciencemetrix.com/pdf/SM_EC_OA_Availability_20042011.pdf OpenAire2020 Post FP7 Gold Open Access Pilot EUR 4m funding provided by the EC to support Open Access publications from post-grant FP7 projects finished no longer than 2 years ago. Guidelines develop after consultation: Maximum of three publications per project to be funded (research articles, monographs, book chapters, contributions to conference proceedings) which meet the requirements described in the Pilot policy guidelines. No publications in hybrid journals will be funded, only in fully Open Access titles. A €2,000 funding cap is in place. Pilot (soft) launched beginning of May 2015. ,https://goldoa-pilot.openaire.eu/ Building block: open data • Need to release the value of data • Benefits: • Jobs (Copernicus= 50000 jobs) • Research productivity (big bang) • Help communities (flood hack) • Cost of not sharing (bird flu) 1.7 million billion bytes of date every minute X 34 Data must be.. • Open by default (G8,LERU) • Usable by all • Available • Findable • Interpretable • Citable • Curated/preserved The Data Publication Pyramid Publications with data (2) Further data explanations in any kind of supplementary files to articles (4) Data publications, describing available datasets Processed Data and Data Representations Data Collections and Structured Databases Raw Data and Data Sets (1) Data contained and explained within the article (3) Data referenced from the article and held in data centers and repositories (5) Data in drawers and on disks at the institute Building block: advocacy… • Advocate for roadmaps and policies that promote open science at institutional and national level • Advocate for changes in practice e.g. data citation, use of cc licences and incentives • Need to change system of incentives and assessment • Move away from journal based metrics • Consider value and impact of ALL research outputs (data, software…) • Align assessment with institutional values • Only a change of system of incentives will truly change practice and culture Building block: infrastructure • International • Open • Interoperable • Cross disciplinary • Facilitate collaboration • • • • • Store & Share Sync & Exchange Replicate Compute Find • • • • • Content TDM tools Workflows Standards Interoperability Building block: skills and training ReCODE Recommendation 10: Support the transition to open research data through curriculum-development and training.The transition to an open science paradigm where research data plays a significant role requires training and education for researchers and for data managers who support open science. Courses for getting researchers and data managers up-to date with current relevant issues are necessary, as well as the development of curricula that contribute towards the development of data science and information management as distinct and legitimate career paths. •Need to embed training in post graduate education •Invest in the development of the data professional •Training provision as and when needed (importance of train the trainer) •Training and support for new tools and methods Building block: policy and legislation • Legal clarity • Interoperabilty (WIPO solution?) • Ensure researchers have right to secondary publication • Standard open access licences • CC-by and CC0/PD Copyright v TDM • Because it involves the copying of content in order to convert into machine readable format TDM may infringe copyright • European Database Directive prohibits copying of substantial parts of databases • In US TDM is covered by fair use, other parts of the world have a specific exception e.g. Japan, UK https://www.flickr.com/photos/apelad/304195427/ Knowledge Discovery in the Digital Age • Ultimate goal of text and data mining is to extract high level knowledge from low level data • Allows analysis across disciplines • “Undiscovered public knowledge” (Swanson) • Identifies patterns in the data to produce new knowledge • It’s not a new thing, it’s just digital information makes it a whole lot more powerful and relevant! Elsevier TDM Policy • Access through API only • Text only- no images, tables • Research must register details • Click-through licence • Terms can change any time • Reproducibility of results Key Principles • Copyright not intended to govern access to facts, ideas and data, nor should it • Need to move beyond the tipping point of open access • Protect academic freedom (no monitoring) • Human rights not to be undermined by contract • Evolution of ethics (standards and legislation) • No artificial restrictions on innovation Libraries enabling Open Science • Support Open Access • Deposit XML in OpenAire compliant repositories • Explore new business models- try publishing! • Be transparent! • Advise on use of licences • https://www.openaire.eu/postgrantoapilot Libraries enabling Open Science • Get started in research data management • Research data management plans • Partner with data centres (back-office/front-office model) • Curate long tail data Libraries’ Opportunities Data Issue: Libraries and data centres opportunities (Chapter 4): Availability Lower barriers to researchers to make their data available. Integrate data sets into retrieval services. Findability Support of persistent identifiers. Engage in developing common metadescription schemas and common citation practices. Promote use of common standards and tools among researchers Interpretability Support crosslinks between publications and datasets. Provide and help researchers understand metadescriptions of datasets. Establish and maintain knowledge base about data and their context. Re-usability Curate and preserve datasets. Archive software needed for re-analysis of data. Be transparent about conditions under which data sets can be re-used (expert knowledge needed, software needed). Citability Engage in establishing uniform data citation standards. Support and promote persistent identifiers. Curation/Preservation Transparency about curation of submitted data. Promote good data management practice. Collaborate with data creators Instruct researchers on discipline specific best practices in data creation (preservation formats, documentation of experiment,…) Libraries enabling Open Science • Provide training and develop the workforce • Best practice in research data managment • Guidance on copyright & licences • Training on tools Developing our own workforce! Advocate & engage • Within institutions to develop policies and roadmaps • Towards researchers to highlight benefits of open science • With other stakeholders at insitutional level and internationally • Gather and provide evidence for the need for changes in legislation and policy • Promote and engage with citizens • Unite! • Sign the Hague Declaration! Be the local interface: space and experimentation • The Hague Declaration: http://thehaguedeclaration.com/the• • • • • • • hague-declaration-on-knowledge-discovery-in-the-digital-age/ LERU Roadmap for Research Data http://www.leru.org/index.php/public/news/press-release-leruroadmap-for-research-data/ EUDAT http://eudat.eu/ Research Data Alliance https://rd-alliance.org/ LIBER 10 Recommendation on Getting Started in RDM http://libereurope.eu/wpcontent/uploads/The%20research%20data%20group%202012%20v7%2 0final.pdf OpenAire https://www.openaire.eu/ San Francisco Declaration http://www.ascb.org/dora-old/files/SFDeclarationFINAL.pdf