1 Endorsement of new data publishers Introduction Prospective data publishers must be endorsed by a GBIF node or the Participant Node Managers Committee before they can be registered and allowed to publish data to the GBIF network. Documentation on this subject consists of the following parts: 1. Background summary and context on what endorsement is and why it is necessary 2. Questionnaire to be completed by a prospective data publisher 3. Guidelines and criteria to help endorsers in evaluating potential new data publishers 4. Endorsement role players and diagrammatic workflow Background summary and context Before datasets can be indexed into the central access system (www.gbif.org), the institution or organization sharing the data has to be approved as a data publisher by a GBIF Participant, through a procedure called "endorsement". The endorsement procedure aims to secure the following key outcomes: (1) that published data are relevant to GBIF’s scope and objectives; (2) stable, persistent data hosting (3) strong national, regional and thematic engagement around data publishing and use (4) open data sharing and reuse (5) improved data quality through responsiveness of data publishers to feedback To help the endorsing GBIF Participant make an informed decision, a prospective data publisher is requested to complete an online questionnaire or a PDF form version return it to GBIF Secretariat. Questionnaire online form as well as pdf form Items marked as (***) are considered critical for facilitating decision making. Items marked as (**) are important and should be provided where known, while those marked as (*) are useful and should also be provided to help the GBIF community to know your institution better. Contact information for your institution (or organization) Name of institution*** Institutional email*** and telephone number** Administrative contact person’s name***, email*** and telephone number** Technical contact person’s name***, email*** and telephone number** Institutional web address** Institutional physical address* Scope in relation to the GBIF mission GBIF deals with biodiversity data of the following types: species occurrence records including spatial and temporal information based on observations, specimens and other 2 evidence; sample-based data including measures of abundance based on defined sampling protocols; species checklists including area-based and thematic lists as well as taxonomic catalogues; and structured metadata describing biological datasets. Briefly describe the scope of your institution/organization in relation to GBIF's mission (e.g. collection holdings, research focus, biodiversity information management, etc.).*** Data hosting capacity GBIF.org provides a data access service that presents an index of published primary data and metadata. The responsibility of hosting the original data remains with the data publisher or owner. In order to fulfil this requirement, please answer the following questions. Do you have EITHER the capacity to run a live server, OR access to a server, through which your original dataset will be served to GBIF.org?*** Are you planning to install and run a publishing software installation such as the GBIF Integrated Publishing Toolkit (IPT)1 , BioCASe2, etc, on your server, or the server to which you have access, to publish your data directly to GBIF.org? *** If you have answered NO to either of the previous questions: are you aware of a technical host institution that would be able to perform these functions on your behalf? o If YES, please specify o If NO, would you like to receive suggestions for such a data hosting facility?*** Example dataset In order to assist us to understand the nature of your data, please provide a link to some example data holdings, or send a sample spreadsheet by email, if either is available* Responsiveness to feedback Are you willing and able to take reasonable measures to respond to queries and feedback from users of your dataset?* Endorsing Participant node*** Publishers from GBIF Participant Countries o If your country is a GBIF Voting Participant (see list at http://www.gbif.org/participation/list#voting) or Associate Country Participant (see list at http://www.gbif.org/participation/list#associate) we will seek endorsement from your national node unless you specifically request for endorsement from one of the organisational nodes also referred to as Other Associate Participant (see list at http://www.gbif.org/participation/list#other). Please select one option on who will endorse you. I wish to be endorsed by my national node I wish to be endorsed by Other Associate Participant and select >>>>>> Publishers from countries that are not yet GBIF Participants and intergovernmental organizations, international organizations and other organizations with an international scope, and economies 1 http://www.gbif.org/ipt 2 http://www.biocase.org/ 3 o If you are from or a country that is not currently a participating member of GBIF or other described organization types and therefore do not have a national node, please indicate one of the following endorsement options: 1. My institution has an existing connection with one of the organisations currently participating in GBIF (see list at http://www.gbif.org/participation/list#other) and I request endorsement from the following organization: ……………….. You are encouraged to inform the organisation you select to endorse you about your decision. 2. We request the GBIF Secretariat to seek endorsement on our behalf from the wider GBIF community through the GBIF Participant Nodes Committee. Terms and conditions I have read and understood GBIF's Data Use Agreement (http://www.gbif.org/disclaimer/datause) and Data Sharing Agreement (http://www.gbif.org/disclaimer/datasharing) I understand that I am seeking registration on behalf of my institution, and confirm that the responsible authorities of my institution are aware of this registration. I understand that my institutional information, including the contact details provided, will be made publicly available in GBIF.org Please return the filled questionnaire to: [not applicable in case of an online form] GBIF Secretariat, Universitetsparken 15, DK-2100 Copenhagen, Denmark email: helpdesk@gbif.org fax: +45-35321480 4 Endorsement guidelines and criteria for endorsers Endorsement of new data publishers is a GBIF community standard that aims to secure the following key outcomes: 1. data of relevance to GBIF’s scope and objectives 2. stable, persistent data hosting 3. strong national, regional and thematic engagement around data publishing and use 4. open data sharing and reuse 5. improved data quality through responsiveness of data publishers to feedback A Node Manager should endorse a prospective publisher if s/he judges that the applicant acts as the authorized representative of a bona fide institution/entity, fulfilling the following criteria: 1. has, or will have, datasets of potential scientific interest consistent with GBIF's mission 2. has the technical capacity or access to technical capacity to ensure that the datasets are served in suitable formats and remain online (sustainable solution) 3. is able to respond to requests for further curation of published data 4. Accepts the GBIF Data Use Agreement3 and Data Sharing Agreement4 The Node Manager makes a judgement based on the information provided in the questionnaire and where appropriate through direct communication with the prospective publisher. The criteria may be met at the time of first registration, or aspired to and established during interactions between prospective publisher, endorsing node and the Secretariat. Minimum requirements for endorsement It is highly recommended that endorsers check that the prospective publisher has satisfactorily addressed the questions on the following aspects: Existence or authenticity of the institution/organization This is to ensure that GBIF endorses only institutions or organizations known to exist. Societies and networks qualify to be treated as organizations. Individuals are not currently eligible for registration but they can publish through an institution/organization or network and they will be properly credited and cited as the owners or authors of the respective datasets. Knowing the address of the institution and the contact persons will facilitate communication and networking, for example in dealing with queries and feedback on data quality. It is highly recommended that two contact persons are indicated to provide an alternative in the event that one contact cannot be reached. Scope aligned with the GBIF mission This is to ensure that nodes endorse only those institutions dealing with biological data with relevance to the current scope of GBIF. The guidance provided to prospective publishers defines such data as: species occurrence records including spatial and temporal information based on observations, specimens and other evidence; sample-based data including measures of abundance based on defined sampling protocols; species checklists 3 http://www.gbif.org/disclaimer/datause 4 http://www.gbif.org/disclaimer/datasharing 5 including area-based and thematic lists as well as taxonomic catalogues; and structured metadata describing biological datasets. Nodes are requested not to endorse institutions that appear to hold data entirely outside the current scope of GBIF’s mission. To help with this judgement, prospective publishers are requested to provide links to example datasets and/or attach example spreadsheets that will be forwarded to nodes where available. It should be noted that some publishers may deliver data beyond their own individual institutional scope as they may host datasets from diverse institutions. If in doubt, please consult GBIFS. Stable data hosting services or a third party host identified GBIF.org provides a data access service that presents an index of published primary data and metadata. The responsibility of hosting the original data remains with the data publisher or owner. Answers to this question will help the Node Manager and GBIFS to support the new data publisher in finding a hosting solution if they do not yet have one. Many new publishers may not meet this requirement at the time of first registration. Node Managers should examine possible solutions that would enable this criterion to be met. The node institution itself may be able to offer a technical hosting solution, or may request such facilities from other publishers within its network. GBIFS may be able to offer advice on alternative hosting solutions. Acceptance of the GBIF Data Use Agreement and Data Sharing Agreement A new publisher applies for registration because it expects to publish datasets on GBIF.org. In keeping with GBIF’s commitment to promote open access and a re-use culture, publishers are expected to understand from the outset that they should not impose unnecessary restrictions on their data. To be endorsed, publishers must accept the conditions specified in the two agreements. Furthermore, GBIF will shortly require all publishers to assign datasets to one of the three Creative Commons licences (CC0, CC-BY and CC-BY-NC) or their equivalent provisions. Responsiveness to feedback A new publisher will be expected to show willingness to respond to questions and feedback from users of their data. The publisher may delegate the task to individual owners of the datasets. Annotations and community curation will in future offload part if this responsibility from publishers, but the general policy remains to ensure best possible data quality at source. A negative answer to this question should not result in denial of endorsement, but Node Managers should in such circumstances strive to encourage publishers to find alternative solutions since this responsibility is critical in furthering the quality and value of published data. Additional guidance notes How far should datasets be evaluated during endorsement? While the formal endorsement of an institution does not require the evaluation of data (other than examining example datasets to ensure the institution is within the scope of a GBIF data publisher), it is nevertheless very valuable to take the opportunity of the endorsement stage to establish an ongoing working relationship between the endorsing node and the new publisher. Dataset evaluation, including the application of quality control routines, is an optional step practised by some nodes and forms part of wider community efforts to improve the fitness-for-use of data accessed through the GBIF network. Endorsement of datasets will be the subject of a separate process (see Fig. 1) What if the endorser has doubts? If the Node Manager has doubts on any points based on the information provided in the questionnaire, he/she should contact the prospective publisher for clarification. Node 6 Managers are also at liberty to carry out due diligence about the organization/institution by making inquiries within the community, checking online sources or other means of investigation. The Node Manager should strive to use all available information provided by the prospective publisher, and from other sources when necessary, in making a fair judgement in the spirit of acting as a good ‘gatekeeper’ for the GBIF community and furthering its mission. Does endorsing a data publisher have legal implications? The endorsement process is a GBIF community procedure and not a legal process. In general, the GBIF community recommends an inclusive rather than over-restrictive approach, and where criteria are not met initially, nodes should seek potential solutions before recommending that endorsement be denied. What if an endorsement is denied, could the prospective publisher 'try' to be endorsed by another endorsing node or appeal the decision? In principle the GBIF Community should strive to be inclusive and work with prospective publishers so that they can meet the minimum requirements for endorsement. In the unlikely event that endorsement is denied and the prospective publisher is dissatisfied, they may appeal through their national node and if the issues are not resolved, the appeal moves to the Nodes Steering Group (NSG). If the NSG cannot resolve the issue, the final arbiter will be the Executive Committee (EC). Matters pertaining to intergovernmental organizations, international organizations and other organizations with an international scope, and economies that are not subject to national nodes will be dealt with at the NSG and EC levels. The GBIF Secretariat will have no role in the decision making during appeals but will serve the administrative role of being the communication office for the appellant and the decision makers at the national node, NSG and EC. Is an endorsement valid forever or can it be revoked? In principle endorsement is supposed to be a one-time process and therefore once a publisher has been endorsed, they should not have to undergo endorsement again in future. Endorsers should take the exercise seriously to avoid situations that could lead them to change their mind about their decision in future. However in the unlikely event that a registered data publisher is deemed to be unsuitable to continue as a GBIF publisher, the endorsement could be revoked through a formal process as follows, if all efforts to rectify the situation are unsuccessful. The revocation case with sufficient justification would be brought to the national node where applicable then to the NSG and finally to the EC for final approval. A data publisher whose endorsement is revoked can reapply after X period to become a data publisher again, in which case they would have to go through the endorsement process as a new data publisher and also demonstrate that the reasons for their revocation have been cured. Service charters Nodes are encouraged to develop service charters based on their capacities to ensure that the endorsement is carried out in a timely and predictable manner. This will help to manage expections of prospective publishers and avoid stalling of processes. Such service charters should spell out what the node does; contact details – email, telephone, skype, chat, etc; standard of service that publishers can expect, e.g., response times, timeliness in decision making; feedback and handling of complaints. Since these service charters will differ from one node to another, it is important for endorsers to consider a reasonable turnaround time for decision making. In this way GBIF Secretariat in consultation with the endorsing node can keep reasonable timelines in which to report back to prospective publishers. Community level endorsement process through the Nodes Steering Group (NSG) 7 For cases in which a prospective data publisher is in a country not yet participating in GBIF, and has no connections with an existing Participant node, the Secretariat will write to the holders of the following under listed positions in the NSG, to request for an endorsing node to be identified: 1. NSG chair 2. NSG first vice chair 3. NSG second vice chair 4. Regional Representative(where relevant) 5. Regional Deputy Representative (where relevant) The respective Regional Representative will identify an endorsing node if the prospective publisher falls clearly within their respective region, and inform the other NSG members and copy GBIFS. The GBIFS will communicate with the identified endorser on behalf of the NSG, when at least 3 NSG members have responded and agreed, by furnishing the endorser with the completed questionnaire from the prospective publisher as well as the guidelines and criteria for endorsement. The NSG post holders through their chair may endorse a prospective publisher without reverting to identify a node to carry out the procedure in the following cases: i. If a publisher is from a country that does not fall in any specific GBIF region. ii. Where the NSG makes a judgement that identifying a specific node to carry out the endorsement might lead to negative sensitivities. iii. If an endorsing node identified by the NSG becomes unresponsive to GBIFS and NSG communications, or fails to complete the endorsement procedure in reasonable time and the NSG considers it not feasible to identify a new endorsing node. All endorsements that will be done through the NSG will be regarded as endorsements by the Nodes Committee and only the collective name, ‘Participant Node Managers Committee’ will appear on GBIF website as the endorser and not the specific name of the endorsing node if involved. Endorsement role players and diagrammatic workflow (Fig. 1) This diagram summarizes the endorsement procedure. (1) A prospective publisher approaches the GBIF Secretariat (GBIFS) directly or through a Participant node, and is asked to complete a questionnaire. (2) The completed questionnaire is submitted, online or by email, to GBIFS which then (3a) passes it on to the preferred endorsing node where indicated. (3b) In cases where the prospective publisher is from a country that is not yet a GBIF Participant and has no connection to an existing Participant node endorsement will follow the community-level endorsement process defined by the Nodes Steering Group (NSG). (4) The endorsing node or NSG evaluates the new publisher’s application following GBIF community guidelines and criteria]. Interactions with the prospective publisher, endorsing node or NSG and GBIFS may follow, for example to clarify issues based on the completed questionnaire and to seek technical solutions for data hosting. (5a) The decision of the endorsing node or the (5b) NSG defined process is communicated to GBIFS which in turn (6) informs the prospective publisher. Where endorsement is approved, the institution is registered as a GBIF data publisher and GBIFS may proceed to index datasets when submitted. 8 Fig. 1: Endorsement workflow