SuperSites Data Deposit Form Contents 1. ...................................................................................... INTRODUCTION 2 1.1 Guidelines for Data Deposit Form Completion ..................................................................................................... 2 2. DATA PACKAGE TITLE .............................................................................................. 3 3. DATA CREATORS ..................................................................................................... 3 4. ABSTRACT .............................................................................................................. 4 5. CONTACTS FOR QUESTIONS ON THE USE AND INTERPRETATION OF DATA .................... 5 6. PROJECT INFORMATION AND DATA OWNERS .............................................................. 5 6.1 Project Title ........................................................................................................................................................... 5 6.2 Data Owners (Data Providers) .............................................................................................................................. 5 6.3 Data Provider Contact........................................................................................................................................... 6 6.4 Nominated Representative ................................................................................................................................... 6 7. FUNDING SOURCES ................................................................................................. 7 8. ASSOCIATED PARTIES ............................................................................................. 7 9. KEYWORDS AND SUBJECT CATEGORIES .................................................................... 8 10. GEOGRAPHIC COVERAGE ....................................................................................... 9 10.1 Bounding Coordinates ........................................................................................................................................ 9 11. TEMPORAL COVERAGE ........................................................................................... 9 12. TAXONOMIC COVERAGE AND CLASSIFICATION ......................................................... 9 13. METHODS AND SAMPLING INFORMATION ............................................................... 10 13.1 Study Extent Description .................................................................................................................................. 11 13.2 Sampling Description ........................................................................................................................................ 11 14. INTELLECTUAL RIGHTS ........................................................................................ 11 14.1 15. TERN Data Provider Deed Special Conditions ................................................................................................... 12 DATA PREPARATION GUIDELINES .......................................................................... 12 FILES ..................................................................................................................... 12 COLUMNS................................................................................................................ 12 15.1 Saving the Data Tables...................................................................................................................................... 13 15.2 Data Table Descriptions .................................................................................................................................... 13 16. DATA TABLE METADATA ....................................................................................... 14 17. SUBMITTING THE DATA PACKAGE TO SUPERSITES .................................................. 15 18. FURTHER ASSISTANCE ......................................................................................... 15 F-1 SuperSites Data Deposit Form 1. INTRODUCTION The purpose of the Australian SuperSite Network (SuperSites) Data Deposit Form is to capture enough information about the data so that we can publish good quality metadata and data, which is consistent with the Ecological Metadata Language (EML)1 specification, to the SuperSites Data Portal. Each Data Package published to the SuperSites Data Portal will normally consist of the following components: 1. Metadata describing the context of the data (i.e. who, why, what, when, where, how). 2. Metadata describing the column headings in each data table supplied. 3. The data tables. Examples of published Data Packages can be viewed in the SuperSites Data Portal at http://www.ternsupersites.net.au/knb/ A Data Package and the component data tables will represent a topic or subject area from your research project. The composition of a data package will usually reflect a type of observation or survey method (e.g. arboreal marsupial surveys) employed in the research. The SuperSites Data Portal Team creates metadata records from this Data Deposit Form for publication or archiving on the SuperSites Data Portal. This ensures that metadata conforms to EML, and is stored in the Metacat2 repository along with the data. Metacat is part of the Knowledge Network for Biocomplexity (KNB)3, a network intended to facilitate the discovery, access and interpretation of complex ecological data. 1.1 Guidelines for Data Deposit Form Completion The aim of this form is to enable the systematic collection of metadata describing the context of the research as well as the data. This ensures that SuperSites provides quality support to researchers who wish to publish their data to the SuperSites Data Portal. We welcome constructive suggestions and comments to enable us to evaluate and improve our processes, services and resources. A simple guide is provided below to help you navigate this form. Are you depositing a new data package? Please complete the entire form (pages 2-19). Are you reviewing metadata associated with a recently submitted Data Package? Please edit directly into text fields using tracked-changes (pages 2-19). Are you amending the content of an existing (published) data package? Please answer the questions below: Data package title and reference number? 1 Ecological Metadata Language (EML) is a metadata standard developed for the ecology discipline. Sponsored by https://knb.ecoinformatics.org/ 2 Metacat is the repository where data and metadata are stored. 3 https://knb.ecoinformatics.org/ F-2 SuperSites Data Deposit Form Data table title and reference number? Temporal coverage of Data Package? Location of proposed amendment: in table or metadata? What is the proposed amendment? Reason for proposed amendment? 2. DATA PACKAGE TITLE The data package title provides a description of the data package that is long enough to differentiate it from other similar Data Packages. SuperSites uses the following format: [Short description of data)] [,] [SuperSite Name)] [,] [Node] [,] [Location] [,] [Year] 3. DATA CREATORS Enter the following details for EACH Data Creator. Please add tables as required for additional Data Creators. The Data Creator is a person, an organisational role or an organisation who collected or produced the data. Data Creators will appear in the citation for the published data. Please note that this information will be published to the SuperSites Data Portal. At least one Data Creator is required. Salutation: Name: Position name: Organisation name: The following contact details are optional. Do you authorize the inclusion of the following details on the SuperSites Data Portal (if no, SuperSites Data Portal Team will not publish these details)? Yes No Address (w): Phone (w): Email (w): F-3 SuperSites Data Deposit Form 4. ABSTRACT The following rules should guide the process of compiling the Abstract field in the metadata record: It is advisable to complete the abstract field last, so as to enable collation of the key facts from the rest of the metadata elements. Use active voice and past tense. Use short complete sentences. The abstract should be approximately 200-300 words in length The abstract can be descriptive of the data and the research. Briefly outline the relevant project or study including the contents of the data package. Include geographic location, the primary objectives of the study, what data were collected (species or phenomena), the year/ year range the data was collected or compiled in, and collection frequency if applicable. Describe methods, protocols, techniques or approaches only to the degree necessary for comprehension. Avoid excessive detail. If spelled on the first occasion (with acronym or abbreviation in brackets), the abbreviated form may be used thereafter. Links to webpages (such as SuperSites Plot Network homepages) are encouraged. Only citations of published documents are permitted in the abstract. Use of Harvard Citation Reference List Style is required. Any Data Packages with citations included in their abstracts associated with unpublished material such as journal papers will be embargoed. F-4 SuperSites Data Deposit Form 5. CONTACTS FOR QUESTIONS ON THE USE AND INTERPRETATION OF DATA Enter the following details for EACH person, organisational role or organisation who should be contacted by the general public with questions about the use of or interpretation of the data package. Please add tables as required for additional Contacts. If the content is the same as that provided for the Creator in Section 3, please enter ‘name’ only in this field. Please note that this information will be published to the SuperSites Data Portal. Salutation: Name: Position name: Organisation name: Address (w): Phone (w): Email (w): 6. PROJECT INFORMATION AND DATA OWNERS 6.1 Project Title If the data has been collected as part of a larger umbrella research project, list any projects that need to be referenced as part of the publication. 6.2 Data Owners (Data Providers) A Data Owner/ Data Provider can be a person, an organisational role or an organisation who has a statutory and operational authority over data. The term Data Owner is synonymous with the term “Data Provider” used in TERN Data Provider Deed (Version 1.5). Specifically, the Data Provider “warrants to the best of its knowledge and belief that it is the owner of the Data” (as per Clause 2b of the TERN Data Provider Deed). The Data Owner/ Data Provider has the authority to sign TERN Data Provider Deeds and authority to grant (or deny) permissions to share and access data. This section should be completed as per the TERN Data Provider Deed. If the content is the same as that provided for the Creator in Section 3, please enter ‘name’ only in this field. Please add tables as required for additional Data Owners. Please note that this information will be published to the SuperSites Data Portal. If the Data Owner is a person or an organisational role, please provide details: (if the Data Owner is an organisation, please provide Position and Organisation Names): F-5 Salutation: Name: Position name: Organisation name: SuperSites Data Deposit Form The following contact details are optional. Do you authorize the inclusion of the following details on the SuperSites Data Portal (if no, SuperSites Data Portal Team will not publish these details)? Yes No 6.3 Address (w): Phone (w): Email (w): Data Provider Contact The signatory of the TERN Data Provider Deed (Data Provider/Data Owner) is required to nominate a Data Provider Contact who assumes overall responsibility for working collaboratively with the Data Portal Team to aid in the preparation of metadata, assist the team with any queries arising during the publication or archiving process and provide approval to publish Data Packages. The TERN Data Provider Deed defines the ‘Contact’ as an individual nominated to liaise with the Facility or data users. It is assumed that the Data Provider Contact is acting on behalf of the Data Owner, and consequently is responsible for granting (or otherwise) permissions to share and access mediated data. This section should be completed as per the TERN Data Provider Deed. Please provide the following details for the data Provider Contact. If the content is the same as that provided for the Creator in Section 3, please enter ‘name’ only in this field. Please add tables as required for additional Data Provider Contacts: Salutation: Name: Position name: Organisation name: The following contact details are optional. Do you authorize the inclusion of the following details on the SuperSites Data Portal (if no, SuperSites Data Portal Team will not publish these details)? Yes No Address (w): Phone (w): Email (w): 6.4 Nominated Representative Although the Data Provider Contact assumes responsibility for working collaboratively with the Data Portal Team to aid in data-related issues, on occasions the Data Provider Contact may elect a Nominated Representative to undertake liaison on their behalf. That person assumes responsibility for working collaboratively with the Data Portal Team to aid in the preparation of metadata and assist in overcoming issues; however, the Nominated Representative cannot authorise access to published data and cannot approve the publication of data. The overall responsibility remains with the Data Provider Contact and consequently, access to mediated data requires the approval from the Data Provider Contact, as does the process of approving the content and quality of each data package for publication. Please provide the following details for the data Nominated Representative. Please add tables as required for additional Nominated Representative: F-6 SuperSites Data Deposit Form Salutation: Name: Position name: Organisation name: Address (w): Phone (w): Email (w): 7. FUNDING SOURCES List the project names and all significant funding sources under which the data has been collected for the entire lifespan of the project. TERN-funded projects will be published with the following caveat: Since 2012 this project has been part of the Australian SuperSites Network (SuperSites). SuperSites is a Facility within the Terrestrial Ecosystem Research Network (TERN). TERN is supported by the Australian Government through the National Collaborative Research Infrastructure Strategy. 8. ASSOCIATED PARTIES Enter the following details for EACH Associated Party. An Associated Party is a person, an organisational role or an organisation who has had an important role in the creation or maintenance of the data. These may include parties who grant access to survey sites as landholder or land manager, or may have provided funding for the surveys. Associated Parties do not appear in the citation displayed on the SuperSites Data Portal webpages. As such, parties who have had a significant role in the creation of the data may be more appropriately listed as Data Creators, but preferably not both. Decision rules regarding whether a party is a Data Creator (and therefore part of the citation) or an Associated Party are similar to the principles used to decide if a person is an author of a peer-reviewed journal paper or if they will receive an acknowledgement in the paper. Please add tables as required for additional Associated Parties. Please note that this information will be published to the SuperSites Data Portal. If the Associated Party is a person, an organisational role, or an organisation, please provide details: Salutation: Name: Position name: Person or organisation’s role in creation or maintenance of the data: Address (w): F-7 SuperSites Data Deposit Form The following contact details are optional. Do you authorize the inclusion of the following details on the SuperSites Data Portal (if no, SuperSites Data Portal Team will not publish these details)? Yes No 9. Phone (w): Email (w): KEYWORDS AND SUBJECT CATEGORIES ANZSRC-FOR Codes: Australian and New Zealand Standard Research Classification – (See Australian Bureau of Statistics for more details at http://www.arc.gov.au/pdf/ANZSRC_FOR_codes.pdf). At least ONE four digit code is required. Select as many as applicable. 0501 Ecological Applications 0604 Genetics 0502 Environmental Science and Management 0605 Microbiology 0503 Soil Sciences 0606 Physiology 0599 Other Environmental Sciences 0607 Plant Biology 0601 Biochemistry and Cell Biology 0608 Zoology 0602 Ecology 0699 Other Biological Sciences 0603 Evolutionary Biology 0705 Forestry Sciences Other If an applicable code is not in the table above, please enter the appropriate one at ‘Other’. You can also check the codes at: http://www.abs.gov.au/Ausstats/abs@.nsf/Latestproducts/4AE1B46AE2048A28CA25741800044242?open document Keywords List: Enter any keywords that best describe the subject of your data package. A minimum set of Keywords is indicated in brackets. You can use the Olsen, L.M. et al. (2007) NASA/Global change Master Directory (GCMD) Earth Science Keywords. Found at: http://gcmdservices.gsfc.nasa.gov/static/kms/sciencekeywords/sciencekeywords.csv [SuperSite name], [SuperSite node if applicable], [vegetation/soil/fauna], F-8 SuperSites Data Deposit Form 10. GEOGRAPHIC COVERAGE Please provide a general description of the geographic area in which the data were collected. This can be a simple place name (e.g. Great Western Woodlands, Western Australia) or a fuller description (e.g. Great Western Woodlands, Credo Station, 110 km NNW of Kalgoorlie, WA) 10.1 Bounding Coordinates In accordance with the TERN Data Provider Deed, the Data Provider warrants to the best of its knowledge and belief that the data does not contain any restrictions such as confidentiality, privacy/personal information, sensitive data issues or other restrictions which affect the use of the data. For the purpose of referencing the geographic coverage of each research site for use on the SuperSites Data Portal, the SuperSites Data Portal Team will apply a bounding box to the co-ordinates in the form of a 500 metre buffer. The SuperSites Data Portal Team will transform UTM coordinates (both Australian Map Grid (AMGs) and Map Grid of Australia (MGAs)) into decimal degrees (GDA94). Please always include the datum and the zone, if applicable. North South East West Datum 11. TEMPORAL COVERAGE Please indicate the dates that events were observed on. Wherever possible, Data Packages should be published as annual or seasonal tranches (i.e. 2012, 2013 or Summer 2012/2013). Please select a single point in time or a date range. Single point in time Year only (YYYY) Year, month day (YYYY-MM-DD) Date range Start date (YYYY or YYYY-MM-DD) End date (as for start date) 12. TAXONOMIC COVERAGE AND CLASSIFICATION F-9 SuperSites Data Deposit Form The SuperSites Data Portal Team will extract a species list from your data. Please specify which variable contains species information. Classification System: Although the use of current taxonomies is not necessary, we do, however, require that the correct nomenclature is adhered to and that spelling errors are amended in both the data tables and the metadata for taxonomic coverage. Ideally, species should resolve to a current taxonomic classification but in circumstances where alternative or older taxonomies are adopted, the classification system should be noted below. 13. METHODS AND SAMPLING INFORMATION Describe EACH method step you followed in the field to implement the specific measurement protocols and set up the design of the study. Repeat the details for each Method Step as often as necessary to describe the study design and your field activities. Please be specific to the data tables supplied rather than providing a high level description of the overall study. In circumstances where a formal sampling design was employed, please provide details. Please add tables as required for additional steps.. Method Step 1 Method Step Title: e.g. plot set-up Method Step Description: Instrumentation Details: Method Step 2 Method Step Title: e.g. data collection Method Step Description: Instrumentation Details: Method Step 3 F-10 SuperSites Data Deposit Form Method Step Title: e.g. documentation Method Step Description: Instrumentation Details: 13.1 Study Extent Description This information supplements the coverage information you may have provided at the Geographic, Taxonomic and Temporal Coverage fields above. Include any other information that may be required to understand the Data Package, such as specific sampling area and the sampling frequency (temporal boundaries, frequency of occurrence). 13.2 Sampling Description Describe how the primary field sites were selected and how the areas within those sites were chosen to implement the measurement protocols. Where a formal sampling design was used please also describe this. Where appropriate also provide information about how any study treatments were allocated (e.g. logging practice, grazing intensity). 14. INTELLECTUAL RIGHTS Usage rights define who is able to access your data package. Please note that TERN-funded data are expected to be made freely available subject only to unavoidable considerations of ethics, privacy, data sensitivities and any other justifiable or necessary legal requirement. Usage rights are entered in accordance with the licensing options selected in the TERN Data Provider Deed. A signed TERN Data Provider Deed must be completed and on file which covers the data package content before the LTERN Data Portal Team can publish it to the SuperSites Data Portal. The SuperSites Data Portal Team will liaise directly with Data Owners/ Data Providers to complete this step. Please refer to the TERN Data Licensing Policy for more information about data licensing principles and options. This section should be completed as per the TERN Data Provider Deed. TERN funded SuperSite milestone deliverables will, by default, be allocated the TERN Attribution-Share Alike v1.0 licence. F-11 SuperSites Data Deposit Form Data Provider Terms Select one licence TERN Attribution- Share Alike v1.0 (You allow others to distribute derivative works only under a licence identical to the licence that governs your work (includes Attribution)). Special licence agreement – select other combinations of special licence terms (consistent with the open access goals of TERN) – details to be provided below: 14.1 TERN Data Provider Deed Special Conditions Where applicable, special conditions are entered in accordance with those identified in the TERN Data Provider Deed. This data is currently being used for research into ABC. We anticipate that the data will be released by x/y/20zz. In the meantime, we can release this data for purposes that do not overlap with our current research. Please request access here. The data will be released when Quality Control/Quality Assurance procedures have been completed, this will be achieved by INSERT DATE. Please request future access here. Creative Commons Australia licences see http://creativecommons.org.au/learnmore/licences Other (please specify): 15. DATA PREPARATION GUIDELINES Standardised data spread sheet formats for a range of monitoring data has been distributed to SuperSites. If you require a copy please email the SuperSites Coordinator. These spread sheet formats are continually under review so all comments and suggestions are appreciated. Please note that although the SuperSites Data Portal Team is not currently resourced to process or extract data files for you, they are happy to assist you in the process. FILES 1. Data should be in a raw format and provided at the level at which it was collected. Summaries, including aggregated data should not be provided in the data package. 2. As much of the data as possible should be provided in the form of a standalone data table (i.e. a single worksheet). 3. If you have multiple tables, each table should include a column that allows those tables to be linked unambiguously. COLUMNS 1. Columns should contain a single variable rather than include observations from a mix of variables (e.g. don’t incorporate values of both season and year in the one column). 2. Use consistent format within a column (i.e. all numeric or character string). 3. Where specific nominal data is replicated across multiple tables (e.g. Location ID), please ensure that the column headers are consistent in each of the data tables within your Data Packages. 4. Check for and repair data inconsistencies (i.e. species name and common name used interchangeably). F-12 SuperSites Data Deposit Form 5. 6. 7. 8. 1. 2. Check for and repair duplicate variable names, (especially those due to software induced truncation). Use descriptive variable names. Missing values must be clearly identified using a consistent value (NA). Reasons for missing values should be noted (i.e. not observed, removed, censored).ROWS Check for typographical errors, missing values and for duplicate entries. Check for and repair unusual characters in data values. These can be a result of software (such as MS Word) which encode characters like apostrophes with symbols. 3. Data in SI units is preferred. 15.1 Saving the Data Tables Please save each data table in plain text comma separated format (*.csv) using the following naming convention. asn_[supersite code]_[node]_[data type]_[optional genera]_[optional species]_[optional location]_[YYYY or YYYY-YYYY or YYYYMMDD] Use lowercase with underscore. Location is optional Example: asn_fnqr_robson_weather_2013.csv 15.2 Data Table Descriptions Please complete a separate Table Description for EACH data table supplied as part of the data package. Please include descriptions of all columns in your data tables (e.g. don’t omit the site ID column or locational coordinates if these are included in a table). The column descriptions should relate to the measurement protocols described in the Contextual Metadata. F-13 SuperSites Data Deposit Form 16. DATA TABLE METADATA Data Package Reference Number and Names Data Table Reference Number and Names Data-table Content. This is an example only. Please submit table description files. Variable Variable Description Measurement Standard Unit of Scales. Select Measurement or Value one: Nominal, Labels Ordinal, Interval, Ratio Date-Time Examples Table ltern_example_YYYY.csv Year SiteName MeanMaxWind Reproductive phase Year Name of site Mean maximum wind Bud, flowering, fruiting Date-Time Nominal Interval Nominal Captures Total captures over 3 nights of trapping Continuous (ratio) F-14 SuperSites Data Deposit Form YYYY N/A km/hour a=bud b=flower c=fruiting Capture units Issue Description/ Reference and Suggested Amendments Depositor Response 17. SUBMITTING THE DATA PACKAGE TO SUPERSITES The simplest way to send your data tables and metadata to the SuperSites Data Portal Team is to email them to shiela.lloyd@jcu.edu.au Ensure all files are saved as plain text comma separated value format (.csv) as described above, and zipped together per Data Package. Also include this reviewed and completed word document in the zip file. 18. FURTHER ASSISTANCE For further information and assistance email the SuperSites Data Portal Team at info@tern-supersites.net.au. Any feedback about this form or the information provided would be appreciated. F- 15 SuperSites Data Deposit Form