Management of Oceanographic time-series data at the Woods Hole Coastal and Marine Science Center Ellyn Montgomery USGS NERACOOS/NECOSP Data Management Workshop, Sept. 26, 2012 U.S. Department of the Interior U.S. Geological Survey USGS Coastal and Marine Geology Program This presentation applies only to Oceanographic measurements acquired by Woods Hole researchers and colleagues to address scientific questions The data is mostly used for validation of sediment transport and coastal process models Management of the data is part of the larger Data Life-cycle that includes everything from planning to archival and publication NECODP 2012 About our data Our holdings describe conditions at specific locations for limited durations. Except for a 25+ year program in Massachusetts Bay, we don’t do long term monitoring programs. Once the data files collected during an experiment are processed and QC’d that data is published (Federal requirement) Updates are made only if a problem is found in the data to correct that problem NECODP 2012 About our data II The data are stored in netCDF format files using EPIC conventions NcML is used to convert to CF conventions Files are delivered on-line via file download OPeNDAP using THREDDS FGDC metadata for each experiment is being created and harvested to ocean.data.gov Each file has ISO 19115 metadata (ncISO) NECODP 2012 How we manage our data - 1 Our raw data is included in the WHSC field activity tracking system We store 3 levels of data on local disks that are backed up nightly, with periodic backups made and stored off-site. These levels are : Raw data as off-loaded from the instruments Processed data (and stages leading to it) Published data As EPIC As CF NECODP 2012 Publication and access Our data may be published directly to the website after approval or with an Open File Report that documents the context of the measurements. We periodically check the website for broken links or non-responsiveness We add new interfaces as available (for example making KML files to use with GoogleEarth) NECODP 2012 Web access NECODP 2012 Web access NECODP 2012 THREDDS NECODP 2012 Metadata NetCDF allows metadata to be stored as attributes with the data in the file We use standards and conventions to enable interoperability We have GCMD DIF, FGDC and ISO19115 metadata in as many catalogs and registries as possible We enhance the metadata as new standards emerge (added Unidata Dataset Discovery terms in 2011) NECODP 2012 Data Management Successes Community Standards development has evolved to enable LOT of great functionality We are happy with how THREDDS allows remote clients to access data for analysis in real-time, without having to down-load files Clients like Matlab, IDV, ncBrowse and ncView allow access and visualization Registering our THREDDS catalogs in larger repositories of ocean measurements enhances discoverability NECODP 2012 Data Management Challenges Getting links to our holdings into the catalogs of the regional observatories Since our data is historical, it hasn’t really fit the observatory mission, but including links to historical measurements in the area would be useful for enriching potential analyses Identifying which of the emerging technologies will persist into the future Finding ways to let potential users know the data is available NECODP 2012 Take home ideas Use standards to enhance discoverability and enable use of community developed tools Enable web-based access so the users can access what they need without assistance Tailor the process to what your clients need to accomplish their missions Adapt your data and services to take advantage of new techniques as they evolve NECODP 2012 Web access The human-friendly interface to our experimental data, (EPIC conventions) is at: http://stellwagen.er.usgs.gov The computer-friendly (CF conventions) version of our data holdings, served by THREDDS at: http://geoport.whoi.edu/thredds/allts_reg_cat alog.html Contact me at: emontgomery@usgs.gov (508)457-2356 NECODP 2012