Data Management Infrastructure

advertisement
OceanObs’09 Community White Paper Proposal
Meta-data Management in Global Distributed Observation
networks (tentative title)
Lead author:
Derrick Snowden
NOAA Climate Program Office, Climate Observation Division
1100 Wayne Avenue, Suite 1202
Silver Spring, MD 20910 USA
Derrick.Snowden@noaa.gov
Contributing authors:
Mathieu Belbeoch
JCOMM Observing Platform Support Center
8-10, rue Hermes
Parc Technologique du Canal
31526 Ramonville cedex
France
belbeoch@jcommops.org
Bill Burnett
NOAA National Data Buoy Center
1007 Balch Blvd.
Building 3203
Stennis Space Center, MS 39529-6000 USA
Bill.Burnett@noaa.gov
Ted Habermann
NOAA National Geophysical Data Center
325 Broadway
Building DSRC
Boulder, CO 80305-3328 USA
Ted.Habermann@noaa.gov
Hester Viola
JCOMM Observing Platform Support Center
8-10, rue Hermes
Parc Technologique du Canal
31526 Ramonville cedex
France
viola@jcommops.org
OceanObs’09 Community White Paper Proposal
Extremely raw bullets of topics to consider
** note that suggest we don’t talk about metadata recovery here at all, but focus on
how to avoid the mistakes of the past and improve for the future. **
This paper will describe the fundamentals of metadata, existing management
systems used, how and why these management methods could be improved and the
long term vision of metadata management across the global in-situ ocean observing
system.
Some of the elements of current metadata management systems will be described.
These systems have been designed based on weather forecasting requirements for
example the BUFR/CREX, ODAS, Pilot Projects such as Meta-T which until now
have not been standardized. Climate applications require more accurate data
metadata than weather forecasting and it is becoming apparent that the largest
uncertainties in the global ocean observation record can be traced to poor metadata.
This paper will describe different types or layers of metadata which are important
within the data flow for a global observing network.
 Sensor
 Measurement and/or Observation
 Instrument/platform/site
 Product (e.g. GTS Message or data/file or derived product or analysis or
climatology etc) layer (collection of same or different observations)
Metadata management for climate purposes must apply to individual sensors as well
as collections of observations and derived products. An explanation will be given of
examples of these different levels of metadata and discuss the different requirements
for metadata of stakeholders, for instance platform operators, program managers,
monitoring centers, data users
Some recent examples will be used to demonstrate how proper records of these
layers of metadata could have helped to avoid data issues and loss of utility of
datasets. These examples, which are specific to individual data flows highlight that
metadata must be managed through the lifetime of the observation (e.g. in the case
of recalibration, data quality assessments, modifications and flagging of
observations), at many distinct points in the data flow, not just at the time of
measurement. They also highlight that there is aneed to introduce better metadata
management to avoid similar issues in the future.
More general metadata requirements will then be discussed which relate to
the full data flow for ocean observations, and what is missing at the moment which
impacts data users and those monitoring observation programs (inc. network status
and data quality).



Challenges?
Assessment of what’s missing for a better system, what are the main
metadata gaps
What needs to be done to identify and then fill gaps operationally in Future?
o Requires More than a Pilot Project! I agree with what you said, ie that
if the same methods to collect metadata are used in the
implementation of the Meta-T proposals (i.e. relying on people to enter
the required information manually and sometimes into multiple
OceanObs’09 Community White Paper Proposal







systems) and as those have not been successful in the past - then
the methods proposed will not improve the situation
perhaps META-T’s main output will be this “gap analysis”
What’s the vision for meta-t and climate data users & long term –
sensor ML, smart/self reporting platforms,
operational metadata sharing based on what’s needed by users at all levels
(ability to opt into certain levels of metadata in the hierarchy),
unique identifiers for all measurements and groups of observations,
metadata reporting mechanisms enforced by international/regional/national
resolutions(? Like the Argo Notification requirement) with coordination of
standards and data content across multiple national efforts that collectively
make up GOOS
Remote and often expendable instruments that are never serviced after
deployment.
Download