Data Publishing - A Modest Proposal

advertisement

Node Publishing – A Modeƒt Proposal

Or, “Everything should just flow from here”

1

Curtis Cude

Data Exchange Specialist

(503) 229-6086

Information Services

Oregon DEQ

April 26, 2007

We’ll discover how publishing can enhance the profile of environmental information.

Basic publishing described by example

Optimizing data discovery and retrieval

2

Future of publishing

3

PNW WQX, completed early 2005, demonstrated the feasibility of many important EN concepts.

4

The hybrid centralized/decentralized approach includes data submission and publishing flows

Data Access

Application

Washington

Data

Source

Network

Node

.xml

.xml

Network

Node

Idaho (planned)

Data

Source

INTERNET

.xml

Oregon

Data

Source

Network

Node

Exchange Host

Host

Database

Network

Node

Data

Catalog

.xml

.xml

.xml

Node

Client

Non-node Partner

Data

Source

Network

Node

Alaska

Data

Source

Instead of sending the same old oversized query response, optimize your publishing schema with

5

Data discovery and query costing can be built in to the data flow.

6

One way to enhance data discovery is to implement nested queries.

Providing Organization

Project

Monitoring Station

Field Event

Result

Sample Analysis

GetProjects

GetStations

GetMeasurements

A Data Catalog can provide summary information for the sources underlying the flow.

7

Station lists classified by station type can allow customers to geographically focus queries.

8

Data Catalog services can tell the customer what data they might find at a particular location.

http://deq12.deq.state.or.us/pnwwqx/search.aspx?name=Colum bia River at Bonneville Dam&prov=Oregon DEQ

9

Data Catalog services can also provide data for query costing purposes.

Customer enters search parameters

GetDCReferenceDataCount

Query/solicit using these parameters is first sent to Data Catalog

Your query may return up to 29,763 results.

Do you wish to continue?

Continue Refine Cancel

Estimated size of response returned

GetMeasurements

“Continue” routes query/solicit to nodes(s); other options reduce node server loading

10

The future of EN data publishing includes more, varied data partners and applications.

Parties other than EPA and ECOS should be at the table to increase partners within EN.

11

As node technology improves, opportunities for data publishing rather than transmission improve.

12

Data publishing will help us connect EN data with other types of data and other networks

Applications consume “our” info via web services, adding value and support.

13

EPA Region 10’s STORET.org consumes Stations

List and links back to PNW application

Applications are being developed to consume, warehouse and analyze specific data.

14

Beyond charting, QC and statistical analysis tools could also be developed.

In summary, data publishing builds partnerships with other providers and environmental information clientele.

Enhanced data partnerships mean greater data availability.

More data means more incentive to build value-adding tools.

More network tools mean more customers and greater support.

15

Questions?

Curtis Cude (503) 229-6086

Download