Introduction to Metadata Lynda Wayne GeoMaxim / FGDC FGDC CAP Kickoff Meeting Denver, CO September 28-29, 2004 1 Introductions Name Organization Metadata Experience Workshop Expectations 2 Objectives After the workshop, students can: develop a metadata template that uses a range of mandatory, conditional, & optional CSDGM elements effectively read and comprehend metadata make the business case for metadata discriminate between minimal and quality metadata locate and access online resources 3 What IS Metadata? Data ‘reporting’ WHO created the data? WHAT is the content of the data? WHEN was it created? WHERE is it geographically? HOW was the data developed? WHY was the data developed? 4 What IS Metadata? title time period supplemental information author abstract sources (file) size 5 What IS Metadata? entity attributes view actual metadata record 6 Let’s Make Metadata Turn to your neighbor and document the following: Title (name) Theme Keywords (work, play, life) Supplemental Information 7 Value of Metadata Preserves investments in geospatial data development people forget…………..people move on 8 Value of Metadata Serves as a valuable resource to data use and analysis 9 Value of Metadata Provides data consumers & stewards: • a common language • a context for data resources • needed information about the data Projection? Albers, I think? Attributes? Yes Time Period? I’ll have get back to you… 10 Value of Metadata Supports data management • maintenance and update • project status • value assessment / cba • project estimates • deliverable and performance measures 11 Value of Metadata 12 Who creates metadata? Officially Federal organizations producing digital geospatial data Federally funded projects that produce digital geospatial data Morally and practically Anybody that creates digital data It’s the right thing to do! 13 Why metadata? Be a part of the BIG picture Federal Geographic Data Committee (FGDC) - coordinates national geospatial data development National Spatial Data Infrastructure (NSDI) - creation and distribution of seamless national geospatial data Global Spatial Data Infrastructure (GSDI) - creation and distribution of seamless global geospatial data 14 Why metadata? National Spatial Data Infrastructure Old perspective…………………. ……………………………….New perspective 15 Why metadata? NSDI Framework Data - cooperatively-designed data dictionaries for seven key ‘reference’ themes • Elevation and Bathymetry • Hydrography • Transportation • Geodetic Control • Governmental Units • Cadastral • Orthoimagery 16 Why metadata? NSDI Geospatial Data Clearinghouse - international network of metadata distribution ‘nodes’ user query available metadata records internet Clearinghouse • FGDC • EROS • ESRI • NRCS • NOAA CSC • ALASKA GDC nodes international GINA AGDC ASGDC Anchorage state / local national 17 Why metadata? NSDI Geospatial One Stop metadata and links to data webmapping services data acquisition plans data category communities …..fast tracking the NSDI 18 Why Metadata? National Map Geospatial One-stop coordination policies • standards • partnerships FGDC 19 How Do I Create Metadata? the metadata gospel… It’s not pretty It’s not easy But sure is thorough….. 20 CSDGM 1. Identification Information General bibliographic information including: title, originator, data contact, status, date, time period of content, abstract, purpose, keywords, geographic location 2. Data Quality Information Lineage and data assessments sources, process methods, accuracy, data processing contact 21 CSDGM 3. Spatial Data Organization Data format: vector, point, raster 4. Spatial Reference Information Coordinate system parameters: horizontal / vertical coordinate system, projection, datum 22 CSDGM 5. Entity and Attribute Information Database design entities, attributes, domains, description of data values 6. Distribution Information How to acquire the data distribution contact, available formats, online distribution website, costs 23 CSDGM 7. Metadata Reference Information General information about the metadata record itself metadata contact, metadata standard used, metadata creation date, metadata review date 24 CSDGM Elements 25 CSDGM Elements 26 CSDGM Elements 27 CSDGM Elements Turn to page 35 in the Green Book 28 CSDGM Elements 29 CSDGM Elements Calendar Dates YYYYMMDD Time of Day HHMMSSSS Coordinates Lat/Lon Decimal Degrees Network addresses & file names Service://hostname:port/path/filename 30 CSDGM Production Rules 31 CSDGM Production Rules 32 CSDGM Production Rules 33 CSDGM Production Rules 34 CSDGM Production Rules 35 CSDGM 36 CSDGM Section One: Identification Elements you use to ‘shop’ for a data set of interest: WHO ? title, originator, publication date WHAT ? abstract, keywords, native data set (software) environment WHERE ? geographic extent (mbr), browse graphic WHEN ? time period of content Availability? access/use constraints, status 37 CSDGM 38 CSDGM Section Two: Data Quality How was the data set developed? • Source Information • Data processing What checks were made of the data set? • Logical Consistency • Positional Accuracy • Attribute Accuracy 39 CSDGM 40 CSDGM Section Three: Spatial Data Organization Point, Line, or Vector? 41 CSDGM 42 CSDGM Section Four: Spatial Reference Information Coordinate System 43 CSDGM 44 CSDGM Section Five: Entity & Attribute Information Database Description 45 CSDGM 46 CSDGM Section Six: Distribution Information How can I get the data set? • Distribution Contact Information • Available Formats (digital and hardcopy) • Standard and Custom Order Processes • Online option • Offline media • Distribution Liability • Fees 47 CSDGM 48 CSDGM Section Seven: Metadata Information Elements that describe the metadata record itself: WHO wrote the metadata record WHAT metadata standard was used? WHEN was the metadata created/updated? Is the metadata available? metadata access and use constraints 49 CSDGM 50 CSDGM Supporting Sections Eight: Citation Information Title, Originator, Publisher Nine: Time Period Information Date and Hour Ten: Contact Information Name, Job Title, Address, Phone, Email 51 CSDGM 52 Metadata Collection Tools Shareware corpsmet - USACE tkme / xtme - USGS/FGDC NPSmeta (ArcCatalog Extension) MetaD - ISO 53 Metadata Collection Tools GIS Internal ESRI ArcCatalog Intergraph SMMS for Geomedia 54 Metadata Collection Tools Software Commercial SMMS Data Logger Blue Marble 55 Metadata Collection Tools Forms hardcopy or online 56 Metadata Validation Tools mp - Metadata Parser Checks for CSDGM syntax Element names Mandatory elements Element content (domains and logical consistency) Embeds tags for NSDI Clearinghouse Distribution cns – Chew ‘N Spit Rectifies indentation Inserts capitalization and underbars FGDC ‘compliant’ metadata must pass mp! 57 Metadata Resources: The Greenbook An easy to use guide to implementing the CSDGM (is not the standard itself) Background and Descriptive Information CSDGM history, value of meta, formats CSDGM Elements definitions, domains, FAQs, common values Sample Metadata NWI and DGL Glossary of Terms 58 Metadata Resources: Online Resources http://www.fgdc.gov/metadata/ /online_resources tutorials metadata guidance websites software publications thesauri sources 59 Making Metadata Part of the Process New tools enable us to better integrate metadata creation into the data development process 60 Making Metadata Part of the Process If metadata were collected throughout the data process… 1. more accurate (no guessing) metadata 2. more details 3. better decision-making. 61 Making Metadata Part of the Process But what can I do to better incorporate metadata into the data development process? 62 step one: build adminstrative support Make the business case: • • • • • • preserve data investments limit data liability manage data resources find new data resources easier data transfer more efficient data distribution 63 step two: build technical support Emphasize individual benefits: • • • reduce workload..in the long term field fewer data inquiries document personal contributions 64 step two: build technical support Support your staff • • include in job descriptions & performance measures provide staff support tools training 65 step three: create organizational templates For each common data type: identify pertinent fields populate fixed fields standardized language distribution methods standards used build source and contact libraries 66 step four: distribute the effort map fields to the work flow establish and assign responsibilities technicians - lineage analyst - process methodology field scientists - accuracy assessments IT managers - tools, collection, management 67 step four: distribute the effort Data Planning: Section 1: Identification Info title theme keywords originator abstract purpose content time period 68 step four: distribute the effort Data Planning Section 3: Data Organization Indirect Spatial Reference FIPS codes, county monikers, etc Object-type point, vector, or raster 69 step four: distribute the effort Data Planning Section 5: Data Organization Entities and Attributes database design and configuration Highways name surface [asphalt, concrete, unpaved] year_built restrictions [haz, 2axle, flood] district [A, B, C, D, E] 70 step four: distribute the effort Data Processing Section 2: Data Quality completeness report missing or omitted data positional accuracy digitize rms error, GPS triangulation, survey/photogrammetry/IP method 71 step four: distribute the effort Data Processing Section 2: Data Quality lineage data set source files data compilation process maps photos dbases GIS 72 step four: distribute the effort Data Analysis Section 2: Data Quality attribute accuracy field checks, cross-checks, statistical analyses lineage data analysis process 73 step four: distribute the effort Data Analysis Section 3: Data Organizations object type number of points, pixels, lines Section 5: Entity and Attributes attribute values 74 step four: distribute the effort Data Analysis Section 7: Metadata Reference metadata contact and dates 75 step five: lead by example Managers Can WRITE metadata data planning fields title abstract purpose geographic extent keywords and many more… and manage the effort metadata coordination metadata enforcement 76 step six: policies and procedures Establish Policies: mandate use of standards and templates develop boilerplate metadata deliverable language for data contractors require units to publish their metadata publish metadata SOP to document policies and procedures 77