Universal Data Element Framework (UDEF) Overview GEIA – DHS Workshop Phoenix – April 11-14, 2005 Ron Schuldt April 13, 2005 UDEF Background CALS ISG - Developed UDEF EIA - Applied UDEF in the late 80s – early 90s Industry Ron Schuldt (Chair) - Lockheed Martin Barbara Barman (Vice Chair) - Raytheon Rob Bryant - DynCorp Ruey Chen - David Taylor Research Center Bob Hodges - Texas Instruments Neal McNamara - Analysis & Technology Inc. Bud Orlando - TRW Madelyn van der Bokke - ASEC George Walther - Lockheed Martin in the mid 90s and 2002 Industry Ron Schuldt (Chair) - Lockheed Martin Rick Lang - Texas Instruments Pam Stanfield - Lockheed Martin Gary O’Hara - Hughes Space and Comm Tony DiPerna - Ericsson Communications Ken McTee - Texas Instruments Cindy Hauer - Mevatec Corp Fred Bahrs - CMstat Corporation Doug Drury - ITT Federal Services Lee LeClair - Texas Instruments Government Norma Kornwebel - PM JCALS Dinah Beres - NAWC Steve Waterbury - NASA Government Deborah Cornelius - US Army Missile Cmd C. H. VanLandingham - NOAA Nat’l Wea Svc Today AFEI (formerly CALS ISG) holds the Intellectual Property Rights to the UDEF UDEF Recent Background UDEF Global Host Request For Information (RFI) by AIA and AFEI – Oct 2003 • Sought candidate global non-profits interested in hosting the UDEF tree structures • Responsible for managing extensions to the trees and for conducting global UDEF training and promoting adoption of UDEF • RFI response submitted jointly by CompTIA, OAGi and Contivo – later backed away for variety of reasons but all three are still interested • Draft Agreement and Performance Specification developed Once Global Host is Selected (Agreement Signed) • Pilot will be conducted to demonstrate that UDEF helps reduce costs of developing interfaces • Once pilot is completed successfully and the Performance Spec requirements are satisfied, then the non-profit will be given approval to formally launch the UDEF Global Host Problem – Global Perspective Each organization is attempting to set its own semantics standard Each must interface with organizations they do not control Elec Banks Chem Aero DoD NASA Trans Organization DHS Other Ship Retail The problem is the lack of common semantics and schema between organizations Problem – Supply Chain Perspective • Today’s reality… point-to-point solutions introduce excessive cost, complexity and time • Examples of gaps and/or inefficiencies: - Increasing number of unique portals adding cost to suppliers - Multiple, redundant, incompatible “portal systems” - Incompatibilities in information exchange contribute to delay, rework, and error • Excessive cost, complexity and time impeding supply chain agility Supply Chain Perspective Cost of a single interface ranges from $10K to $1M depending on scope and complexity ~ N (N-1) mapping problem & expense Industry Members Business Partners Contractor Government Customer Public Exchange OEM Tier-1 Supplier excessive cost, complexity and time Commercial Customer n-Tier Supplier Logistics Provider Problem – Internal Perspective Conflicting semantic overlaps between back-office systems OAGIS XBRL HR-XML Other XML Standards Legacy Data Though semantically equal, the following are 4 different XML tag names <PARTNUMBER>111-222-333</PARTNUMBER> <partNumber>111-222-333</partNumber> <PartNumber>111-222-333</PartNumber> <partnumber>111-222-333</partnumber> The Impact on Integration To interface two or more applications either within the enterprise or between enterprises ---• Middleware vendors build and sell “adaptors” for interfaces between major commercial applications (i.e., SAP - PeopleSoft) - As the version of a given application changes it typically drives the need for changes to the adaptors • Application interface developers (data architects) spend considerable time and effort analyzing the semantics of data that need to be exchanged between systems - Semantics analysis requires up to 35-40% of the requirements and design-time phase labor cost of building a given interface - To minimize the semantics analysis effort typically requires support (availability) from the “experts” of each application since most applications are poorly documented The Goal Reduce Requirements and Design-Time Phase Semantics Analysis Time and Cost Current Point-to-Point Approach --- n(n-1) Global Semantics Standard Approach --- 2n Global Semantics Standard 400 400 350 350 300 300 $$ 250 250 200 200 150 150 100 100 50 50 Savings 0 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 Derived XML Schema Requirements XML Schema “Structure” Requirements • All new XML Schema used or built by an organization must conform to W3C XML and XML Schema specifications » http://w3.org/TR/2004/REC-xml11-20040204/ » http://w3.org/XML/Schema#dev • Adopt global naming convention standard – ISO/IEC 11179-5 » http://isotc.iso.ch/livelink/livelink/fetch/2000/2489/Ittf_Home/PubliclyAvailableStandards.htm • Adopt United Nations XML schema naming and design rules » http://www.disa.org/cefact-groups/atg/downloads/index.cfm XML Schema “Payload” Requirements • Adopt global set of basic data representation terms and their definitions – e.g., Amount, Date, Name, Quantity » Tables 8-1 and 8-3 in United Nations Core Components Technical Specification (CCTS) – ISO 15000-5 • Adopt global and industry XML schema “payload” standards whenever possible before building organization unique XML schema – see example list on next slide Example “Payload” Standards • OAGIS – Open Applications Group http://www.openapplications.org/ » Participants - ERP and middleware vendors and end users » Example payloads – RFQ, purchase order, invoice, bill of material • HL7 - Health Care http://www.hl7.org/ » Participants – health care providers across the globe » Example payload – health records • ACORD – XML for the Insurance Industry http://www.acord.org/ » Participants – insurance providers across the globe » Example payload – insurance claim • XBRL – Business Reporting - Accounting http://www.xbrl.org/ » Participants – accounting firms across the globe » Example payload – financial reports to SEC • EIA-836 – Configuration Management Data Exchange and Interoperability http://63.249.145.5/836/default.htm » Participants – DoD and aerospace and defense industry (AIA and GEIA) » Example payload – engineering change Derived XML Schema Requirements XML Schema “Infrastructure Support” Requirements • Organization adopted and internally developed XML schema must be easily discovered for reuse – requires a centralized registry/repository • Use Universal Data Element Framework (UDEF) as means to index the points of intersection between various XML schema “payloads” and XML instances AND to improve data interoperability. UDEF directly supports and enhances an organization’s metadata management strategy. XML Schema “Standards” Architecture XML Schema “Infrastructure Support” Standards Cross Standard Semantics and Metadata Alignment – UDEF*, RDF, OWL XML Schema “Payload” Standards Organization Specific Implementation Conventions (subsets & extensions) OAGIS ACORD XBRL HL7 EIA-836* PLCS …. Others XML Schema “Structure” Standards ISO/IEC 11179-5, ebXML CCTS, UN Naming and Design Rules W3C – XML, XML Schema * Not quite ready for full adoption UDEF – Similar to Dewey Decimal System The Dewey Decimal Classification or DDC was invented in 1876 by a young American librarian, Melvil Dewey. 1. Divided human knowledge into ten major Classes 2. Subdivided these classes into ten Divisions 3. Division subdivided into ten Sections. 4. These sections subdivided, etc How to classify a book on Radio Short Waves 10 Main Classes 000 Generalities 100 Philosophy 200 Religion 300 Social Sciences 400 Languages 500 Science 600 Applied science 700 Arts 800 Literature 900 History 500 Natural sciences 510 Mathematics 520 Astronomy 530 Physics 540 Chemistry 550 Earth sciences 560 Paleontology 570 Life sciences 580 Botany 590 Zoology 530 Physics 531 Solid mechanics 532 Fluid mechanics 533 Gas mechanics 534 Sound 535 Light 536 Heat 537 Electricity 538 Magnetism 539 Modern physics 537.5 537.534 537.5342 537.5343 Electronics Radio waves Long waves Short waves UDEF – Scope Transactions – Structured Data • Purchase Orders • Purchase Order Changes • Purchase Order Acknowledgements • Purchase Order Change Acknowledgements • Invoices • Remittance Advice • Request for Quote • Request for Quote Response • Shipping Schedule • Etc. Heavily influenced by finance and procurement functions and ERP type applications Relevant open standards – X12, EDIFACT, ebXML, OAGIS, RosettaNet Collaboration – Unstructured Data • Mission Requirements • Concept of Operations • Specifications • Product Designs • Engineering Change Proposals • Trade-off Studies • Test Reports • Meeting Minutes • Plans • Schedules • Presentations • Etc. Heavily influenced by engineering, manufacturing and program management functions and PDM/Document Management type applications Relevant open standards – STEP, PLCS, EIA-836 UDEF transcends the transaction and collaboration worlds UDEF Built on Standards An Instantiation of ISO 11179-5 Naming Convention and Supports ebXML UDEF Object Class List • Entity • Document • Enterprise • Place • Program • Product • Process • Person • Asset • Law-Rule • Environment • Condition • Liability • Animal • Plant • Mineral ISO 11179-5 Naming Convention Data Element Name Object Class Term 0...n qualifiers + 1 or more required Object Class Property Term + 0..n qualifiers + 1 required Property Example UDEF-Based Data Element Names Document Abstract Text Enterprise Name Product Price Amount Product Scheduled Delivery Date Engineering Design Process Cost Amount ebXML Property List • Amount • Code • Date • Date Time • Graphic • Identifier • Indicator • Measure • Name • Percent • Picture • Quantity • Rate • Text • Time • Value • Sound • Video UDEF names follow the rules of English – qualifiers precede the word they modify ISO/IEC 11179 – Data Element Concept Object Class Property Representation Data Element Concept UDEF Maps Data Element Concepts Data Element Core Data Element Value Domain Application Data Element Creating UDEF IDs UDEF Trees 16 Object Class Trees 0 Entity 1 Asset 2 Document 18 Property Trees … t Order a Work b c Change Technical 1 Amount 1 Region d Purchase 4 … Code 33 … Type … … 66 Defined … Purchase Order Document_Type Code has UDEF ID = d.t.2_33.4 See http://www.udef.org/ Example Mappings Organizations cannot avoid multiple data standards ** Need global semantics standard ** PDM Sys A PDM Sys B Part No Part Num OAGIS 7.1 RosettaNet ItemX ProprietaryProductIdentifier 9_9.35.8 X12 (EDI) EDIFACT Product/Service ID Item Number STEP AP 203 xCBL Product ID PartID UDEF Universal Identifier Product(9)_Manufacturer(9).Assigned (35).Identifier(8) N (N-1) mapping effort instead becomes a 2N mapping effort Mapping Concepts to UDEF 1. Identify the applicable UDEF property word that characterizes the dominant attribute (property) of the data element concept. For example, Name, Identifier, Date, etc. 2. Identify the dominant UDEF object word that the dominant property (selected in step 1) is describing. For example, Person_Name, Product_Identifier, Document_Date, etc. 3. By reviewing the UDEF tree for the selected property identified in step 1, identify applicable qualifiers that are necessary to unambiguously describe the property word term. For example, Last Name 4. By reviewing the UDEF tree for the selected object identified in step 2, identify applicable qualifiers that are necessary to unambiguously describe the object word term. For example, Customer Person 5. Concatenate the object term and the property term to create a UDEF naming convention compliant name where it is recognized that the name may seem artificially long. For example, Customer Person_Last Name 6. Derive a structured ID based on the UDEF taxonomy that carries the UDEF inherited indexing scheme. For example <CustomerPersonLastName GUID=“as.5_5.10”> DHS Typical Example Data Element Concept Name United States Passport Number UDEF ID 1. Object Type or Role Object Class Property Type Property Identify the applicable UDEF property word that characterizes the dominant attribute (property) of the data element concept. For example, Name, Identifier, Date, etc. DHS Typical Example Data Element Concept Name United States Passport Number UDEF ID Object Type or Role Object Class Property Type Property Identifier 2. Identify the dominant UDEF object word that the dominant property (selected in step 1) is describing. For example, Person_Name, Product_Identifier, Document_Date, etc. DHS Typical Example Data Element Concept Name United States Passport Number UDEF ID Object Type or Role Object Class Document 3. Property Type Property Identifier By reviewing the UDEF tree for the selected property identified in step 1, identify applicable qualifiers that are necessary to unambiguously describe the property word term. For example, Last Name DHS Typical Example Data Element Concept Name United States Passport Number UDEF ID Object Type or Role Object Class Document 4. Property Type State Department Government Assigned Property Identifier By reviewing the UDEF tree for the selected object identified in step 2, identify applicable qualifiers that are necessary to unambiguously describe the object word term. For example, Customer Person DHS Typical Example Data Element Concept Name United States Passport Number UDEF ID Object Type or Role United States Passport 5. Object Class Document Property Type State Department Government Assigned Property Identifier Concatenate the object term and the property term to create a UDEF naming convention compliant name where it is recognized that the name may seem artificially long. For example, Customer Person_Last Name DHS Typical Example Data Element Concept Name United States Passport Number UDEF ID a.ck.2_2.13.35.8 6. Object Type or Role United States Passport Object Class Document Property Type State Department Government Assigned Property Identifier Derive a structured ID based on the UDEF taxonomy that carries the UDEF inherited indexing scheme. For example <CustomerPersonLastName GUID=“as.5_5.10”> UDEF Use with XML • UDEF derived intelligent ID is the “key” to systems integration • Mapping to UDEF is a rules-based approach to determine semantic equivalence Two different systems with need to exchange purchase order data System A (xCBL 3.x Format) UDEF Name Purchase Order Document_Identifier UDEF ID = d.t.2_8 System B (OAGIS 7.x Format) • Optional UDEF IDs applied only when aligning ANY two systems for the first time (design time) – stored as an alias within each system - Non-intrusive – does not require change to data element names • Leverage the UDEF Web Service to generate a gap analysis report http://xml.eidx.org:8080/udef/servlet/UdefReport1 Possible DHS-DoJ UDEF Example UDEF-Based Common Vocabulary Person Identifier 5_8 Person Last Name 5_5.10 Person First Name 5_4.10 Person Birth Date 5_51.6 Person Birth City Name 5_2.10.10 Local Police – Suspect Role Suspect Id aw.5_3.13.35.8 Local Police – Criminal Role Criminal Id av.5_3.13.35.8 Suspect Name aw.5_5.10 & aw.5_4.10 Criminal Name av.5_5.10 & av.5_4.10 Suspect Date of Birth aw.5_51.6 Criminal Date of Birth av.5_51.6 Suspect City of Birth aw.5_2.10.10 Criminal City of Birth av.5_2.10.10 A Metadata Managed Architecture Interfaces to Back-Office Systems Run Time EAI Vendors with Canonical Models UDEF-Indexed Metadata Registries Transformation Engines Internet Std Schema Global UDEF Registry Use Matrices Design Time Interface Developers • Data Dictionary Extend Matrices • Mapping Matrices • Std XML Schema Build/Extend Schema UDEF-Indexed Data Modelers Metadata Registry/Repository And Apps Developers Centralized metadata registry/repository • Enables reuse to reduce costs • Encourages standardization Software Vendors with UDEF ID APIs UDEF Change Board Web Public Sample Scenarios and Benefits Sample Scenarios Sample Benefits Application developer uses metadata repository and UDEF IDs to discover a data model containing 60% of the data for the new application Data model reuse – time and cost reduction Helps enable a corporate-wide common vocabulary Interface developer uses metadata repository and UDEF IDs to discover 35% of fields already mapped between two applications Interface reuse – time and cost reduction CEO wants to know state-by-state breakdown how many dollars in last fiscal year went to suppliers for a single program Provide answer in 4 hours rather than 5 days Interface developer uses automated tools that have been UDEF enabled to help build Java code that directly feeds the run-time EAI translation tool Reduce time and cost to automatically develop Java code from 4 hours to 1 hour A new supplier uses RosettaNet rather than the industry standard xCBL. The semantic mapping based on UDEF is provided to the supplier Reduce the supplier’s time and cost to complete the mapping. DoD End-to-End Procurement Mapping • AIA effort by Raytheon, Lockheed Martin and Northrop Grumman based on actual DoD procurement of a Raytheon spare part • Mapped all fields in five documents plus Central Contractor Registration to both the UDEF and to OAGIS 8.0 XML standard – RFQ, Response to RFQ, Purchase Order, DD250, Invoice • DD250 includes the 16 fields required to support UID UDEF Example Mappings Examples From DoD Purchase Order Mapped to UDEF and OAGIS 8.0 UDEF Name UDEF ID SysName DPASPriority OAGIS XML Path Name Purchase Order Document Government Assigned Priority Code PurchaseOrder/Header/Priority d.t.2_1.4.31.4 DateOfOrder Purchase Order Document Date PurchaseOrder/Header/DocumentDateTime d.t.2_6 IssuedByDODAAC Buyer Enterprise Defense Logistics Agency Assigned Identifier q.3_6.35.8 PurchaseOrder/Header/Parties/ SoldToParty/PartyID/ID • UDEF names are unambiguous • UDEF names cross-referenced and indexed by UDEF IDs to a table would allow interfaces to be built without being dependent on availability of the “system expert” Summary – Key Points • Adopt Key Global XML Schema Standards – especially … - ISO/IEC 11179-5 “Naming and Identification Principles for Data Elements” - ISO 15000-5 Core Components Technical Specification (especially Tables 8-1 and 8-3) • Work together to promote and adopt the UDEF – an Instantiation of ISO/IEC 11179-5 and ISO 15000-5 CCTS Table 8-1 and 8-3 For Additional Information ISO/IEC 11179 – Specification and standardization of data elements http://isotc.iso.ch/livelink/livelink/fetch/2000/2489/Ittf_Home/PubliclyAvailableStandards.htm UDEF.ORG Web Site http://www.udef.org/ Ron Schuldt – 303-977-1414 or ron.l.schuldt@lmco.com