Interoperability: Where the irresistible force of flexibility meets the immovable object of standardization. DRH Conference 2003 Chris Turner Leaders Project UCL SLAIS Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Conflicting needs • Flexibility in encoding – Wide range of source materials – Existing practices • Standardisation – Reusability – Support – Sustainability Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Interoperability ‘to be interoperable, one should actively be engaged in the ongoing process of ensuring that the systems, procedures and culture of an organisation are managed in such a way as to maximise opportunities for exchange and re-use of information, whether internally or externally’ Miller, Paul. ‘Interoperability. What is it and Why should I want it?’, Aridane, Issue 24 (21 June 2000) http://www.ariadne.ac.uk/issue24/interoperability/intro.html Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources 6 Aspects of Interoperability • • • • • • Technical Semantic Political/Human Inter-community Legal Informational Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Technical Interoperability • the ‘plumbing’, or what goes on out of sight to make documents accessible. – http – tcp/ip – etc Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Semantic Interoperability • Works on different levels and enables us to understand and transform documents – Syntactic level • XML rules for well-formedness provide a standardized structure for files – Semantic level • Common convention for naming parts of a document – DTD/Schema – Structural level • Determines how parts of a file relate to each other – the content models described by a DTD/Schema – Content level • What does the document actually say. E.g. Index terms for searching, and lists of descriptors Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Interoperability of TEI and EAD documents • they can be moved between computers and accessed between computers • being XML they have a recognized syntactic structure • they use common tag libraries, and thus have consistency of naming. • the content models of the DTDs offer commonly defined structures • the DTDs offer some pre-defined attribute values. Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Flexibility of TEI/EAD • A goal of EAD is to make archival resources from many institutions accessible to users. To achieve this goal, EAD must accommodate a wide range of internationally divergent descriptive practices (EAD Working Group, 2002). • It is important to remember...[that] many -perhaps most -- serious TEI applications have found it necessary to build their own customization of the full scheme in some way (Burnard, 2000). Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Consequence No two documents created in EAD or TEI need have the same structural or content markup. – Limits possibilities for data exchange and reusability – Makes consistent search and retrieval and presentation of results difficult Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Solution approach • Determined by project aims – ‘Document-centric’ – focus is on retrieval and presentation of TEI encoded transcripts and images • EAD used for metadata to facilitate search and retrieval • EAC used for contextual information – Reusability of files – Hospitality to existing files? Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Contextual presentation Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Metadata in EAD file item description in Leaders-EAD to harvest index terms and description from EAD documents resulting index document Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Controlled terms for searching Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources TEI - descriptive markup • Pizza Chef approach helpful • Wide range of formats of archival documents • Descriptive/procedural distinction • But just one DTD. • Modify stylesheets Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Examples Example EAC document Example TEI document Application example Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources TEI DTD modification TEI for Archives DTD Abbreviations entity file Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Re-use through Web Services • Standardisation of encoding makes possible standardised application functions. • Application functions described by WSDL (Web Services Description Language) file • Means application developers can incorporate LEADERS functions into their own applications. Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Conclusions • Trade-offs – EAD: stricter control vs loss of hospitality to legacy data and variation in existing practices. – TEI: a single DTD but hospitality to a potentially wide range of source materials vs less reusability of stylesheets. • Paradox – More standardisation of encoding vs more flexibility in terms of interoperability. Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources