Summarization of DDI Qualitative use cases

Revised 2010-05-27

This document is an attempt to aggregate the DDI qualitative use cases.

Qualitative Data
  Anything non-quantitative (ASSDA) (ICPSR) that can't be imported into Nesstar Publisher (ASSDA)
  Text files (ASSDA) (ICPSR) (Finland) (KU) (GESIS)
    Narratives (ICPSR) (Austria) (Cambridge) (Finland)
    Transcripts (Austria) (Cambridge) (Finland)
    Open text survey responses (verbatim) within a quantitative data file (ICPSR) (GESIS)
  Image files (ASSDA) (ICPSR) (Austria) (Finland)
  Audio files (ASSDA) (ICPSR) (Austria) (Cambridge) (Finland)
  Video files (ASSDA) (ICPSR) (Austria) (Cambridge) (Finland)
  Web pages (Finland)
  Parts of files (ASSDA) (Austria)
    Comments or memos to segments of text or to codes (ICPSR) (GESIS)
    Categories, codes, concepts linked to segments (Austria) (Cambridge)
    Segments can overlap (KU)
  Multimedia stimuli linked to qualitative/quantitative response (Austria)
  Maps (Austria)
  Interpretative and analytic accounts: case studies, "thematic analyses" drawing on different datasets or innovation reports (Cambridge)
  Case records and other interpretative accounts (Cambridge)
  Multimedia linked to physical objects, e.g. archaeological artifacts, sites (Cambridge)
  Web cams (Cambridge)
  Multimedia linked to dance/performances (Cambridge)
  Relationship to other specialized encodings (musical score, Labanotation)?
  Event data (subject, verb, object?) (GESIS)

DDI should facilitate?
  Ingestion (ASSDA) (ICPSR) (Cambridge)
    QuDex as ingestion intermediate format to METS, FOXML/METS, DDI (QuDex)
  Deposit (ICPSR)
  Storage (ASSDA)
  Preservation (ICPSR)
  Description (ASSDA)
    Controlled vocabularies (Austria)
  Search and browse (ASSDA) (ICPSR)
    Link to download / video streaming? (ASSDA)
  Processing (ICPSR)
    Transcription (Austria)
    Anonymization (Austria) (ICPSR)
  Dissemination (ASSDA) (ICPSR)
  Access control (ASSDA)
    Access control: current drawback of QuDex (ASSDA) (ICPSR)
    Different access controls, e.g., the data restricted-access but documentation openly downloaded (ICPSR)
    Secure system to store the data roster and direct identifiers (i.e. a pseudonym crosswalk file) separately from the research data (ICPSR)
  Preservation metadata for qualitative data that meet METS/PREMIS standards (ICPSR)
  CAQDAS/CAQUDAS (Austria)
  Core DDI / QuDex ontology (Cambridge)
  A LITE version for only textual social science data as a customization of TEI (Finland)

Metadata
  File level (ASSDA)
  Word to term mapping from text mining (KU)
  Term to variable and variable weights (scoring functions) from text mining (KU)
  Clusters of documents (segments) from text mining (KU)
  Stemming, stop word list, start word list, synonym lists, dictionary from text mining (KU)
  Concept links from text mining (KU)

Tools
  Depositor/archivist tool to describe files (ASSDA) (QuDex based) (ASSDA)
    Resources (files), memos, categories, codes
  Viewer plug-ins allowing attachment of memos, categories, and codes to segments (ASSDA)
  QuDex end user tool (ASSDA)
    Explore and view contents (ASSDA)

Uses
  Reproduce / extend research findings (ICPSR)
  Data transparency (ICPSR)
  Produce data and/or documentation files for qualitative studies (ICPSR)
  Storing output from Apache UIMA text analysis
    XMI format for CAS and SOFA (QuDex)

Model
  QuDex (ASSDA) (ICPSR)
  Fedora (ICPSR)
  FOXML (Cambridge)
  Fedora RELS ontology (Cambridge)
    More than 'isMemberOf', 'hasPart', 'isConstituentOf', 'isAnnotationOf' (Cambridge)
  Dublin Core annotations in a triplestore (Cambridge)

Needed Fields
  Data type: quantitative, qualitative, mixed (ICPSR)
  Pseudonyms required (ICPSR)
  Interview type: structured, semi-structured, open ended (ICPSR)
  Anonymization performed and by whom, method used (ICPSR) (Austria)
    Pseudonyms, abstract coding, removal or masking of blocks of text, other (ICPSR)
  Identification (GESIS)
    Source like date of published text, author, etc. (GESIS)
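
As an illustration only, the needed fields listed above could be collected into a single study-level record. The sketch below is in Python and is not a DDI or QuDex schema: every class name, field name, and the example value "archive staff" are assumptions made for this example. Only the enumerated values (quantitative/qualitative/mixed, the interview types, and the anonymization methods) come from the list itself.

```python
from dataclasses import dataclass, field, asdict
from enum import Enum
from typing import List, Optional


class DataType(Enum):
    QUANTITATIVE = "quantitative"
    QUALITATIVE = "qualitative"
    MIXED = "mixed"


class InterviewType(Enum):
    STRUCTURED = "structured"
    SEMI_STRUCTURED = "semi-structured"
    OPEN_ENDED = "open ended"


class AnonymizationMethod(Enum):
    PSEUDONYMS = "pseudonyms"
    ABSTRACT_CODING = "abstract coding"
    REMOVAL_OR_MASKING = "removal or masking of blocks of text"
    OTHER = "other"


@dataclass
class Source:
    # Identification of a published text; field names are hypothetical.
    author: Optional[str] = None
    date_published: Optional[str] = None  # free-form or ISO 8601 date string


@dataclass
class QualitativeStudyFields:
    # Hypothetical container for the "Needed Fields" items.
    data_type: DataType
    pseudonyms_required: bool
    interview_type: Optional[InterviewType] = None
    anonymization_performed: bool = False
    anonymized_by: Optional[str] = None
    anonymization_methods: List[AnonymizationMethod] = field(default_factory=list)
    source: Optional[Source] = None

    def to_dict(self) -> dict:
        """Return plain strings, booleans, and lists, e.g. for serialization."""
        return {
            "data_type": self.data_type.value,
            "pseudonyms_required": self.pseudonyms_required,
            "interview_type": self.interview_type.value if self.interview_type else None,
            "anonymization_performed": self.anonymization_performed,
            "anonymized_by": self.anonymized_by,
            "anonymization_methods": [m.value for m in self.anonymization_methods],
            "source": asdict(self.source) if self.source else None,
        }


# Example record; all values are invented for illustration.
record = QualitativeStudyFields(
    data_type=DataType.QUALITATIVE,
    pseudonyms_required=True,
    interview_type=InterviewType.SEMI_STRUCTURED,
    anonymization_performed=True,
    anonymized_by="archive staff",
    anonymization_methods=[
        AnonymizationMethod.PSEUDONYMS,
        AnonymizationMethod.REMOVAL_OR_MASKING,
    ],
)
print(record.to_dict())
```

Keeping the anonymization methods as a list allows a study to record more than one method, which the source list suggests but does not state outright.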