Uses - Data Documentation Initiative

advertisement
Summarization of DDI Qualitative use cases
This document is an attempt to aggregate the DDI qualitative use cases.
Qualitative Data
Anything non-quantitative (ASSDA) (ICPSR)
that can’t be imported into Nesstar Publisher (ASSDA)
Text files (ASSDA) (ICPSR) (Finland)
Narratives (ICPSR) (Austria) (Cambridge) (Finland)
Transcripts (Austria) (Cambridge) (Finland)
Open text survey responses (verbatim) within a quantitative data file (ICPSR)
Image files (ASSDA) (ICPSR) (Austria) (Finland)
Audio files (ASSDA) (ICPSR) (Austria) (Cambridge) (Finland)
Video files (ASSDA) (ICPSR) (Austria) (Cambridge) (Finland)
Web pages (Finland)
Parts of files (ASSDA) (Austria)
Comments or memos to segments of text or to codes (ICPSR)
Categories, codes, concepts linked to segments (Austria) (Cambridge)
Segments can overlap (KU)
Multimedia stimuli linked to qualitative/quantitative response (Austria)
Maps (Austria)
Interpretative and analytic accounts - case studies, "thematic analyses" drawing on different datasets or innovation
reports (Cambridge)
Case records and other interpretative accounts (Cambridge)
Multimedia linked to Physical objects e.g. archaeological artifacts, sites (Cambridge)
Web cams (Cambridge)
Multimedia linked to dance/ performances (Cambridge)
Relationship to other specialized encodings – musical score – labanotation?
DDI should facilitate?
Ingestion (ASSDA) (ICPSR) (Cambridge)
QuDex as ingestion intermediate format to METS, FOXML/METS, DDI (QuDex)
Deposit (ICPSR)
Storage (ASSDA)
Preservation (ICPSR)
Description (ASSDA)
Controlled Vocabularies (Austria)
Search and browse (ASSDA) (ICPSR)
Link to download / video streaming? (ASSDA)
Processing (ICPSR)
Transcription (Austria)
Anonymization (Austria)(ICPSR)
Dissemination (ASSDA) (ICPSR)
Access control (ASSDA)
Access control - current drawback of QuDex (ASSDA) (ICPSR)
different access controls, e.g., the data restricted-access but documentation openly downloaded (ICPSR)
Secure system to store the data roster and direct identifiers (i.e. a pseudonym crosswalk file) separately from the
Research data (ICPSR)
Preservation metadata for qualitative data that meet METS/PREMIS standards (ICPSR)
CAQDAS/ CAQUDAS (Austria)
Core DDI / QuDex ontology (Cambridge)
a LITE version for only textual social science data as a customization of TEI (Finland)
Metadata
File level (ASSDA)
Word to term mapping from text mining (KU)
Term to variable and variable weights (scoring functions) from text mining (KU)
Clusters of documents (segments) from text mining (KU)
Stemming, stop word list, start word list, synonym lists, dictionary from text mining (KU)
Concept links from text mining (KU)
Tools
Depositor/ archivist tool to describe files (ASSDA)
(QuDex based) (ASSDA)
Resources (files), memos, categories, codes
Viewer plug-ins allowing attachment of memos categories and codes to segments (ASSDA)
QuDex end user tool (ASSDA)
Explore and view contents (ASSDA)
Uses
Reproduce / extend research findings (ICPSR)
Data transparency (ICPSR)
produce data and/or documentation files for qualitative studies (ICPSR)
storing output from Apache UIMA text analysis XMI format for CAS and SOFA (QuDex)
Model
QuDex (ASSDA) (ICPSR)
Fedora (ICPSR)
FOXML(Cambridge)
Fedora RELS ontology (Cambridge)
More than 'isMemberOf', 'hasPart', 'isConstituentOf', 'isAnnotationOf' (Cambridge)
Dublin Core annotations in a triplestore (Cambridge)
Needed Fields
Data type – quantitative, qualitative, mixed (ICPSR)
Pseudonyms required (ICPSR)
Interview type – structured, semi-structured, open ended (ICPSR)
Anonymization performed and by whom, method used (ICPSR) (Austria)
Pseudonyms, abstract coding, removal or masking of blocks of text, other (ICPSR)
Download