Proposed DFT IG in form V1

advertisement
Name of Proposed Interest Group: Data Foundation and Terminology (DFT) Interest Group
Case Statement
Brief statement of objectives:
Agreed upon terminology, the meaning of terms used in discussion, is an important aspect of all
group work. The DFT IG task is to continue the discussion evoked by the RDA DFT WG to support
continuing RDA efforts regarding basic data concepts & framework models along with their
vocabularies. A key aspect of the work is to support broader model and vocabulary agreements for
and across these RDA WGs and IGs along with their representative communities and stakeholders.
This will help continue the synchronization of RDA conceptualization and enable better
understanding within and between RDA groups.
User scenario(s) or use case(s) the IG wishes to address
The initial DFT WG was able to develop what were called snapshot vocabularies for a subset of
interests around early WGs. However, many WGs are still active along with a growing number
of IGs which define a larger data space than covered in the DFT WG. An example is the increased
scope of the DF IG. There is general value for communities to define the terms used in work and
to reach agreement on the terminology and meaning of data vocabularies used in conversations.
Shared vocabularies establish a common ground for interactions which in turn can help with the
RDA goal of adoption of common data sharing practices and interoperation.
Initial work has given some value to RDA by starting on an abstract data organization model.
However it did not have adequate time and involvement to reach consensus on things like common
building blocks and their characteristics that are needed and useful across the current and expected
RDA groups. For example there remains consensus on terms used by the various metadata groups.
There is value in continuing activities supporting discussions to further this vision & to create
awareness of relations between different WG & IG concepts.
Based on the progress of the IG one or more WGs, we expect collaborative sessions with other
WGs as may be formed to address specific vocabularies.
Objectives (primary focus areas):
Objectives for the new IG include
 Advancing a methodology for developing group vocabularies,
o This will be supported by registration and documentation of vocabulary in the
Term Tool (Ted-T) developed at part of the DFT WG so that everyone can easily
refer to and comment on term concepts,
 developing community based recommendations for group vocabularies
 clarification of some foundational terminologies,
 writing one or more white papers synthesizing vocabularies as support of RDA group
activities and to drive the discussions and subsequent activities,
 providing quality DFT vocabulary sets,
 exploring ways to encourage vocabulary adoption and ensure its use,



Identifying new topics/areas to keep DFT discussion response to "hot" topics and ne
interests,
Exploring formalization of definitions such as metadata and policy by enhanced methods
and adaptation of the Ted-T tool to output useful definitions in RDF,
support the above by engaging RDA groups and relevant communities.
Participation
Potentially all groups could be engaged and participate in this IG and we with them. Considerable
discussion of vocabularies has been part of RDA group activities at Plenaries and as part of
discussion with the DFT WGs. Much more work and discussion would be useful such as with the
PP WG and its terminology that was only briefly sketched out without full definitions. PP along
with MIG has expressed an interest in more formalized definitions that can be processed by
computer and the Ted-T tool may be capable of doing this or at least demonstrating its feasibility.
Discussion with several IGs for example continues even though related WGs are ending. An
example is the PID IG as well the newly formed DF IG. Some of this work is reflected in concepts
and definitions in our terminology tool. Interest remains in several ongoing groups including those
that are inherently dealing with terminology such as the standardization of data categories and
codes WG, but also groups that have expressed specific interest or that have had sessions on
vocabulary develop. These include:








Legal Interoperability of Research Data Working Group expressed interest in developing
a compendium of legal terms for data, or at least link to another similar effort & would
want to participate in a side meeting at P5 focused on vocabularies.
Big Data Analytics IG
Data in Context IG
Data Fabric IG which is newly formed and working to agree on a description of what is
meant by the concept of DF This will involve some terminology clarification
Metadata IG and related groups
Research Data Provenance
Marine IG which noted that this is a “ topic” that arises on a regular basis within this
community and there is scope for a joint meeting on this topic at the next RDA plenary.”
Agricultural Data Interest Group
Note also that DFT IG activities should also be of help to the current RDA Infrastructure
WG chairs collaborations and participation at these meetings will be a way of sharing
information and getting valuable input.
Outcomes (
Success will be measured by the degree which this work on vocabularies is adopted as part of IG
and WG efforts. The IG will also serve the purposes of supporting related discussions at other
venues such as NDS with related interests.
This work horizon of DFT IG has at least a 15 month (3 Plenaries) view to assist groups currently
engaged. With their participation we expect to develop vocabularies for these groups and stored
these on the term tool developed as part of the earlier DFT WG. However, the IG may go on
longer as needed for adoptions and to accommodate new WGs or continuing issues for extant RDA
IGs. This will help to mature, sharpen & harmonize the concept definitions for adoption and use.
Mechanism
The general goal is provide a forum for discussions of standard vocabularies across and with
RDA groups and related communities. We will hold virtual working sessions to prepare for P5
and kickoff the IG at P5. Part of this preparation will be to work with completing WGs (DTR,
PIT & PP) to develop final vocabularies that are part of the adoption effort.




DFT IG will fully employ the practices and capabilities provided by the RDA community to
date.
These include:
Monthly or bimonthly telecoms/”webex” to get the IG up and prepared for activities at P5.
Face-to-face work sessions at RDA Plenaries supporting interactions amongst IG members and
with interested groups to pursue vocabulary discussion and capture in Ted-T.
Summaries of sessions and white papers will be publicly available from the RDA Wiki.
Meeting schedules, group goals and operation specifics will be matured after initial experience
at P5.
Timeline:
We will hold virtual working sessions to prepare for P5 and kickoff the IG at P5. Part of this
preparation will be to work with completing WGs (DTR, PIT & PP) to develop final
vocabularies that are part of the adoption effort. We expect to participate in co-development of
vocabularies for the previously mentioned groups between P5 and P6 and conduct community
sessions at P6,such as for the DF IG.
Potential Group Members:
The idea for this IG emerged from the discussions at P4 session between participants and other
group chairs whose activities have included vocabulary development or would like to. Some
experts who showed interest joined this IG and participated in DFT WG events or are adopters
on the vocabulary are included.
FIRST NAME LAST NAME
EMAIL
TITLE
Proposed Co-Chair,
SOCoP Secretary
Proposed Co-Chair,
MIG Co-chair
Gary
Berg-Cross
gbergcross@gmail.com
Raphael
Rebecca
Ritz
Koskela
Keith
Jefferey
Jane
Greenberg
raphael.ritz@rzg.mpg.de
rkoskela@unm.edu
Keith.Jeffery@keithgjefferyc
MIG Co-chair
onsultants.co.uk
janeg@drexel.edu
Drexel U.
Peter
Reagan
Rainer
Tim
Wittenburg
Moore
Stotzka
Delauro
peter.wittenburg@mpi.nl
rwmoore@renci.org
rainer.stotzka@kit.edu
timmo@jhu.edu
Tobias
Weigel
weigel@dkrz.de
Johannes
Keizer
johannes.keizer@fao.org
Helen
Glaves
hmg@bgs.ac.uk
Morris
chris.morris@stfc.ac.uk
Chen
chenY58@cardiff.ac.uk
Paul
Uhlir
PUhlir@nas.edu
Bridget
David
Almas
Dubin
bridget.almas@tufts.edu
ddubin@illinois.edu
Chris
Yin
DF IG co-chair
PP Co-chair, RENCI
PP Co-chair
Johns Hopkins
Deutsches
Klimarechenzentrum
GmbH (German
Climate Computing
Center)
FAO
co-chair of the Marine
Data Harmonization IG
Team leader on the
PIMS lims project,
Daresbury
ENVRI project,
Cardiff, UK
Director, Board on
Research Data and
Information, NAC, US
Tufts U
University of Illinois
Download