Crowdsourcing Tasks in Open Query Answering Elena Simperl Barry Norton Denny Vrandeˇci´c

advertisement
AAAI Technical Report SS-12-06
Wisdom of the Crowd
Crowdsourcing Tasks in Open Query Answering
Elena Simperl
Barry Norton
Denny Vrandečić
Karlsruhe Institute of Technology, Germany
elena.simperl@kit.edu
Ontotext AD, Bulgaria
barry.norton@ontotext.com
Karlsruhe Institute of Technology, Germany
denny.vrandecic@kit.edu
Abstract
and purpose of the application can form the basis for the
automatic design and seamless operation of crowdsourcing
features to overcome the limitations and complement computational methods and techniques. This paper instantiates
our general description of a crowdsourcing-enabled framework for linked data management tasks (Simperl, Norton,
and Vrandečić 2011) for the use case of open query answering.
The following section describes open query answering.
Section then identifies the tasks required in order to answer
open queries, before Section describes how they can be formalized using our framework. Related work is given in Section .
Open query answering is the idea of answering queries that
are not given using the vocabulary of the queried knowledge
base but instead the vocabulary of the inquirer. Many aspects
of open query answering can be tackled through the combination of human effort with algorithmic techniques. In this
paper we explore its applicability to crowdsourcing, using a
framework in which human and computational intelligence
can co-exist by augmenting existing Linked Data and Linked
Service technology with crowdsourcing functionality. We analyze how the task can be decomposed and translated into
Mechanical Turk projects in order to achieve this vision.
Introduction
Open query answering
Answering open queries, that are, queries that are not using a
predefined vocabulary, required by the knowledge base, but
are open to the inquirer using their own vocabulary, requires
a number of tasks to be completed, namely identity resolution, vocabulary mapping, and metadata completion. These
tasks have been subject to a wide array of techniques that
attempt to perform them automatically, but yet require human input to produce training data and validate their results
or data of sufficient quality.
We have extensively discussed in previous work the importance of combining human and computational intelligence to handle such inherently human-driven tasks. Abstracting from their technical flavor in the context of Linked
Data, such tasks tend to be highly contextual and often
knowledge-intensive, thus challenging to fully automate
through algorithmic approaches (Cuel et al. 2011; Siorpaes
and Simperl 2010). Instead of aiming at such fully automated solutions, which often do not reach a level of quality
required to create useful results and applications, we propose
a framework in which such human computation becomes an
integral part of existing Linked Data and Linked Service
technology, through crowdsourcing functionality exposed
via platforms such as Amazon’s Mechanical Turk.1 We will
demonstrate how the types of tasks that are required to answer open queries can largely be uniformly decomposed,
and a formal, declarative description of the domain, scope
We define the task of open query answering as answering
a query that is given using the vocabulary of the inquirer
instead of the vocabulary of the knowledge base. Whereas
query answering systems usually require the usage of their
vocabulary and conceptualization, in open query answering
this is not the case. The client application can simply use
their own ontology and not worry about mapping or translating neither the results nor the query to the ontology of the
knowledge base.
To give an example: assume a SPARQL endpoint that contains knowledge about a number of friends using the FOAF
vocabulary (Brickley and Miller 2005), e.g. like the following set of triples:
hp:Harry foaf:givenName "Harry" ;
foaf:familyName "Potter" ;
foaf:mbox <mailto:scarface@hogwarts.ac.uk> .
foaf:knows hp:Hermione , hp:Ron .
hp:Hermione foaf:givenName "Hermione" ;
foaf:familyName "Granger" .
hp:Ron foaf:givenName "Ron" ;
foaf:familyName "Weasley" ;
foaf:mbox <mailto:weasley17@hogwarts.ac.uk> .
An address book application might want to query that
SPARQL endpoint in order to learn about all the acquaintances of its user and to allow to select them. Assume that
the application is using the vCard vocabulary (Dawson and
Howes 1998), which is widely used for address books applications. If Harry uses the application, it might send the
following query to the endpoint:
c 2012, Association for the Advancement of Artificial
Copyright Intelligence (www.aaai.org). All rights reserved.
1
http://www.mturk.com/
65
descriptions (e.g. the value for the property vcard:fn
(full name) would be the concatenation of the values for
foaf:givenName and foaf:lastName).
SELECT ?name ?lastname ?email WHERE {
?friend foaf:knows ex:Harry .
?friend vcard:given-name ?name .
?friend vcard:family-name ?lastname .
?friend vcard:email ?email .
}
3. Metadata completion: It is clear, given the state of the
current Web of Data (and may always be true), that certain properties, necessary for a given query, may not be
uniformly populated. Manually conducted research might
be necessary to transfer this information from the human
readable Web, especially where scraping or database conversion have failed. In the same way, manual inspection
(e.g., during browsing) or automated quality checks may
indicate that the data required from a query is of low quality. Fixing this is typically a human-driven tasks, even if
problem areas can be identified automatically.
Using the SPARQL query on the above knowledge base
will yield an empty result set. But the application aims to get
back the following SPARQL result set:
?name
Hermione
Ron
?lastname
Granger
Weasley
?email
mailto:knowitall@hogwarts.ac.uk
mailto:weasley17@hogwarts.ac.uk
In order to achieve this goal, three steps need to happen:
1. The SPARQL processor needs to resolve the identity of
the two Harrys.
Technical realization
2. The SPARQL processor needs to map the vCard properties used in the query to the appropriate properties in
FOAF.
Our approach to use Mechanical Turk for open query answering envisions the enhancement of SPARQL query processors with integrated crowdsourcing features that deal
with the packaging of specific tasks as Mechanical Turk
HITs, and with the integration of the crowdsourcing results
with existing, semi-automatic functionality. The former has
to include user interface management capabilities, in order
to produce optimal human-readable descriptions of specific
tasks operating on specific data, to increase workers’ productivity, and to reduce unintended behavior such as spam.
We propose to use SPARQL patterns to drive the generation
of HITs interfaces. The corresponding components of the
SPARQL endpoint need to interact with Mechanical Turk to
post specific HITs, assess the quality of their results, and exploit these results in a particular query. In order for this interaction to occur online we may require specific optimizations
to predict the time-to-completion of the crowdsourced tasks.
Technically the tasks introduced in Section can be characterized according to graph patterns from the SPARQL
query language representing input to the human task and its
desired output. In particular, the former expresses a query
over the existing collection of Linked Data, necessary to
provide the input to the task. The latter represents a query
that should succeed when the additional assertions that result from the human task have been added to the knowledge.
This is similar to the characterization of Linked Open Services (Norton and Krummenacher 2010) and Linked Data
Services (Speiser and Harth 2011), and also the characterization of discovery goals in (Norton and Stadtmüller 2011).
The approach is described in further detail in (Simperl, Norton, and Vrandečić 2011).
The inclusion of commonly-used predicates that have preconfigured screen representations in Linked Data Browsers2
means that tasks can be given HTML-based representations
simply based on existing technology.3 This is a significant
advantage of the Linked Data nature of the data sets being subjected to crowdsourcing in this approach. Below we
sketch the form of the input and output, across the previously
mentioned tasks.
3. The email of Hermione needs to be added to the knowledge base.
Note that for the sake of simplicity we have chosen a fairly
trivial example, but both the mappings and the identity resolution can become arbitrarily complex.
Human intelligence tasks in open query
answering
In the previous section we identified three steps required for
answering the query in our example. These three steps correspond to the following three tasks in linked open data management (Heath and Bizer 2011):
1. Identity resolution: Although identifiers for individual
resources can be directly reused in assertions in other data
sets (usually as the object in triples), it is often the case
that an identifier scheme is created for an overlapping set
of resources. One of the Linked Data best practices stands
that the predicate owl:sameAs should be used to equate
the two identifiers. Identity resolution, then, involves the
creation of owl:sameAs links, either by comparison of
metadata or by investigation of links on the human Web.
2. Vocabulary mapping: It is similarly encouraged to reuse
vocabulary terms over the Web, but often certain vocabularies are used in a specific context, or cover a given aspect of a domain better than another. In our example the
vCard vocabulary is wider used for contact data, which
is why the address book application assumes it, whereas
social network data is more aligned with the FOAF vocabulary. But both ontologies cover a fairly widely overlapping domain, thus allowing to map the vocabulary and
the conceptualization. Note that in the example above
we only have simple and complete one-to-one mappings.
In many cases the mapping may be incomplete (i.e. use
the rdfs:subPropertyOf or rdfs:subClassOf
predicates instead of owl:equivalentProperty respectively owl:equivalentClass) or no appropriate
concepts may exist and thus would need more complex
2
E.g. the retrieval of an object connect by foaf:depiction
and display it as an image.
3
See also http://km.aifb.kit.edu/sites/spark/.
66
Conclusions
Identity resolution The identity resolution between the
two Harrys can be represented by a single input graph:
In this paper we have outlined a framework for crowdsourcing the steps required for the task of open query answering. We have shown for an exemplary query how it breaks
down into tasks, and then how the well-defined structure and
semantics of standard-compliant LOD datasets can support
the automatic breakdown of tasks for further crowdsourcing. Especially the possibility to gather further information
by using resolvable identifiers enables us to set up human
tasks that contain all the necessary information.
We believe that the proposed solution could enhance today’s possibilities for answering open queries, providing viable and effective solutions for its challenges that are known
to be reliant on human input in one way or another.
{ ?vperson vcard:given-name ?vgiven ;
vcard:family-name ?vfamily ;
vcard:email ?vemail .
?fperson foaf:givenName ?fgiven ;
foaf:familyName ?ffamily ;
foaf:mbox ?fmail . }
and an output graph:
{ OPTIONAL { ?vperson owl:sameAs ?fperson } }
Existing Linked Data browsing technology can be used to
transform the graphs satisfying these queries into an HTML
page, showing the labels and the email address, with a button that will assert the owl:sameAs link, and another
which will not (hence the OPTIONAL keyword in the output
graph).
Acknowledgements
The research leading to this paper was partially supported
by the Network of Excellence PlanetData, funded by the
European Community’s Seventh Framework Programme
FP7/2007-2013 under the contract 257641.
Vocabulary mapping The vocabulary mapping is structurally similar to the identity resolution task in the given example, we merely need to compare the properties instead of
the instances and use the owl:equivalentProperty
predicate.
References
Brickley, D., and Miller, L. 2005. The Friend Of A Friend
(FOAF) vocabulary specification.
Cuel, R.; Morozova, O.; Rohde, M.; Simperl, E.; Siorpaes,
K.; Tokarchuk, O.; Widenhoefer, T.; Yetim, F.; and Zamarian, M. 2011. Motivation mechanisms for participation in
human-driven semantic content creation. Intl. J. of Knowledge Engineering and Data Mining 1(4):331–349.
Dawson, F., and Howes, T. 1998. vCard MIME Directory
Profile. RFC 2426, Internet Engineering Task Force.
Doan, A.; Ramakrishnan, R.; and Halevy, A. 2011. Crowdsourcing systems on the World-Wide Web. Communications
of the ACM 54:86–96.
Franklin, M.; Kossmann, D.; Kraska, T.; Ramesh, S.; and
Xin, R. 2011. CrowdDB: answering queries with crowdsourcing. In Proceedings of the 2011 International Conference on Management of Data SIGMOD 2011, 61–72.
Heath, T., and Bizer, C. 2011. Linked Data: Evolving the
Web into a Global Data Space. Synthesis Lectures on the
Semantic Web Theory and Technology. Morgan & Claypool.
Norton, B., and Krummenacher, R. 2010. Consuming dynamic linked data. In Proceedings of the First International
Workshop on Consuming Linked Data COLD2010, volume
665. CEUR-WS.org.
Norton, B., and Stadtmüller, S. 2011. Scalable Discovery of
Linked Services. In Proc. 4th Intl. Workshop on REsource
Discovery (RED 2011), volume 737. CEUR-WS.org.
Parameswaran, A., and Polyzotis, N. 2011. Answering
Queries using Humans, Algorithms and Databases. In Conference on Inovative Data Systems Research CIDR 2011.
Schall, D.; Dustdar, S.; and Blake, M. 2010. Programming
Human and Software-Based Web Services. IEEE Computer
43(7):82–85.
Metadata completion In order to encode this task we
check if a the required triple is part of the graph (note that
the NOT EXISTS filter was added in SPARQL 1.1):
{ ?person vcard:given-name ?given ;
vcard:family-name ?family .
FILTER NOT EXISTS { ?person vcard:email ?m } }
with the output graph:
{ ?person vcard:email ?email }
Related work
Combining traditional data management technology and
crowdsourcing has recently received some attention in the
area of data bases (Doan, Ramakrishnan, and Halevy 2011),
with approaches such as CrowdDB (Franklin et al. 2011) and
TurkDB (Parameswaran and Polyzotis 2011) proposing extensions of established query languages and processing techniques to deal with the challenging inherently arising when
dealing with less deterministic computational resources such
as humans in environments with high performance and scalability constraints.
There is an increasing body of research available that
looks into methods and techniques to optimize worker productivity and HITs design, with the most promising findings
being published at the annual HCOMP workshop.4 Combinations of human and computational intelligence outside
the scope of Mechanical Turk has been traditionally researched in the workflow engineering area (Schall, Dustdar, and Blake 2010). Issues related to task design and assessment of quality results have been addressed in the area
of Games with a Purpose, for instance in (Thaler, Siorpaes, and Simperl 2011; Seneviratne and Izquierdo 2010;
Van Ahn and Dabbish 2008).
4
See http://www.humancomputation.com/
67
Seneviratne, L., and Izquierdo, E. 2010. An interactive
framework for image annotation through gaming. In Proceedings of the International Conference on Multimedia Information Retrieval MIR 2010, 517–526.
Simperl, E.; Norton, B.; and Vrandečić, D. 2011. Crowdsourcing tasks in Linked Open Data Management. In Proceedings of the Xth Workshop on Consuming Linked Open
Data (COLD2011). Springer.
Siorpaes, K., and Simperl, E. 2010. Human intelligence in
the process of semantic content creation. World Wide Web
Journal 13(1):33–59.
Speiser, S., and Harth, A. 2011. Integrating Linked Data and
Services with Linked Data. In Proc. 8th Extended Semantic
Web Conference ESWC2011, volume 6643. Springer.
Thaler, S.; Siorpaes, K.; and Simperl, E. 2011. SpotTheLink: A Game for Ontology Alignment. In Proceedings
of the 6th Conference for Professional Knowledge Management WM 2011.
Van Ahn, L., and Dabbish, L. 2008. Designing games with
a purpose. Communications of the ACM 51(8):58–67.
68
Download