CLODA - University of Cyprus

Dagstuhl Seminar 10042, Demetris Zeinalipour, University of Cyprus, 26/1/2010
CLODA: A Crowdsourced
Linked Open Data Architecture
Georgios Larkou, Julia Metochi
Georgios Chatzimilioudis and
Demetrios Zeinalipour-Yazti
Data Management Systems Laboratory
Department of Computer Science
University of Cyprus
http://dmsl.cs.ucy.ac.cy/
First IEEE Intl. Workshop on Mobile Data Management, Mining and Computing on
Social Networks (MobiSocial), co-located with IEEE MDM'13, June 3, 2013, Milan, Italy.
© Larkou, Metochi, Chatzimilioudis and Zeinalipour-Yazti, Mobisocial'13, Milan, Italy
Crowdsourcing Definitions
• Crowdsourcing = Crowd + Outsourcing
– Jeff Howe (2006). "The Rise of Crowdsourcing". Wired.
• Definition from Wikipedia:
– "Crowdsourcing refers to a distributed
problem-solving model in which a crowd of
undefined size is engaged in the task of
solving a complex problem through an open
call"
"Crowdsourcing with Smartphones", Georgios Chatzimilioudis,
Andreas Konstantinidis, Christos Laoudias, Demetrios
Zeinalipour-Yazti, IEEE Internet Computing, Special Issue:
Sep/Oct 2012 - Crowdsourcing, May 2012. IEEE Press, Volume
16, Pages: 36-44, 2012.
Presenter: Demetris Zeinalipour, http://dmsl.cs.ucy.ac.cy/
Crowdsourcing Stakeholders
[Figure: stakeholder diagram. Labels: Requester (Crowdsourcer), Platform, Workers (Solvers), Open Call (Task), Solutions, Rewards]
Crowdsourcing Incentives
• Tangible (Monetary) Incentives
– Cash, Credit or Gifts (MTurk, Kickstarter)
– Unintended or as-a-by-product (reCaptchas)
• Ethical Incentives
– Socialize & Fun
– Earn Prestige
– Altruism
– Learn something New
• Usually a combination of several
incentives
CLODA Motivation
Collect & Link Open and Closed Data with
smartphones through an open call for ethical benefit
[Figure: CLODA overview. Labels: Linked Open Data (LOD) (Freebase, DBPedia); Closed Data: Web 2.0 APIs (Google, Twitter, Facebook); CLODA Prototype: Sensor / Geo Data, Collect, Reward, Verification; CLODA (LOD)]
Linked Open Data (LOD)
• Linked data refers to web-accessible data
(HTTP/URI addressable) that is structured
(RDF) so that computers can link and query
(SPARQL) the data [ aka the 4 principles ]
• Conceived by Tim Berners-Lee and realized by
the Semantic Web Community.
– Tim Berners-Lee (2006). "Linked Data - Design Issues". W3C.
• Linked Open Datasets (LOD):
– DBPedia (3.6M things describing Wikipedia)
– GeoNames (describing 7.5M Geographic features)
– YAGO & YAGO2 (combining Wikipedia, GeoNames and others)
– Freebase (39M things - Google's LOD project!)
– FOAF (describing People – Relationships – Open Social Net!)
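To make the idea of structured, queryable triples concrete, here is a minimal JavaScript sketch. It assumes a tiny in-memory list of (subject, predicate, object) triples; the sample triples and the `match` helper are illustrative assumptions, not part of DBPedia or of any SPARQL engine. A `null` in the pattern plays the role of a SPARQL variable.

```javascript
// Each fact is a (subject, predicate, object) triple; URIs identify things.
// The triples below are hand-written illustrations, not a DBPedia extract.
const triples = [
  ["http://dbpedia.org/resource/Milan", "rdf:type", "dbo:City"],
  ["http://dbpedia.org/resource/Milan", "dbo:country", "http://dbpedia.org/resource/Italy"],
  ["http://dbpedia.org/resource/Nicosia", "rdf:type", "dbo:City"],
  ["http://dbpedia.org/resource/Nicosia", "dbo:country", "http://dbpedia.org/resource/Cyprus"],
];

// match() is a hypothetical helper: null in the pattern acts like a variable.
function match(store, [s, p, o]) {
  return store.filter(
    ([ts, tp, to]) =>
      (s === null || s === ts) &&
      (p === null || p === tp) &&
      (o === null || o === to)
  );
}

// Roughly what "SELECT ?city WHERE { ?city rdf:type dbo:City }" asks for.
const cities = match(triples, [null, "rdf:type", "dbo:City"]).map(([s]) => s);
console.log(cities);
```

A real deployment would send the SPARQL text to an endpoint instead of filtering an array; the sketch only shows why uniform triples make generic querying possible.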
Traditional LOD (RDF)
• DBPedia Example: LOD describing 3.64M
Wikipedia things out of which 1.83M are
classified in a consistent ontology
– 416,000 persons, 526,000 places, 106,000 music albums,
60,000 films, 17,500 video games, 169,000 organizations,
183,000 species and 5,400 diseases
Infobox example
LOD the Google (JSON) Way!
• Freebase Example: another LOD by Google.
Instances
Relationships
LOD is Interlinked & Annotated
LOD is interlinked, e.g., Freebase points to DBPedia
predicates …
http://dbpedia.org/page/Manchester
LOD is Highly Interlinked!
http://www.stateofsearch.com/search-in-the-knowledge-graph-era/
LOD can be Queried!
LOD can be queried, e.g., Freebase with MQL queries
(JSON-encoded, like Web 2.0 APIs)
MQL queries resemble XPath querying
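An MQL query is a JSON template: fields with values are constraints, and fields set to `null` are slots to be filled in. The sketch below imitates that query-by-example style over a small in-memory array. It is only an illustration under stated assumptions: the sample records and the `queryByExample` helper are hypothetical and do not reproduce the Freebase service.

```javascript
// Toy in-memory records standing in for Freebase topics (sample data only).
const topics = [
  { type: "/music/album", artist: "The Police", name: "Synchronicity" },
  { type: "/music/album", artist: "The Police", name: "Outlandos d'Amour" },
  { type: "/music/album", artist: "Miles Davis", name: "Kind of Blue" },
];

// queryByExample() is a hypothetical helper: concrete values are constraints,
// null values are "fill this in" slots, mirroring the shape of an MQL query.
function queryByExample(records, template) {
  const keys = Object.keys(template);
  return records
    .filter((r) => keys.every((k) => template[k] === null || r[k] === template[k]))
    .map((r) => Object.fromEntries(keys.map((k) => [k, r[k]])));
}

const result = queryByExample(topics, {
  type: "/music/album",
  artist: "The Police",
  name: null,
});
console.log(JSON.stringify(result, null, 2));
```

The answer has the same shape as the question, which is what makes this style feel close to the JSON returned by Web 2.0 APIs.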
Web1.0: The Unstructured Web
http://books.google.com/
(content in HTML,
comprehensible only to
the user)
Web2.0: The Structured but Closed Web
https://www.googleapis.com/books/v1/volumes?q=databases
Content in XML/JSON,
comprehensible to computers
This web is closed: it requires
keys to access (OAuth) and
has download quotas
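As a small illustration of what "requires keys" means in practice, the sketch below builds the request URL for the Books endpoint shown on this slide and attaches an API key as a query parameter. The `buildBooksUrl` helper and the `YOUR_API_KEY` placeholder are assumptions for illustration; no request is sent, and quota handling is out of scope.

```javascript
// Base URL taken from the slide; the key travels as the "key" query parameter.
const BASE = "https://www.googleapis.com/books/v1/volumes";

// buildBooksUrl() is a hypothetical helper; apiKey is a placeholder value.
function buildBooksUrl(query, apiKey) {
  const url = new URL(BASE);
  url.searchParams.set("q", query);
  if (apiKey) url.searchParams.set("key", apiKey);
  return url.toString();
}

const requestUrl = buildBooksUrl("databases", "YOUR_API_KEY");
console.log(requestUrl);
```

Every client has to obtain and manage such a credential, which is the practical difference from an open LOD endpoint.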
Web2.0: The Structured but Closed Web
In fact, Web2.0 Services are omnipresent!
(Google, Twitter, Facebook, YouTube, LinkedIn, …)
http://www.programmableweb.com/ - 7800 APIs!!! + 6800 Mashups!
quota
https://code.google.com/apis
CLODA: Indoor Localization
• Numerous ways to localize without power-hungry GPS,
but most of the solutions rely on additional hardware
(RFID, sensor networks, etc.)
• Smartphones can nowadays localize off-the-shelf with
onboard sensors and WiFi signal fingerprints (coined
Hybrid Localization)
• These solutions require that services acquire local
data through Crowdsourcing (e.g., Google Indoor)
– Building RadioMaps, MagnetometerMap, etc.
– [Airplace] "The Airplace Indoor Positioning Platform for Android
Smartphones", C. Laoudias et al., Best Demo Award at IEEE MDM'12.
– [HybridCywee] "Demo: the airplace indoor positioning platform", C.-L.
Li, C. Laoudias, G. Larkou, Y.-K. Tsai, D. Zeinalipour-Yazti and C. G.
Panayiotou, in ACM Mobisys'13. Video at: http://youtu.be/DyvQLSuI00I
– Wifislam.com (bought recently by Apple for 20M)
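The role of a crowdsourced RadioMap can be sketched with a plain nearest-neighbour lookup: each known location stores the WiFi signal strengths observed there, and a new observation is assigned to the closest stored fingerprint. This is a generic textbook technique shown under stated assumptions, not the Airplace algorithm; the locations, RSS values, `MISSING` constant, and helper names are all invented for illustration.

```javascript
// A toy RadioMap: each entry pairs a known location with the signal strength
// (RSS, in dBm) observed there for each access point. All values are invented.
const radioMap = [
  { location: "Room 101", rss: { ap1: -40, ap2: -70, ap3: -85 } },
  { location: "Room 102", rss: { ap1: -65, ap2: -45, ap3: -80 } },
  { location: "Corridor", rss: { ap1: -75, ap2: -72, ap3: -50 } },
];

const MISSING = -100; // assumed RSS for an access point that was not heard

// Euclidean distance between two fingerprints over the union of access points.
function distance(a, b) {
  const aps = new Set([...Object.keys(a), ...Object.keys(b)]);
  let sum = 0;
  for (const ap of aps) {
    const d = (ap in a ? a[ap] : MISSING) - (ap in b ? b[ap] : MISSING);
    sum += d * d;
  }
  return Math.sqrt(sum);
}

// Return the RadioMap location whose fingerprint is closest to the observation.
function localize(map, observed) {
  return map.reduce((best, entry) =>
    distance(entry.rss, observed) < distance(best.rss, observed) ? entry : best
  ).location;
}

const estimate = localize(radioMap, { ap1: -42, ap2: -68, ap3: -90 });
console.log(estimate);
```

The quality of the estimate depends directly on how densely the map was collected, which is why the data has to come from many contributors.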
CLODA: Scanning Items
[Figure labels: Scan & Link, Preview, LOD]
CLODA: Indoor Localization
Founded on prior work
Cywee / Airplace
http://youtu.be/DyvQLSuI00I
Airplace and Anyplace
(Navigate) seamlessly indoor or outdoor
http://anyplace.cs.ucy.ac.cy/
NoSQL DataStore: CouchDB
Document in CouchDB
Map Function
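function(doc) {
  // Emit one row per author: key = document id, value = author name.
  // Declare the loop variables locally instead of leaking globals.
  var i, author;
  for (i in doc.authors) {
    author = doc.authors[i];
    emit(doc._id, author);
  }
}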
Results (through REST/HTTP or Futon)
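To see what rows such a map function produces, the following sketch runs it outside CouchDB. It assumes invented sample documents and a stand-in `emit` that collects rows in an array; in CouchDB itself, `emit` is supplied by the view server and the rows are served over REST/HTTP.

```javascript
// Sample documents (invented) shaped like the ones the map function expects.
const docs = [
  { _id: "book-1", authors: ["A. Author", "B. Writer"] },
  { _id: "book-2", authors: ["C. Scribe"] },
];

// Stand-in for CouchDB's emit(): collect every emitted key/value pair.
const rows = [];
function emit(key, value) {
  rows.push({ key, value });
}

// The map function from the slide, with locally declared variables.
function map(doc) {
  for (const i in doc.authors) {
    const author = doc.authors[i];
    emit(doc._id, author);
  }
}

docs.forEach(map);
// View rows come back ordered by key; plain string comparison approximates that here.
rows.sort((a, b) => (a.key < b.key ? -1 : a.key > b.key ? 1 : 0));
console.log(rows);
```

Each document contributes one row per author, so a view built this way has as many rows as there are author entries across all documents.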
NoSQL DataStore: CouchDB
Export JSON Data to RDF with
Sessel (CouchApp that
generates RDF triples from
CouchDB documents)
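The general idea of exporting a JSON document as RDF can be sketched as follows: the document id becomes the subject, each field becomes a predicate, and each value becomes a literal object, written out as N-Triples lines. This does not reproduce Sessel's actual mapping or output; the `example.org` namespaces and the helper names are hypothetical, and the sketch assumes flat documents whose values are strings, numbers, or arrays of those.

```javascript
// Hypothetical namespaces; a real export would use agreed vocabularies.
const RESOURCE_BASE = "http://example.org/resource/";
const PROPERTY_BASE = "http://example.org/property/";

// Escape backslashes and double quotes for a simple string literal.
function literal(value) {
  return '"' + String(value).replace(/\\/g, "\\\\").replace(/"/g, '\\"') + '"';
}

// Turn one flat JSON document into N-Triples lines: one triple per field value.
function toNTriples(doc) {
  const subject = `<${RESOURCE_BASE}${encodeURIComponent(doc._id)}>`;
  const lines = [];
  for (const [field, value] of Object.entries(doc)) {
    if (field === "_id") continue; // the id is already used as the subject
    const predicate = `<${PROPERTY_BASE}${encodeURIComponent(field)}>`;
    const values = Array.isArray(value) ? value : [value];
    for (const v of values) {
      lines.push(`${subject} ${predicate} ${literal(v)} .`);
    }
  }
  return lines;
}

const ntriples = toNTriples({
  _id: "book-1",
  title: "Databases",
  authors: ["A. Author", "B. Writer"],
});
console.log(ntriples.join("\n"));
```

Interlinking with existing LOD would additionally require emitting objects that are URIs of resources in other datasets rather than plain literals.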
CLODA Motivation
Collect & Link Open and Closed Data with
smartphones through an open call for ethical benefit
Last step (CLODA LOD): essentially provided interlinking
with existing LOD
CLODA on the LOD Graph!
CLODA
http://www.stateofsearch.com/search-in-the-knowledge-graph-era/
CLODA Incentives
• Incentives:
– Ethical Benefit
• Similar to people-centric sensing / wardriving
– Enhance collaboration between users
• Richer querying possibilities in the future.
– Might be imposed by an Organization.
• E.g., Inventory Management in a Hospital
CLODA Quality Issues
• Data Quality / Data Freshness
– LOD data suffers from both of these aspects
– Crowdsourcing offers Freshness but still lacks
explicit Quality guarantees (repeat N times then
majority vote)
• Possible Solutions: Integrate Location-aware
Techniques to validate data added and linked
– Task of Identifying the neighbors of all users
continuously with Proximity, see IEEE MDM'12.
– Task of Identifying similarly moving users with
SmartTrace, see IEEE TKDE, June, 2013.
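The "repeat N times then majority vote" baseline mentioned above can be sketched in a few lines. The `majorityVote` helper and the sample reports are hypothetical; the sketch accepts an answer only when more than half of the workers agree, and otherwise returns `null` so that the item can be re-issued.

```javascript
// majorityVote() is a hypothetical helper: it returns the answer given by more
// than half of the workers, or null when no answer reaches a strict majority.
function majorityVote(answers) {
  const counts = new Map();
  for (const a of answers) counts.set(a, (counts.get(a) || 0) + 1);
  for (const [answer, count] of counts) {
    if (count > answers.length / 2) return answer;
  }
  return null;
}

// The same item reported N = 5 times by different workers (invented answers).
const reports = ["Room 101", "Room 101", "Room 102", "Room 101", "Corridor"];
const accepted = majorityVote(reports);
console.log(accepted);
```

Voting needs N independent contributions per item, which is the cost that location-aware validation aims to reduce.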
CLODA Testbed Issues
• Currently, there are no testbeds (like MoteLab,
PlanetLab) for realistically prototyping Smartphone
Network applications and protocols at a large scale.
• Currently applications are tested in emulators.
– Sensors are not emulated.
– Reprogramming is difficult.
• SmartLab (http://smartlab.cs.ucy.ac.cy/) is a first-of-a-kind programmable cloud of 40+ smartphones
deployed at our department enabling a new line of
systems-oriented research on smartphones.
"Crowdsourcing with Smartphones", Georgios Chatzimilioudis, Andreas Konstantinidis, Christos Laoudias, Demetrios
Zeinalipour-Yazti, IEEE Internet Computing (IC '12), Special Issue: Sep/Oct 2012 - Crowdsourcing, May 2012. IEEE
Press, 2012.
"Demo: A Programming Cloud of Smartphones", A. Konstantinidis, C. Costa, G. Larkou and D. Zeinalipour-Yazti, Demo at the 10th International Conference on Mobile Systems, Applications and Services (Mobisys '12),
Low Wood Bay, Lake District, UK, 2012.
CLODA Testbed Issues
SmartLab: Massive smartphone simulations with
our first global open smartphone IaaS cloud –
http://smartlab.cs.ucy.ac.cy/
Static Androids
Mobile Androids
[SmartLab] "Demo: a programming cloud of smartphones", A. Konstantinidis, C.
Costa, G. Larkou, D. Zeinalipour-Yazti, In ACM Mobisys '12. [ By our Group ]
CLODA: A Crowdsourced
Linked Open Data Architecture
Thanks! Questions?
Georgios Larkou, Julia Metochi
Georgios Chatzimilioudis and
Demetrios Zeinalipour-Yazti
http://dmsl.cs.ucy.ac.cy/
First IEEE Intl. Workshop on Mobile Data Management, Mining and Computing on
Social Networks (MobiSocial), co-located with IEEE MDM'13, June 3, 2013, Milan, Italy.