- LoCloud

advertisement
XIIth Annual International Conference for
Professionals in Cultural Heritage
LoCloud Micro Services and
the Digitisation Workflow
Walter Koch (kochw@ait.co.at)
AIT-Angewandte Informationstechnik
Forschungs GmbH, Graz - Austria
LoCloud is funded by the
European Commission's ICT Policy Support Programme
Content
•
•
•
•
•
LoCloud Project (relevant activities)
Vocabulary Microservice
Microservice used in the OpenUp! Project
Digitisation Workflow (Books)
Digital Curation (Long Term Preservation)
2015-11-08
Walter Koch (AIT)
2
LoCloud Project
(relevant activities)
2015-11-08
Walter Koch (AIT)
3
locoud.eu
About:
– LoCloud is a Best Practice Network co-funded under the
CIP ICT-PSP programme of the European Commission.
– LoCloud is supporting small and medium-sized
institutions in making their content and
metadata available to Europeana.
– A cloud-based technology infrastructure will enable the
aggregation of local content, and
– a number of micro-services will help to reduce technical,
semantic and skills barriers and to render the content
more discoverable and interoperable.
2015-11-08
Walter Koch (AIT)
4
Microservices
Six development teams:
AIT, Athena RC, AVINET, IPCHS, UPV/EHU, VUKFC
All the services are implemented on virtual machines in a
cloud testlab (using the OpenNebula cloud computing
platform)
API documentation is available for each service
End-user interfaces are provided for three microservices
Each service has been integrated in MORe
2015-11-08
Walter Koch (AIT)
5
LoCloud Microservices
Cloud-based micro-services for metadata enrichment and
capture:
• Geo-location API
• Vocabulary service
• Historic Place Names service
• Geo-coding application
• Vocabulary matching service
• Background linking service
• Wikimedia application
http://support.locloud.eu/
2015-11-08
Walter Koch (AIT)
6
LoGeo API
• Uses NER-Name Entity Recognition
• Recognizes and returns one or more place names
candidates accompanied by geographical coordinates.
GPL
IPCHS
2015-11-08
Walter Koch (AIT)
7
Vocabulary service
• Based on “TemaTres”
• Incorporates 29 standard
thesauri
• allows users to add and
create vocabularies, and
add terms or translations
• add thesauri terms to all
items of a metadata
package
• Integrated in MORE
• Exploited by the LoCloud
vocabulary matching
microservice via its API
GPL
AIT
2015-11-08
Walter Koch (AIT)
8
Historic Place Names service
• A semi-automatic historical geoinformation management function
and web service.
• Enables local cultural institutions
to collaborate in developing the
Thesaurus of HPN and enriching
ingested metadata.
• Enables export of selected
LoCloud HPN Thesaurus data sets.
GPL
VUKF
2015-11-08
Walter Koch (AIT)
9
Geo-coding application
• End-user oriented application
• allows institutions to add geographical locations to existing content in
a controlled crowd-sourcing environment and then
• load the augmented data back into their original production
databases.
GPL
AVINET
2015-11-08
Walter Koch (AIT)
10
Vocabulary matching service
• Builds on work in the PATHS project (FP7)
• Automatically assigns relevant concepts and terms to item records
• Retrieves terms from the Vocabulary service
Apache License 2.0
UPV/EHU
2015-11-08
Walter Koch (AIT)
11
Background linking service
• Based on work in PATHS
• Follows the Named Entity
Disambiguation approach
• Enriches items with links to
external contextual
information in DBpedia and
Wikipedia
Apache License 2.0
UPV/EHU
2015-11-08
Walter Koch (AIT)
12
Wikimedia application
• A web service that uses a REST interface to communicate with
Wikimedia commons
• The main functionalities are harvesting content form
Wikimedia, parsing the harvested content, and identifying
useful entities that can be mapped to the ESE or EDM
metadata schemas.
• The mapped ESE / EDM records are then sent to MORe to be
delivered to Europeana.
Wikimedia Commons Licensing
Athena RC
2015-11-08
Walter Koch (AIT)
13
The LoCloud
Aggregation Structure
Enrichment
Services
Europeana
Cloud
Testlab
2015-11-08
Walter Koch (AIT)
14
About: Microservice
Wikipedia: https://en.wikipedia.org/wiki/Microservices
(2015-09-15)
[...] microservices is a
• software architecture style in which
• complex applications are composed
of small, independent processes
• communicating with each other using languageagnostic APIs.
• These services are small, highly decoupled and
focus on doing a small task,facilitating a modular
approach to system-building.
2015-11-08
Walter Koch (AIT)
15
Evolution of Architectures
pwc: Technology Forecast: Rethinking Integration, Issue 1, 2014, page 3
2015-11-08
Walter Koch (AIT)
16
Vocabulary Microservice
2015-11-08
Walter Koch (AIT)
17
http://vocabulary.locloud.eu/
• Vocabulary Microservices
• This is the entry page for the
LoCloud vocabulary application.
• LoCloud uses a variety of standard vocabularies
during the metadata enrichment process.
These vocabularies have been established with the
TemaTres tool and are available in the LoCloud cloud
testlab.
• In addition, it is possible to feed the tool with your
own vocabularies, establish new vocabularies or
add languages to existing vocabularies.
2015-11-08
Walter Koch (AIT)
18
Vocabulary WebService
20+ Functions, API
Parameters
Example
2015-11-08
Walter Koch (AIT)
19
Example: Search
UNESCO-Thesaurus
http://test113.ait.co.at/tematres/unesco/services.php?task=search&arg=peace
2015-11-08
Walter Koch (AIT)
20
SPARQL-Example
DISMARC-Genres
• Query: "Find all Broader Term for 'Acid Jazz‘“
• SPARQL End Point: http://test113.ait.co.at/tematres/vocab/sparql.php
Query
Result
Thesaurus
Entry
2015-11-08
Walter Koch (AIT)
21
Service Call („fetchTopTerms“)
embedded in a test application
MicroService
Vocabulary Web
Service
2015-11-08
Walter Koch (AIT)
22
Test Application Development
Designer View
Development of a
Drop Down List
2015-11-08
Walter Koch (AIT)
23
The Test Application:
Part of Data Entry
Path to the
Thesaurus Item
„Candidate Term“ 
Crowdsourcing activity 
Next Slide
(Vocabulary Management)
2015-11-08
Walter Koch (AIT)
24
Microservice used in the
OpenUp! Project (open-up.eu)
„Common Name Service“
2015-11-08
Walter Koch (AIT)
25
OpenUp! - Natural History
Aggregator for Europeana
Micro Service:
Common Name
Service
2015-11-08
Walter Koch (AIT)
26
OpenUp!
Transformation Workflow
Invoke Micro
Service:
Common Name
Service
Next Slide
2015-11-08
Walter Koch (AIT)
27
OpenUp!
Transformation Parameters
Micro Service:
Common Name Service
Service End Point (Cache)
2015-11-08
Walter Koch (AIT)
28
Result: OpenUp!
Enriched Meta Data Set
Latin Name
2015-11-08
Walter Koch (AIT)
Common
Names
29
Digitisation Workflow
(Books)
2015-11-08
Walter Koch (AIT)
30
Bibliotheca Alexandrina
http://www.bibalex.org
Points for MicroService Integration
2015-11-08
Walter Koch (AIT)
31
Digital Curation
(Long Term Preservation)
2015-11-08
Walter Koch (AIT)
32
Digital Curation
State-of-the-Art Examples (2010)
• Archivematica
– Linux VM based toolset allows producers and archivists
to implement a small scale workflow
• UC3 / CDL Merritt
– Micro-Services specifications
– implement building blocks for the Digital Curation
Lifecycle
2015-11-08
Walter Koch (AIT)
33
Archivmatica
Micro Services
2015-11-08
Walter Koch (AIT)
34
California Digital Library
UC3 Merritt
• California Digital Library UC Curation Center
• Independant, Interoperable Micro-Services
• Complex curation function through combination
of atomistic services
Knowledge
Enhancement
Annotation
2015-11-08
Walter Koch (AIT)
35
Micro-Services
(Merritt vs. Archivematica)
CDL/UC3-MERRITT
2015-11-08
ARCHIVEMATICA
Walter Koch (AIT)
36
High-Level Workflow
• OAIS activities provide
a sound basis
• Archivematica
Use Cases
• Merritt specifications
• Micro-Services are the
building blocks
2015-11-08
Walter Koch (AIT)
37
CDL
Curation Micro Services
1)
Annotation service 2)
The Annotation service provides
a mechanism by which the
consumers of managed digital
content can enrich that content
through additional description,
association, or related content.
The service supports the
following methods:
• Annotate
• Submit.
1) „An Emergent Micro-Services Approach to Digital Curation Infrastructure“, IPRES 2009
https://escholarship.org/uc/item/5313h6k9
2) UC Curation Center / California Digital Library, merritt: An Emergent Micro-services Approach
to Digital Curation Infrastructure; Rev. 0.6, 2010-03-25
2015-11-08
Walter Koch (AIT)
38
Contact
Walter Koch, kochw@ait.co.at
http://www.ait.co.at
2015-11-08
Walter Koch (AIT)
39
Download