IET ICT Labs: The first Doctoral Summer School “Imagine the Future

advertisement
IET ICT Labs:
The first Doctoral Summer School
“Imagine the Future in ICT”
(September 3-7, 2012)
We are pleased to announce that the first Summer School "Imagine the Future in ICT” will be
held from September 3 to 7, 2012 in Paris - France (Co-Location Center, 23 avenue d’Italie).
This Summer School is planned within the frame of the EIT ICT Labs Doctoral School and
will be organized by Inria, in collaboration with EIT ICT Labs partners.
During the event, we aim at creating a small community involving both students, young
innovators and speakers. Main thread is to arouse curiosity and imagination among the
attendees in allowing them to share open issues with recognized scientists.
Lectures of recognized scientists shall put emphasis on prospective issues (what may be the
state of the art in 5 years ?). Talks given by business leaders shall relate success (or failure)
stories from business.
Objective and subject
The general objective of this series of summer schools is to promote excellence and
imagination among PhD students, young researchers and innovators within the KIC EIT ICT
Labs.
To this aim, most of the speakers will be ERC (European Research Council) laureates who
will bring an ambitious and ground-breaking vision of their field... Keynote speakers from
leading companies in the field will also participate. In order to favor discussions and
exchanges, the audience will be limited to highly motivated people (no more than 40
persons).
The 2012 edition will address the challenges of data management in today's digital
world.
Organizational details
Participation in the Summer School (incl. lectures, accommodation, social events) will be free
for 40 attendees (35 from the nodes). Travel costs are not taken in charge: attendees should
make a travel claim to their own department / institution or node.
Admission:
1
We want to attract highly motivated students with a strong background at PhD level
(exceptionally master level), R&D engineers from industry or SMEs and young innovators.
We propose each node to participate in the admission process of attendees, to select and
propose 5 students with short personal profile (+ 2 additional names – final list will depend on
the places available). Closing of the first selection of attendees: May 15th, 2012.
Contact : contact-imagine-the-future-2012@inria.fr
Thematic presentation
The Digital World has become a reality: in early 2011, it has been observed that humanity
creates every two days as much data as it did from the dawn of human civilization until 2003,
and most of this data is user data, under the form of Web content, blogs, social network
contents, pictures and movies. Data usage and consumption scenarios cover all areas of
science, from experiment-produced data to documentary, historical and statistic data used by
social sciences. Ever increasing data volumes are also produced through any economic
activity; private data such as personal information, medical data, social media further
contribute to this data deluge. Computers, initially introduced in the aftermath of the 2nd
World War to achieve computation capacity breakthrough, are now being more and more
used to store and analyze such data.
Due to this ubiquity of data, “literacy” in today’s society necessarily includes the capability of
producing, using, reasoning about, and sharing digital data. Indeed, the availability of such
unheard-of volumes of data, especially on the Web, has started to change cognitive function
in humans: a psychology study has shown that when told that a piece of information is also
available on the Web, instead of memorizing the information, subjects tend to learn where to
look it up! From a scientific data management perspective, recurring challenges in this
context are:
- Large scale: the volumes of data to be manipulated explode in all application domains.
Architectures currently considered to handle such volumes are massively parallel, typically
deployed in clusters or clouds. Massively parallel storage and processing of data is
becoming just as important as the parallelization of massive computations that are growing
common e.g. in experimental or economic simulations…
- Potential heterogeneity: data is frequently produced by independent actors, with different
purposes and different interests in modeling the world. Heterogeneity gaps ensue, in the
structure, organization and operation constraints of each database, which appropriate
integration techniques must properly formalize and resolve. This is exacerbated in the Web
context (social networks, semantic Web, open data…) where sources count by the million.
- Structure and semantic complexity: more and more data on almost all aspects of real
world are now at our disposal, and so are potentially the answers to all our questions – if only
one knew how to tease these answers out of the data. Data formats are becoming complex
and new formats emerge frequently; complex languages with appropriate efficient
implementations are required. Data is often rich with meaning, describing the semantics of
an application domain. Well-grounded formalisms to capture this meaning enable the
automation of reasoning tasks, which goes beyond the reach of humans at the current data
volume scales. 11388
- Global intelligence: Because of the flood of data, tools to filter, classify select information
are more are more fundamental. Typically they rely on extracting “global intelligence” from
web information of from the participation of web users.
Different approaches have been considered such as recommendation, data mining, trust
evaluation.
The context is also particularly ripe with potential for interaction with other related branches
of computer science research: machine learning and natural language processing are
powerful tools for making sense of Web data; data visualization techniques help bring out the
message in the data; the programming language community shares the database scientists’
interest in verifiable languages amenable to massive parallelization…
2
With a main focus on data, our summer school will bring to the fore and explore these
unprecedented convergence and opportunities.
Tentative agenda
Monday, Sept. 3rd
8.30-9.00
Tuesday, Sept. 4th
Flash presentation 5 attendees
Wednesday, Sept. 5th
Flash presentation 5
attendees
Thursday, Sept. 6th
Falsh presentation 5
attendees
Friday, Sept. 7th
Flash presentation 5
attendees
9.00-10.15
Keynote and discussion
Keynote and
Keynote and discussion Keynote and discussion discussion
10.15-10.30
Coffee Break
Coffee Break
Coffee Break
Coffee Break
10.30-11.30
Brainstorming Session in
groups
Brainstorming Session in
groups
Brainstorming Session in
groups
Brainstorming
Session in groups
Lunch
Lunch
Lunch
Conclusion of the
Summer School
11.30-13.30
13.30-14.00
14.00-15.30
15.30-16.00
16.00-17.00
17.00-17.30
Evening
Registration
Opening + Welcoming
speach
Flash presentation 5-7
attendees
Flash presentation 5-7
attendees
Flash presentation 5-7
attendees
Keynote and discussion Keynote and discussion
Keynote and discussion Keynote and discussion
Coffee Break
Coffee Break
Brainstorming Session in Brainstorming Session in
groups
groups
Coffee Break
Brainstorming Session in
groups
Coffee Break
Brainstorming Session in
groups
Report of the groups a nd
s ynthes e of the da y
Report of the groups a nd
s ynthes e of the da y
Report of the groups a nd
s ynthes e of the da y
Free evening or
Recreational
Programme
Report of the groups a nd
s ynthes e of the da y
Invited talk
Free evening
Soci a l Event
Keynote speakers
- Prof. Stefano Ceri
Professor at Politecnico di Milano, Director of Ata Schuola Politecnica ( the school of
excellence for top-level master students selected from Engineering, Architecture, and Design
Faculties of Politecnico di Milano and Politecnico di Torino). ERC grant on search computing.
Interest: extending database technologies, engineering of Web-based applications and, more
recently, complex search systems, search computing
- Prof. Dr. Emil Aarts
Vice President and Scientific Program Manager of the Philips Research Laboratories
Eindhoven.
Interest: computational intelligence, interaction technology, and design.
- Dr. Serge Abiteboul
Professor at Collège de France, Member of the French Academy of Science, Senior Scientist
at Inria. ERC grant to develop a formal model for Web data management.
Interest: database, data management, web, change control, electronic commerce, electronic
documents, object databases, distributed databases, heterogeneous databases, active
databases, database theory, database languages, complexity theory temporal databases...
- Prof. Dr. Martin Kersten
CWI Fellow INS1 (Information Systems - Database Architectures).
Interest: Data based architectures, query optimization and their use in scientific databases.
- Dr. Ioana Manolescu
Research Director at Inria.
Interest: Cloud-based management of Web data, Efficient tehcniques for large-scale SML
data management, Database techniques for the Semantic Web, Scalable interactive
visualization.
- Dr. Bruno Levy
Senior Researcher at Inria, ERC Starting Grant GOODSHAPE, that aims at computing the
optimal sampling of 3D objects.
Interest: Computer Graphics: the interaction of light with the geometry of the objects.
3
- Pekka Aakko
Chairman of the board at Applifier (game promotion network on Facebook and Mobile with
about 150 million users monthly), Director at Applifier Europe, Founder and CEO at
Assembly Organizing.
Abstracts
-
Prof. Stefano Ceri
Search Computing
Search computing is a new multi-disciplinary science which provides the abstractions,
foundations, methods, and tools required to answer compex queries, such as "Who are the
strongest European competitors on software ideas? Who is the best doctor to cure insomnia
in a nearby hospital? Where can I attend an interesting conference in my field close to a
sunny beach?" This information is available on the Web, but no software system can accept
such queries nor compute the answer. Search computing is an evolution of service
computing where ranking is the dominant factor for composing services. While state-of-art
search systems answer generic or domain-specific queries, search computing enables
answering multi-domain queries via a constellation of dynamically selected, cooperating
search services.
CrowdSearch: Using the Crowds for Improving Search Experiences
While Web search is concerned with locating information from content
on the Web, crowdsearch denotes information extraction directly from online humans.
CrowdSearch uses the crowds as sources for the content processing and information
seeking processes; it fills the gap between generalized search systems, which operate upon
world-wide information - including facts and recommendations as crawled and indexed by
computerized systems – and social systems, capable of interacting with real people,
in real time. Crowdsearch is part of the more general trend towards crowdsourcing, which is
emerging as a general paradigm of organizing the human working and leisure activities by
means of computerized platforms. Industry is heavily moving toward crowdsourcing, as
demonstrated by the high number of new start-ups that exploit social networking, social
participation, humans as sensors, application gamification, and other variants of this trend.
Data
Dr. Serge Abiteboul
sciences:
from
First-order
logic
to
the
Web
Information technology has revolutionized our lives. Computers are traditionally seen as
computing machines, although their main purpose is now to manage information. We will
cover important aspects of data management, including relationships with mathematical logic
and complexity theory. We will discuss the Web that can be seen as a huge distributed
database. We will mention challenges: the extraction of global knowledge from the masses of
available information and distributed reasoning in the semantic Web.
“Viewing the Web as a Distributed Knowledge Base”
Information of interest may be found on the Web in a variety of forms, in many systems, and
with different access protocols. A typical user may have information on many devices
(smartphone, laptop, TV box, etc.), many systems (mailers, blogs, Web sites, etc.), many
social networks (Facebook, Picasa, etc.). This same user may have access to more
information from family, friends, associations, companies, and organizations. Today, the
control and management of the diversity of data and tasks in this setting are beyond the skills
of casual users. Facing similar issues, companies see the cost of managing and integrating
information skyrocketing.
4
We are interested here in the management of such data. Our focus is not on harvesting all
the data of a particular user or a group of users and then managing it in a centralized
manner. Instead, we are concerned with the management of Web data in place in a
distributed manner, with a possibly large number of autonomous, heterogeneous systems
collaborating to support certain tasks.
Our thesis is that managing the richness and diversity of user-centric data residing on the
Web can be tamed using a holistic approach based on a distributed knowledge base. All
Web informations are represented as logical facts, and Web data management tasks as
logical rules. We discuss Webdamlog, a variant of datalog for distributed data management
that we use for this purpose. The automatic reasoning provided by its inference engine,
operating over the Web knowledge base, greatly benefits a variety of complex data
management tasks that currently require intense work and deep expertise.
This work is part of the Webdam European project.
- Pekka Aakko
Provisional title: “Applifier - social game discovery with 150 million users”
Applifier is the company that transformed globally how new games in social networks and
mobile are discovered via cross promotion. Applifier's fast growth into leading cross
promotion service on Facebook has had multiple challenges: the company has grown into
servicing over 150 million gamers each month and the amount of data collected from users
looking for new games is huge. The presentation will touch the data collection and refining,
trends in the social gaming industry in general and how the company actually pivoted from a
failed social gaming startup.
5
Download