Excellence_LPv1204

advertisement
Industrial Leadership: Research & Innovation Actions
ICT 2014 - Information and Communications Technologies
H2020-ICT-2014-1
AutoMat: Autonomous Material Handling
Acronym: AutoMat
Date of Preparation: March 24, 2014
Work program topic addressed by AutoMat: Robotics
Coordinator: Libor Preučil
e-mail: preucil@labe.felk.cvut.cz
tel/fax: +420 224 357 290/224
#
Participant organisation name
Short name
Country
1
2
Czech Technical University in Prague
Institut National de Recherche en Informatique et en
Automatique FR
Aalto Korkeakoulusaatio
Almende B.V.
University of Lincoln
VOP CZ, s.p.
CTU
INRIA
CZ
FR
AALTO
ALMENDE
UoL
VOP
FI
NL
UK
CZ
3
4
5
6
AutoMat
page 2 of 37
page 2 of 37
AutoMat
Abstract
The project AutoMat targets research and development of scientific and technical principles focusing applications of
ad vanced auto mation and aut ono my i n th e do mai n of const ru ction machin es. Primary aim of the there under
suggested project is to r e s e a r c h a n d develop methods leading to practically usable and marketable unmanned systems.
The ultimate goal is to provide these with the necessary capabilities to propose and f u l f i l l autonomous modifications to its
workspace in material handling, construction and landscape forming tasks not for large devices only, but specifically for smaller
machines not addressed previously, and which are diverse in size and in the art of their use.. The keypoint herein stands in expected
substantial increase of the use efficiency of robotics methods with construction machines in accomplishing the required
tasks autonomously, making these adaptive, autonomous and independent on environmental infrastructure than ever
before, as well as delivering a scalable and bringing up a truly functional solutions. This will be achieved hand in
hand with creating a technological demonstrator at TRL8 in the project runtime followed by a real product in the
domain of small and inexpensive DAPPER construction machine, that will easily penetrate the European market.
The project focuses three core scientific challenges, are based on advices from, and expertise of construction machine
manufacturers, their users and i n i t i a l m a r k e t s t u d i e , w h a t d enotes the key enabling technologies
needed for application of autonomy and robotic technologies in construction machines domain:
(1) Autonomous modification of the given machine activity plan, including proactive robot-habitat interaction towards
improvement of the robots’ mo bi li t y and e n v i r o n m e n t maintenance, and further extension of capabilities of active
workspace modification to facilitate the robots’ localization and mapping (a concept we refer to as “Bulldozer SLAM”).
(2) Introduction of p erformance op ti mizat ion and safety into the machine controller learning. The proposed approach
is inspired by ontogenesis and enables expertise transfer among various types and sizes of, even coo p erat i n g ,
co n s t ru ct i on mach i n es , to assure their optimal safety and performace.
(3) 3D mapping and semantic segmentation of the highly dynamic 3D workspace providing techniques necessary to
consistently represent the environment over time, regardless of the changes imposed by the system itself, and to further
facilitate planning and cognitive control of the robots towards achieving the tasks of the target application.
The project aims at resolution of the principal and outstanding problems achieving highest possible robot autonomy in real,
uncertain and highly structured environments, with specific focus onto the project outcome application in the product. The
project’s achievements will be highlighted in a real-world experiment s c e n a r io , w h e r e a s t h e e x p e c te d im p a c t w i l l
s h o w u p in m a r k e t d i s s e m in a t i o n o f th e e l a b o r a t e d a u t o n o m y s o lu t io n s f o r s m a l l - s c a le c o n s t r u c t io n
m a c h in e s .
Contents
2
2
AutoMat
page 3 of 37
page 3 of 37
AutoMat
B.1 Excellence
3
B.1.1 Objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3
B.1.1.1 Target application and verification scenario . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3
B.1.1.2 Scientific challenges . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5
B.1.1.3 Introducing safety into the controller learnin g . . . . . . . . . . . . . . . . . . . . . . . . . . .
7
B.1.2 Relation to the work programme . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
11
B.1.3 Concept and approach
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
12
B.1.4 Ambition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
12
B.1.4.1 Perception and Distributed 3D mapping . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
13
B.1.4.2 Control of learning and operating risks
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
14
B.1.4.3 “Bulldozer SLAM”- System for Simultaneous Localization and Active Mapping with Environment
Modification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
15
B.1.4.4 Autonomous material handling
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
16
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
16
B.1.4.5 Relation to other EU projects
B.2 Impact
20
B.3 Implementation
21
B.3.1 Work plan Work packages, deliverables and milestones . . . . . . . . . . . . . . . . . . . . . . . . .
21
B.3.1.1 Work package list and Summary of effort . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
22
B.3.1.2 List of Deliverables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
22
3
3
AutoMat
page 4 of 37
page 4 of 37
AutoMat
Chapter B.1
Excellence
B.1.1
Objectives
A system for an autonomous manipulation with material in waste repositories and bulk materials storages is proposed
in this project called AutoMat (Autonomous Material Handling). The project aims at development of autonomous functionalities for robotic construction machines with focus on specific demands of end-users of these devices. Even thought,
alike solutions have already been introduced with large construction machines before (Catepillar, Komatsu, etc.), these approaches targeted
mainly dully automation of certain repetitive activities. As good examples may be i.e. automated transportation of ore and spoil in mines, or base
and causeway forming for highway construction sites, etc. The major and the most common constraints in these use cases are requirements for
well established infrastructure as i.e. precise (D)GPS availability, many assumptions on the workspace occupancy (typically a single-machine
operation only), machine activity following pre-programmed and rigid plan, all leading to a common de-numerator of these approaches – weak or
very limited capability to handle uncertainty through the given task accomplishment. Besides, the aforementioned solutions are offered
exclusively with very large and costly devices, what draws back the possibilities to further wider dissemination of these applications.
Therefore, to compensate for the previous drawbacks, a primary objective herein is to reach a higher usability of the
autonomy in these applications, which remains necessary for the system commercialization. The key idea is, the project
addresses research o f u s a b l e principles for autonomous manipulation with material according to humandesigned mission objectives, being capable to operate robots in varying scenes, under uncertainty and even more –
taking the advantage of intentional physical modifying of the scene towards improvement of the autonomous
operation. The herein presented approach counts with the requirements and constraints imposed by i n d i v i d u a l
r o b o t i c s y s t e m s , a n d e v e n f l e e t s o f i n v o l v e d r o b o t s , and optimizes coupling of each component of the problem to
the others (the workspace shaping and the robot operational capabilities). Usage of machine learning principles denotes a powerful tool to
achieve the optimization across various types of robotic construction machines, in diverse situations and tasks. Such an approach
of intentional active adaptation of the workspace by robots for their needs, and needs of the task given, stands for the
essence of the herein suggested advance towards efficient deployment of the robotic system in unstructured real world
workspace.
“The project AutoMat is devoted to development of a robotic system capable of cognitive material handling to achieve
the given task and to increase the robotic construction machines autonomy and performance.”
Particular requirements for the effective utilization of t h e s e robots are identified by the robotic system itself during
the mission accomplishment. As a simple example: the system may find out that a new road going from a
material storage would speed up the material displacement, what may invoke building of this road. Such, or similar
cognitive judgments are essential as the useful and important properties of the workspace can rarely be
specified
prior to the mission. These are dependent upon the characteristics of the workspace and can be obtained during the
robot deployment in the particular scenario first. The ability to handle all the aspects of the autonomous system
activity in real-world environment i s crucial for its deployment without necessity of presence and advice of robotic
experts. As the latter stands for one of the main current bottlenecks of commercial application of autonomous
construction machines for material handling and manipulation tasks, it has been selected as the main targeting of the
activities within the AutoMat project. Moreover, to reflect the needs of the future exploitation activities concerning
this technology, all the solutions will target the domain of small size construction machine in tight cooperation with
its’ manufacturer. The expectation is, the systems’ moderate price and wide variety of possible applications will affect
the dissemination activities and final impact in a positive way.
4
4
AutoMat
B.1.1.1
page 5 of 37
page 5 of 37
AutoMat
Target application and verification scenario
To be more specific, let us mention three scenarios, where we want to advance the development of recent technologies
for autonomous driving and operation of mobile machines to b r i n g t h es e t o sufficient technology readiness level for their
commercialization. These three scenarios include most of the typical functionalities demanded by end-users o f
s m a l l -size construction machines (see the attached market study), which has become one of the key products in the
VOP partner portfolio.
Scenario 1: The waste repository.
In the first scenario, a metropolitan waste repository area, which consists of a sealed ground pool on which the waste is
stored, is considered. To reuse efficiently the area, the waste is piled up into a hill or stored into layers for further reclamation. Fetching the new coming waste, the loading machines (in our scenario autonomous loaders and bulldozers)
shall drive up the hill and store the waist in a proper place. As the environment (the waste pile) rapidly changes with
the amount of stored waste over time, the machines have to build/re-build their environmental models and temporal
roads to the storage area (typically up the hill) to assure delivery of new waste. Similarly, a request for steady shaping
the waste hill to the requirements on the repository exists, as well as a request to cover the waste by a layer of regular
soil to reclaim the environment may arise. Again, further safe access roads for these activities are needed to be built
for
temporal
use.
5
5
AutoMat
page 6 of 37
page 6 of 37
AutoMat
The challenging task in this scenario is the autonomous control of robots in changing workspace (the workspace is
changed by the robotic systems itself). In addition, the autonomous system has to be capable of autonomous planning
of workspace modifications (to build the temporary access roads) with the aim of accomplishing the given plan (to
store the waste into hills or layers for the reclamation). These changes depends on the properties of the environment
(various types of waste etc.) and therefore cannot be planned prior the system deployment.
Scenario 2: Bulk materials storage and manipulation.
In the second scenario, a bulk material storage area (i.e. construction material retailer, or a factory using bulk raw
materials as inputs for their production) as sand, gravel or similar repositories is considered. The task for the system
is to keep the storage in terms of delivering required type and amount of the material to certain unloading area (i.e. to
load a truck) while not mixing these and keeping the storage clean and tidy. In the case of delivery of materials to be
stored, the loader shall fetch the material to a proper place of the storage (including carrying and unloading the delivery
on the top of the storage pile). As for The waste repository, the loaders shall build the environmental model, suggest
certain temporal changes (building temporal roads) to fulfill the fetch and carry tasks, etc. Specifically, it is assumed
the whole scenario may be executed in GPS-dark areas (inside a production hall, under metal roofing, etc.) so the
GNSS (i.e. GSP) systems cannot be relied on.
Scenario 3: Landscape reclamation activities.
The third scenario dealing with landscape reclamation is the most challenging one, but it brings also the biggest
possible income in future. Nowadays, there are many lands in technologically advanced as well as developing countries
that are burdened by the legacy of industrial activities like exhausted coal mines, dumps, or old and unused industrial
complexes. Across Europe more than 300,000 locations are waiting for the reclamation (see section ?? for details).
The proposed robotic landscape reclamation is a tool that can be used to recultivate such lands and to return them back
to the nature or to make them usable and comfortable for humans. The reclamation does not include only the initial
workspace restoration, but also a continuous long-term maintenance to provide the sustainable environment growth.
Landscape reclamation activities typically focus shaping of the land combined with soil redistribution. This typically
comprises removal of tailing materials followed by targeted shaping of the underlying layers and spreading of fresh
quality soils, sand, or peat, etc. which are bulk materials. For the reclamation performed in purely natural conditions
(i.e. in woods) the soil forming machines cannot rely purely on GNSS positioning as shielding and dispersion of the RF
signals (i.e. by tree canopies) decreases precision to unusable levels for this purpose. To assure sufficient precision
of the land forming actions and to guarantee fine localization of the machine (herein autonomous robot) relatively
to the ground it is useful to rely on existing landmarks on the processed ground. Artificial building of some other
additional/temporal landmarks during the machine work time (the previously mentioned bulldozer SLAM) will enable
and improve the machine activity efficiency towards the given goal. Besides, creation of temporary roads to fetch and
carry processed materials, remove tail materials and bring new ones seems to be inseparable part of this scenario as
well.
For testing the Scenario 1, a bordered workspace will be prepared to enable experimental verification of all scientific
challenges being tackled in the AutoMat project. In the environment, areas with heterogeneous surface (sand, gravel,
small stones, bigger stones...; see Fig. ?? for example) will be created. A testing plan of storing of construction waste
(crushed bricks) will be given to the system by human supervisors. The surface of the selected part of the industrial
waste dump will be appropriately modified to highlight the key functionalities of the system, which is the cognition and
adaptation to environment changes and the safe machine learning. The aim of the experiments, which is the verification
of developed methodologies, does not require utilization of real size excavators and smaller mobile robots of particular
partners may be used in this phase of testing. Therefore, the experimental site needs to be suited for utilization of
available robotic platforms. The robots workspace will be adapted prior the autonomous system deployment to enable
movement of employed small-size robots.
The robotic system equipped with necessary actuators for the material carrying and handling (shoveling) and sensors
for outdoor sensing will be capable of the plan execution and its modification towards better performance of robots.
Based on the evaluation of the short experimental trials and information obtained from the sensors, the system can
design new auxiliary roads to simplify movement of robots. The road building will be demonstrated by shoveling of the
sand from a harder surface. This action can be realized with small size robots and it provides measurable improvement
of the robot’s movement. Besides, the system can propose a creation of new localization landmarks (e.g. a small
sand hill or a group of stones) in locations where the localization system suffers from high uncertainty (the proposed
Bulldozer SLAM).
6
6
AutoMat
page 7 of 37
page 7 of 37
AutoMat
Thanks to the Destro company, being part of the AutoMat advisory board, permission to realize the autonomous wast
7
7
AutoMat
page 8 of 37
page 8 of 37
(a) Outdoor bulk materials storage.
AutoMat
(b) Indoor bulk materials storage.
(c) The industrial waste repository - overview.
(d) The industrial waste repository - detail.
Figure B.1.1: Examples of workspaces used for final demonstration (scenario 1 and scenario 2). Source: Destro and
VOP companies.
handling task in the workspace of industrial waste dumps has been obtained. The provided area is large enough to
prepare various real-world experiments in such a scale, which corresponds to the size of the utilized experimental
robots (see Fig. ?? for examples of available robotic platforms of university partners). Besides, the consortium may
take the advantage of various environments presented in the area (see Fig. ?? for examples). The experimental site
is part of the protected areas of the Destro company with limited access, which makes the experiments reasonable
and with minimal costs (there will be no charge required by the Destro company for these services).
The second scenario will be tested at the test-field of VOP partner, built for this purpose and using its small-size
construction machines (DAPPER). The purpose of this endeavour is t o c r e a t e c o n d i t i o n s f o r transfer of the
technology and skills gathered previously with the smaller laboratory robots o nto th e ta rg et D AP P E R product.
Afterwards, a selected set of particular functionalities creating autonomy of the controlled machine, specifically imposed by
material storage cases and material manipulation scenarios, will be encapsulated and provided to p r o s p e c t i v e f u t u r e
u s e r s o f t h e s ys t e m – D A P P E R customers. In addition, the complete integrated system will be tested in its final
configuration and for the expected range of environmental conditions for use. The aim of this experimentation is to
verify, that completed software system together with the VOP’s product and given use scenarios has qualified for
commercialization. The final system is expected to satisfy safety conditions and to achieve reliability and performance
required by TRL8 standard.
B.1.1.2 Scientific challenges
8
8
AutoMat
page 9 of 37
page 9 of 37
AutoMat
In addition to the technological challenges required by the scenarios motivated by typical applications of smallsize construction machines, three main scientific challenges arise from the proposed concept: The first challenge
is the above mentioned possibility of the autonomous modification of the given material handling plan,
which is the key idea of the project. This application requires to equip the employed robots with a set
9
9
AutoMat
page 10 of 37
page 10 of 37
AutoMat
of skills for material handling, whereas two m a j o r bottlenecks were identified across roboticists and scientists and
manufacturers of construction machines communities:
• The real-world material handling at construction sites and waste repositories may involve a large fleet of
various machines with diverse end-effectors (attached tools).
• Development of controllers, their tuning for the task , robustness and safety tests, etc. directly with the large and
heavy machines may become too risky and dangerous for the workspace as well as for the u s e d robotic
machines themselves.
Therefore, the project aims at development of an approach, which introduces setup procedures and safety into the
controller learning stepwise with smaller and safer robots, and provides experience/skills transfer between
different robotic platforms subsequently.
The third scientific challenge is related to perception and representations for distributed mapping of dynamic 3D
workspace. The material handling in the above mentioned applications focus the following aspects of perception
and mapping:
• The robotic system itself can intentionally and markedly change the workspace, what forms the outdoor mapping
problem highly dynamic.
• A semantic segmentation of the 3D map is required to efficiently realize the material handling task.
• Various construction machines may change and perceive the workspace simultaneously.
All these afore listed challenges pave the research road far beyond the current state-of-the-art in robotics and machine
learning, and t h e a t t a i n e d s o l u t i o n s w i l l significantly extend the existing methodologies.
Proactive robot-environment interaction
The h e r e i n a d d r e s s e d robotic systems purposely modify its workspace to enable more efficient deployment for
the material handling, either via hierarchical segmentation of the material manipulation task into multiple levels, or by
building dedicated structures, navigation landmarks to adjust the workspace to its’ needs. In that aspect, certain
preliminary changes/adjustments of the workspace may help to fulfill much more demanding tasks consequently as:
• Autonomously proposed m o d i f i c a t i o n s t o
the workspace enable/simplify operations of robots
• The plan for material handling is purposely modified to couple better to the robots’ needs (constraints,
intentions)
Let us refer to the possibility of autonomous modification of the given material handling plan towards improving the
operational performance of robots as “proactive robot-environment interaction” in this project. This idea goes beyond
the regular understanding of autonomous systems operation in outdoor workspace – classically trying to
accommodate the robot to the environment. Here, the workspace is not considered as a passive invariant but it
serves as a vehicle that facilitates accomplishment of the desired task by robots. The robots have the c a p ability to
suggest changes to the workspace that are not explicitly included in the desired task. The modifications a r e
suggested t o influence the desired task of the robots in positive ways. These can be by increasing the efficiency of
the robots’ deployment, or by removing existing constraints or obstacles to accomplish successfully the task at all.
This brings intelligent robotics yet another step closer to human thinking.
In common scenarios, human actors tend prepare their working site to facilitate their foreseen activities. Construction
workers build scaffolding, auxiliary footpaths and other auxiliary structures even if these are not directly included in
their tasks. Besides, the human actors propose improvements based on their previous experience gained in similar
tasks before. We propose to design an intelligent robotic system with similar abilities in the AutoMat project.
In the material handling, the system is expected to suggest changes of the early plan for the given task during
its’ execution, r e s p o n d i n g t o v a r i a t i o n s o f t h e e n v i r o n m e n t s h a p e , p r o p e r t i e s , e t c . to enable effective
10
10
AutoMat
page 11 of 37
page 11 of 37
AutoMat
completion of the given task. T h e r e f o r e , t h e robots may on the fly propose new roads to optimize their motions, new
landmarks to serve their better localization and many further aspects.
All the afore represents a set of important steps towards effective and usable autonomy, that allows
commercialization of the autonomous system in question in broad field of construction works, material manipulation
and reclamation activities applications. Through the obtained level of autonomy a given plan may be adjusted for
realization by robots in a way as it would be done by robotic experts, if present.
In the proposed concept, the improvements to the current state-of-the-art in respective fields will be targeted using a
series of short autonomously designed partial and simplified attempts to solutions evaluated i n test trials.
Nevertheless, the beauty of segmentation of the target problem is, robots change the surrounding workspace i n a
m o d e r a t e w a y o n l y in these separate trials. T h a t a l l o w s t h e m t o measure and evaluate performance
in the key aspects of localization, motion performance, possibility to move the material, etc.
Based on
11
11
AutoMat
page 12 of 37
page 12 of 37
AutoMat
the obtained rewards, the successfully evaluated trials/actions are subject for further evolution and generalization. The
generalized actions are integrated into the proposed large-scale changes of the material handling plan.
An example of such a proactive robot-environment interaction might be an effort to adjust the workspace in order to
improve the localization of robots. This leads to a technique that we call “Bulldozer SLAM”.
Bulldozer SLAM
The robotic system autonomously adapts the given plan of the material handling to achieve more efficient and precise
localization of the robots in the workspace over time.
• The workspace is naturally modified to decrease the localization uncertainty and increase the map quality/precision.
• The robots learn the environmental changes that positively affect the localization uncertainty.
• The learning is realized during the robots’ deployment executing particular scenario.
The possibility of the workspace modifications towards better localization of robots exceeds the state-of-the-art of
SLAM approaches. Let us sort the SLAM methods according the process of the mapping and localization as follows:
(Passive) SLAM → Active SLAM → Bulldozer SLAM
The traditional SLAM approaches “passively” observe the working environment with the aim to determine robot position
and they build a model of the workspace, while the robot is controlled through some other means. The Active SLAM
techniques (also called SPLAM - Simultaneous Planning, Localization and Mapping) control actively the robot in order
to best minimal localization error, e.g. by avoiding places providing poor contribution to localization. The Bulldozer SLAM
goes even beyond as it controls not only the robot’s motion, but also other robot’s actuators leading to workspace
modifications.
There are robotic systems, that build artificial landmarks in the areas where the information from the obtained map is
not sufficient for the localization. These artificial landmarks can even be purposely built with the only aim for the
localization improvement. In the proposed Bulldozer SLAM, the landmarks arise as a side product of the material
handling. O f t e n , e ven a slight change of shapes of objects in the workspace may significantly decrease the
uncertainty of the localization with a minimal effect to the targeting of the given task.
B.1.1.3
Introducing safety into the controller learning
The active modification of the workspace by autonomous heavy Bulldozers and construction robots introduces a high
risk of damage of the robots, or hardly reversible and unwanted changes to the surrounding workspace, etc. More
generally, a key challenge for autonomous cognitive robots is to acquire the desired skills and thereafter
operate without bringing into danger themselves, or others, or the workspace.
Thus, dangerous situations must be thoroughly prevented during the robot training, and avoided during the course of
the robot missions. Two complementary approaches will be designed and deployed to address the safe learning and
operating challenges. The first one is inspired from the biological developmental process. The second one, detailed
below, proceeds by transfer of expertise between different robotic platforms.
B.1.1.3.1
Developmental Learning
The developmental process is remotely inspired by the learning of motion primitives by infants. The idea is that natural
limitations, e.g. their limited strength, prevent infants from endangering their body when exploring their sensori-motor
abilities. As children grow, their musculoskeletal system evolves and becomes stronger. Adults, who can operate with
full strength, are able to seriously injure themselves; but they are assumedly skilled enough to avoid injuries with high
probability.
This analogy leads to define a robotic developmental process as follows. The developmental learning will be implemented as the actuator ranges, initially very limited, are gradually increased along the learning process. The primary
12
12
AutoMat
page 13 of 37
page 13 of 37
AutoMat
stage relies on using randomly generated actions with very limited intensity, albeit sufficient to build some forward
models. This stage will be used to build a reward function. The progress beyond the state of the art, e.g. the design of
13
13
AutoMat
page 14 of 37
page 14 of 37
AutoMat
intrinsic, curiosity-driven motivations, relies on the use of the scarce feedback from the human teacher (“it’s better; it’s
not better”). These rewards will be used to determine when and how the robot actuator range can be extended, akin to
the process of musculoskeletal growth. Then, the intensity of the robot actions will increase as its skills improve.
The process will be iterated until the robot is able to deploy the desired skills with full strength.
In summary, the natural growth process is emulated by controlling and gradually extending the actuator ranges. The
teacher’s feedback allows the robot to simultaneously shape its exploration of its sensori-motor abilities (what is allowed) and to improve its skills in situation (what is appropriate).
Expertise transfer among various robotic platforms
Along the same line, the musculoskeletal growth of the robot can be emulated by considering different robotic platforms,
with increased motor abilities. The growth of robot bodies and “muscles” can be realized by a utilization of bigger
and more powerful vehicles. The proposed approach is inspired by the success story of NASA’s Mars exploration by
bigger and bigger mobile platforms. Taking the knowledge and experience obtained by several generations of vehicles
experimentally employed in the task of Mars exploration, the scientists developed a robust large-scale machine.
The progress beyond the state of the art is to achieve an automatic transfer of expertise among robotic systems,
combining the fast and cheap acquisition of primary skills on usually smaller and inexpensive robotic platforms, and transferring these skills to more powerful robots thereafter. The mechanism of expertise transfer is
the second scientific challenge related to the machine learning in AutoMat. It tackles the following objective:
• To achieve a skill (policy) transfer between different robotic platforms.
The experience transfer between different robotic platforms of the team is useful in general, not only for the robot’s
“growth”. Let consider a situation in which a robot learns to move in a sandy part of the site. By the short experimental
trials, it discovers that stopping in this area may be dangerous, slowing down can cause unwanted wheel slippage, fast
turning may cause skidding, etc. Another robot with a smaller size can combine this experience with its own knowledge.
It can learn that sandy areas have to be passed quickly even without an experience from such a kind of area before.
Without this knowledge, visiting this area could be fatal for the smaller robot. Contrariwise, a bigger robot can learn
that it is better to go slower to avoid the skidding that could cause serious collisions and so on.
This cognitive approach is crucial for robust utilization of robots in the afore mentioned application scenarios. It
simplifies installation of the autonomous system on the site of a new customer. Besides, the cognitive approach
allows adaptation of the system and enables to deal with unforeseen events and slight variations of t h e given task.
This also reduces service costs and reduces time needed for adaptation of the system.
The utilization of the proposed machine learning techniques to obtain the controllers and skills for the material handling
is a very demanding process. As it is difficult to forecast if the required skills can be gained by these techniques in a
sufficient manner at the time when these are needed by depending research activities (e.g. the mentioned proactive
robot-environment interaction requires the learnt skills as one of its inputs), the learning procedures design remains a real
challenge. Nevertheless, the benefit of obtaining safe controller/skills learning and the skills transfer between
different platforms together is so attractive for end-users of the t e c h n o l o g y i n q u e s t i o n that the AutoMat
consortium has decided to spent part of the planned efforts on investigation of these methods. Also some manufacturers
of large construction, or agricultural machines (Liebherr Group, Zetor tractors - members of the AutoMat’s Scientific
Advisory Board), indicated the needfulness of a mechanism for controller/skills transfer as the key property of the
autonomous a c t i v i t i e s o f t h e i r p r o s p e c t i v e s o l u t i o n s i n i . e . material manipulation and agricultural
activities due to i n v o l v e m e n t o f e v e n a fleet of diverse machines.
AutoMat consortium has a long-term expertise in predictive control of outdoor robots in tasks of workspace maintenance (the Snow Shoveling project at CTU [100, 99, 98, 47] or the Soil Moving at AALTO [41]). These control methods
of material pushing can be r e used for the preliminary phases of the project while the herein investigated novel machine
learning techniques will still be under development (and without any possibility of the skills transfer as proposed in this
section). Details on management of the skills learning activity can be found in section ??, Significant risks and
contingency plans.
B.1.1.3.2
Perception and representations for distributed mapping of dynamic 3D workspace
14
14
AutoMat
page 15 of 37
page 15 of 37
AutoMat
AutoMat addresses the problem, in which an autonomous system performs workspace modification task (material
handling) based on an a-priori plan. The plan and its’ goal in AutoMat describes a sequence of actions and the desired
high-level target state of the workspace. This plan is further conveyed to actions of robots by planning the sequential
tasks
that
have
to
be
performed
in
order
15
15
AutoMat
page 16 of 37
page 16 of 37
AutoMat
Figure B.1.2: Motivation for the learning of large and heavy construction machines through the expertise transfer among
various robotic platforms.
to reach the desired goal. A vital prerequisite for this process is t o enable recovery of the workspace state as a
spatial representation (or simply a map). Real-world workspaces are by nature 3D and therefore in order to truly
represent its state a 3D representation is needed. Moreover, by active modifying of the workspace the robots
continuously change the state of the workspace. Thus, a central property of the map should be that it can consistently
represent the workspace regardless of the changes.
16
16
AutoMat
page 17 of 37
page 17 of 37
Finally, AutoMat addresses also ouotlook to multiple robots working for a common goal.
modification
17
17
AutoMat
Generally, the workspace
AutoMat
page 18 of 37
page 18 of 37
AutoMat
application requires various construction machines and the proposed machine learning approach requires vehicles of
different sizes. It is important to point out, AutoMat project is not focused on the multi-robot cooperation and
coordination, but more than one robot can occur on work site at the same time solving either the same, or different tasks.
Imagine loaders transporting sand near to a s t o r a g e a r e a and our DAPPER machine is used for storing the
material at the dedicated place, i.e. into a heap. The loaders change part of the machine workspace which has to be
taken into account if planning tasks for DAPPER. The preceeding means that the workspace representation should
allow real-time updates by multiple robots/vehicles.
To sum up, in AutoMat we aim for a 3D model of the workspace, that is kept and updated in real-time, by
possibly multiple robots, or by a combination of robots and human-driven vehicles, T h e m o d e l i s
m a d e capable of representing the dynamics of the workspace consistently.
• Spatial representation for mapping in dynamic 3D workspace
We plan to build upon an existing spatial representation — the Normal Distribution Transform (NDT). The NDT was
originally developed in the context of 2D laser scan registration [17]. The central idea is to represent the observed
range points as a set of Gaussian probability distributions. NDT has later been extended to three dimensions in the
context of 3D r a n g e scan registration [64]. Prior work has shown that the NDT is an accurate [105] representation for
building of 3D maps, however, the primary use of the representation has so far been in modeling of a single (or
few) scan(s). In our recent contributions [96, 95], we presented an extension of the NDT framework to explicitly
model free space, introducing a novel 3D representation — the Normal Distribution Transform Occupancy Map (NDTOM). The proposed framework natively supports multi-resolution maps, as well as consistent probabilistic updates of
re-observed portions of the workspace. The resulting algorithm provides the only current solution capable of online,
real-time modeling of large scale, dynamic 3D workspace.
In AutoMat our key contributions will be to extend the NDT-OM to consistently represent different timescales
of dynamics in long-term operation.
• Distributed mapping in 3D
The second part of our main contribution is to enable efficient update of NDT-OM in the context of distributed
sensing. Having multiple robots observing the workspace using 3D sensors produces the need for communicating
the information between the entities. However, 3D sensors produce a vast amount of information, and to communicate
this information is not feasible. In AutoMat we propose to approach this by sending this information encoded as
normal distributions. We already demonstrated in [96, 95] that a NDT map can be built recursively by combining
arbitrary size sample sets. The challenge of this task is that low-resolution gaussian components cannot be fused
with high-resolution ones in a trivial manner. Thus, we propose to recreate the measurement from the low-resolution
representation by predicting the maximum likelihood measurement based on the model. This is a similar approach as
presented in [19], however, we aim to incorporate the learned model of dynamics into the sensor model. Using this
formulation the observations, coming from different sources and with different resolutions, can be consistently updated
to a global model.
• 3D Semantic Segmentation and geometrical/topological mapping
This task is to extract relevant structure from the acquired 3D maps by segmentation of the acquired 3D representations
into semantic categories including:
1) Landmarks (meaning distinctive 3D features that
mapping),
ensure
2) “Diggable” regions - places where the m a n i p u l a t e d
material can be loaded,
high-quality results in localization and
bulky
3) Paths already prepared for efficient movement of machinery,
4)Traversable terrain,
18
18
AutoMat
page 19 of 37
page 19 of 37
AutoMat
5) Other “obstacles”, etc.
The initial models used for the semantic segmentation will be acquired by a process of unsupervised learning based on
autonomously designed testing trials. Unsupervised learning of the 3D semantic segmentation will use techniques of
multi-scale grouping and categorization that are capable in detecting the courses and ridges and merging the basins in
19
19
AutoMat
page 20 of 37
page 20 of 37
AutoMat
the 3D maps and associate to the nested structures other physical classifiers to which the areas are being “diggable”,
“traversable”, “forming an obstacle” in terms of physical observables acquired by the robots. These models will be
continously refined based on the robot’s experience.
Algorithms for representation of the 3D map to obtain a meaningful model for the mission planning will be developed.
This enables to employ the map not only for the motion planning and obstacle avoidance, but also for a planning of the
given task (the material handling) and for visualization of the plans.
B.1.2
Relation to the work program
Let us explain how your proposal addresses the specific challenge and scope of the topic Robotics, as set out in the
work programme of ICT 2014 - Information and Communications Technologies.
Specific Challenge: Research implementing the Strategic Research Agenda established by the euRobotics AISBL
(the private partner in the future Public-Private partnership in Robotics) will be essential to attain a world-leading
position in the robotics market. Driven by the applications needs identified in this Strategic Research Agenda (SRA),
challenging R&D problems will have to be addressed, to make substantial progress in robots capabilities and improve
the Technology Readiness Levels (TRL) of robotics R&D. In addition, a dedicated effort is necessary to close the
innovation gap, allow large scale deployment of robots and foster market take-up. Robotics is very broad, both in terms
of technologies and disciplines it involves, but also in terms of markets and stakeholders. It is therefore essential to
address the inherent fragmentation.
In accordance with the specific challenge of this call, the project AutoMat focuses on reaching new levels of autonomy, adaptability and reliability f o r s m a l l m a ch i nes to be able to deploy the autonomous system in industrial
applications, d e l i v e r i n g t r u l y u s a b l e p r a c t i c a l s o l u t i o n t o a u t o n o m y o f a s m a l l
construction machine and to deliver this in a form of a product on the
E u r o p e a n m a r k e t . This, together with prototyping of particular autonomous skills required by end- users of
construction machines increases TRL in the broad application of autonomous handling and moving of bulky
materials. Nowadays, the complex autonomous systems for control of construction machinery are successfully
verified in laboratory conditions (achieving TRL 4). Another use area of these existing technologies is with special and
dedicated large construction machines, that require solid functional infrastructure (as i.e. GPS navigation), and runtime supervision on
expert level (detailed activity plan given, etc.), achieving higher TRL but being extremely rare due to its; complexity of use, price and
running costs. TODO: add references here The hereby proposed project aims at d e l i v e r y of a complete system
qualified through test and demonstration in an operational environment of a n industrial test polygon (i.e. a waste dump
provided by t h e V O P member of t h e c o n s o r t i u m , t h e m a n u f a c t u r e r o f s m a l l - c o n s t r u c t i o n m a c h i n e s , a n d
t h e p r i m a r y d i s s e m i n a t o r a n d u s e r o f t h e h e r e i n d e v e l o p e d a p p l i c a t i o n o f t h e t e c h n o l o g y. M o r e o v e r ,
Destro company, Zetor tractors, and other parties involved in the AutoMat advisory board will also turally have
principal access to the investigated technologies and therefore will foster their wider dissemination. The final
demonstration and the corresponding use scenario will be fulfilled to achieve TRL8 and prototyping the target
autonomous functionalities. Achievement of extraordinarily high l e v e l s o f TRL is allowed by utilization of DAPPER
s m a l l c o n s t r u c t i o n m a c h i n e - b e i n g a f r e s h a n d m a r k e t e d product of VOP - as the robotic platform for
final demonstration. Originally, DAPPER has been developed for manned operation, with outlook to future remote and
autonomous control modes, investigated and delivered in the framework of this project. Within the project, practical
implementation, use scenarios and related autonomous functionalities will be tailored directly for this machine
based on demands of the target users, market investigations as well as of the m a n u f a c t u r e r c o m p a n y
needs.
The aim of the project endeavour is to develop a cognitive autonomous system enabling robotic c o n s t r u c t i o n
m a c h i n e to operate in dynamic, worksite and real-world environments, reaching measurable improvements of
autonomy in material handling, interacting in a safe way with the work environment and humans.
The target solution of the AutoMat project will be prepared for commercialization as an extending supplement of
services and products offered together with the DAPPER construction machine b y VOP company. Relying on the
advances of the methods of intelligent mobile robotics developed in this project, the proposed system will resolve
situations not anticipated at t h e design time, that enables operation in nondeterministic, dynamic, real-life workspaces.
Moreover, a new level of autonomy will be reached due to the fact that the autonomous system may suggest
20
20
AutoMat
page 21 of 37
page 21 of 37
AutoMat
improvements to the given material handling plan with the aim to increase the efficiency, reliability and overall performance
of its act i n g in the workspace towards the given task accomplishment. T h e Cognition, in the sense of continuous
sensing of the workspace and consequent adaptive system behavior and response, is crucial to be able to act in
realistic,
nondeterministic
workspaces
as
the
afore
given
use
scenarios
definitely
are.
21
21
AutoMat
page 22 of 37
page 22 of 37
AutoMat
Furthermore, we propose to build a system, which is not purely specialized for the listed particular workspaces, but
w e o f f e r a b a s e k i t o f n o v e l m e t h o d s which may be autonomously adapted for other similar kinds of
workspaces. The robots’ workspace is dynamic in the sense of its significant modification, performed by the
autonomous system itself. Due to the large changes in the workspace, the c o m m o n state-of-the-art approaches of
the key robotic components for mapping, localization and navigation cannot be employed since the sensory
information from the workspace are not persistent. In line with the Call requirements and conditions, the project
focuses advanced robotics systems, given their potential to underpin the competitiveness of key manufacturing
sectors in Europe.
AutoMat covers multi-disciplinary R&D and innovation activities via the strong cooperation between two robotic
partners, two AI partners and two companies that act as the goal-givers and the driving force of technology
transfer via use-cases, the three market- oriented verification scenarios. The Pre-Commercial Procurement (PCP)
is built in through involvement of the VOP partner in the project consortium. This partner enables and contributes massively to
prototype development and therewith stimulates deployment of the final product. Therefore AutoMat addresses the key
abilities necessary for commercialization of autonomy in the tackled application scenarios that are driven by market
needs (the end-users specified requirements).
From the list of bottlenecks preclusive of the commercialization and crucial cognitive capabilities which are missing
today, AutoMat deals with advanced sensing, perception and sensory data understanding (WP1), learning
towards adaptation and functional improvements (WP3), reasoning and action execution at increased level of
autonomy (WP4), all followed by bringing the project outcomes to experimentally verified application scenarios and high TRL8
prototyping towards a product (WP5) The spatio-temporal cognition in real-world is exactly the scope of the WP1.
Besides, AutoMat is built on an emerging inter-disciplinary approach c r o s s b r e e d i n g i n d u s t r i a l a n d
o u t d o o r robotics with artificial intelligence, which enables to significantly extend the area of applicability of intelligent
robotic systems with massive impacts into real-life applications.
B.1.3
Concept and approach
As mentioned the AutoMat objectives and also the three application scenarios are driven by wishes of customers
of VOP partner. These scenarios, the waste repository, the bulk materials storage and the landscape reclamation
activities, are consider by the VOP company as their target applications, for which the Dapper product is designed.
Taking into account the current level of development of autonomous systems of mobile robots, a set of technological
challenges and three main scientific challenges were selected as key components necessary for reaching a level of
development necessary for deployment of autonomy into the construction machines. As mentioned nowadays, the
complex autonomous systems designed for construction machines are tested mostly in laboratory conditions (TRL 4
technology validated in lab) and some components and subsystems have been validated or demonstrated in a relevant
environment (TRL 5-6). The objective of AutoMat is to go from the laboratory condition to market and to design a
prototype of complete system qualified through demonstration in relevant environment (the industrial waste dump and
the bulk materials storage area) with middle size construction machines, the Dapper product of VOP partner (this
matches to TRL 7 system prototype demonstration in operational environment). In addition, a set of autonomous
functionalities required by VOP’s customers will be completed, qualified in the real conditions of the final experiments
and offered to end-users as an added value of the Dapper product (this activity will be realized in accordance with the
TRL 8 system complete and qualified).
TODO: All: Describe any national or international research and innovation activities which will be linked with the
project, especially where the outputs from these will feed into the project; Where relevant, describe how sex and/or
gender analysis is taken into account in the projects content.
B.1.4
Ambition
TODO: ALL partners: Where relevant, refer to products and services already available on the market. Please refer to
the results of any patent search carried out.
The AutoMat project aims to develop methodologies and techniques for an autonomous mobile robotic system capable
of material handling. The study of related literature, projects web-sites and product portfolio of key manufacturers
22
22
AutoMat
page 23 of 37
page 23 of 37
AutoMat
worldwide has shown the most important technological aspect and the key scientific objectives that need to be tackled
to achieve a state, which enables to offer such a system on market. Although the state-of-the-art techniques for
autonomous navigation, localization and mapping enable autonomous deployment of robots in large scale outdoor
workspace, they tend to fail/degrade in performance in the case of substantial changes of the robots’ workspace, which
is the case in the identified market oriented scenarios. Some of the existing techniques are well-suited for continuous
23
23
AutoMat
page 24 of 37
page 24 of 37
AutoMat
and slight changes caused by different weather and seasonal conditions, for example, but the extensive modification
of the workspace by the robotic systems itself requires completely new approaches for perception and distributed
mapping. The active modification of the workspace may not only be a handicap for the methods in intelligent robotics.
On the contrary, properly realized modifications of the workspace may increase the efficiency of employed systems, as
is demonstrated in the proposed “Bulldozer SLAM” approach, going beyond the state-of-the-art in existing active SLAM
techniques.
The second important aspect, which needs to be considered, is the future employment of large, heavy and expensive
machines for the tasks of material handling and workspace modification. Although state-of-the-art machine learning
techniques are frequently used for evolving the skills of autonomous vehicles, it is difficult to guarantee safety, mainly
in the beginning of the learning process. It is necessary to go beyond the state-of-the-art of machine learning methods
to facilitate learning of large robots that may be dangerous to their surroundings as well as to themselves.
The AutoMat consortium proposes to enforce the ideas of material handling by enhancing the state of the art with cognitive capabilities, relying on safeness in the machine learning, perception and distributed 3D mapping and Bulldozer
SLAM. These issues together with related past and ongoing projects are considered in more detail in the following
sections. All these aspects were chosen by the construction machine manufacturers (project partner VOP, member of
the AutoMat’s Scientific Advisory Board Liebherr Group) as the key elements necessary for deployment of autonomy
together with their products in the above mentioned industrial applications.
B.1.4.1
Perception and Distributed 3D mapping
Several approaches for 3D spatial modeling have been proposed and used successfully in robotic mapping systems.
Elevation maps are a 2.5D parametrization of space, obtained by associating a height value to cells organized in a 2D
grid [12]. However, due to the dimension reduction elevation maps can model only a single surface per cell, making
overhanging structures like e.g. different layers in the waste repository scenario impossible to represent correctly.
Several authors have proposed to extend the elevation grid approach by modeling space using Gaussian Processes
(GP). Assuming that each point in 2D space x = (px , py ) can be associated to a height valuef (x) = pz , the central
idea of GPs is to represent f (x) as a Gaussian Distribution in function space. The available sensor data is used in
order to learn the hyperparameters of a GP, which can then be used to perform regression for any point in 2D space
and obtain an interpolated height value, resulting in a continuous spatial model. Lang et al. [61], as well as Plagemann
et al. [82], employ a non-stationary kernel that is adapted explicitly in order to handle different surface types. In another
series of influential works, Vasudevan et al. [118] propose to use GPs with a non-stationary neural-network kernel for
modeling of large-scale outdoor terrains in the context of open pit mining. As far as we are aware, however, no GP
frameworks have been applied in the context of online mapping or in dynamic workspace.
Different approaches to 3D representation attempt to relax the functional constraint of a unique height value per location, placed by the models discussed so far. Triebel et al. [114] propose the Multi-Level Surface (MLS) map as an
extension to elevation grids which allows for multiple height values to be stored per cell. The proposed models have
been used to perform localization and navigation in an outdoor environment. More recently, Rivadeneyra and Campbell
[86] propose an extension of MLS maps that incorporates Bayesian probability fusion for terrain estimation – Probabilistic Multi-Level (PML) maps. Both MLS and PML maps, however, do not explicitly model free space or track the
consistency of the surfaces in the map and are thus only targeting static workspace.
Another approach that removes the assumption of a single height value per cell relies on a re-formulated GP framework [81]. The proposed technique, Gaussian Beam Processes, performs a regression on the range in a typical 2D
(and by simple extension also 3D) laser scan, thus assuming only a single value per sensor ray, in a manner consistent
with the physical properties of range scanners. Recently, Smith et al. [102] present a similar formulation in three dimensions that is centered around a push broom laser configuration. These GP frameworks can be used to fully represent
3D workspace and are tightly coupled with the principle of operation of 3D range sensors, but are not suitable for online
mapping and assume static workspace.
Triangle meshes are another method for spatial representation popular in the computer graphics community. Each
triangle represents a facet in a mesh, a graph of interconnected vertices. The abundance of research on triangle mesh
visualization, reconstruction, and collision checking offers a powerful incentive to utilize them in robotics. In order to
24
24
AutoMat
page 25 of 37
page 25 of 37
AutoMat
obtain the best reconstruction results from noisy point clouds, special care has to be taken in filtering and handling
of uncertainty. For example, Wiemann et al. [121] present an approach for extracting triangle meshes from noisy
25
25
AutoMat
page 26 of 37
page 26 of 37
AutoMat
registered point cloud models. The model extraction is however performed as a post-processing step and the obtained
triangle mesh is not maintained online.
Finally, recent advances [78] in dense mapping methods utilize a volumetric representation of space, using a 3D
sample grid. Each stored sample represents the value of a signed distance function (SDF), which stores the Euclidean
distance to the nearest occupied cell. SDF methods have been demonstrated to produce extremely detailed models in
small-scale office-sized workspace using fine resolution grids, but are difficult to extend to large-scale workspace.
The Normal Distribution Transform (NDT) is a compact spatial representation, originally introduced by Biber and
Strasser [17] in the context of 2D scan matching. NDT represents space as a set of Gaussian probability density
functions. NDT has later been extended to three dimensions in the context of 3D scan registration [64, 65, 106] and for
planning and traversability analysis in 3D [107].
In our recent contributions [96, 95], we presented an extension of the NDT framework to explicitly model free space,
introducing a novel 3D representation – the Normal Distribution Transform Occupancy Map (NDT-OM). The proposed
framework natively supports multi-resolution maps, as well as consistent probabilistic updates of re-observed portions
of the environment. The resulting algorithm was shown to outperform a 3D occupancy map [123] in performance and it
was shown to be the only current solution capable of online, real-time modeling of large scale, dynamic 3D workspace.
However, NDT-OM is not a model of a dynamic environment as understood in the dynamic mapping literature in 2D
[16, 93, 7, 74, 122, 73].
Progress beyond state-of-the-art: In AutoMat we propose to study a representation of the dynamic workspace.
We will build upon the previous contributions of the consortium [16, 95, 93] and extend the state-of-the-art in several
aspects: 1) we will develop a novel 3D representation, which models several time-scales; 2) the timescales will be
learned online during operation; 3) we will model the state transition parameters by tracking the changes in a local
neighbourhood; 4) we will formulate a distributed approach for map updates and 5) we will provide real-time performance. Combining our contributions to existing contributions on NDT maps will push the frontier of 3D perception in
mobile robotics significantly, enabling a whole range of applications.
B.1.4.2
Control of learning and operating risks
In reinforcement learning, quite specific approaches have been designed for robotics, mainly aimed at scalability
through providing expert’s demonstrations (inverse reinforcement learning [1] and learning by imitation [21]), aimed at
the robot safety during learning through reversibility-related criteria [75], or aimed a unified Bayesian framework to deal
with several tasks [113] including SLAM [39]. Transfer learning for reinforcement learning [110] focuses on facilitating
new skill acquisition through extending existing skills, related similar tasks or environments.
Besides the safety and mechanical hazard issues, learning in situ raises an additional difficulty, that of defining appropriate rewards guiding the robots toward the acquisition of the desired skills. Taking inspiration from active learning, an
intrinsic motivation-based approach has been proposed by [10, 11]. The idea is to provide the robot with an internal
(autonomous) reward, measured as the error decrease of the forward model. The robot, driven by its internal reward,
thus focuses on visiting the fraction of the sensori-motor space most amenable to increase the coverage and improve
the performance of the forward model.
Another simpler internal reward has been proposed by [26], based on measuring the entropy of the robot log. With
the goal of maximizing this entropy, the robot thus demonstrates “curiosity” in the sense that it thrives to visit as many
different sensori-motor patterns as possible, thereby maximizing the coverage of the forward model. A limitation of
internal rewards is that they can lead the robot to experiment unsafe behaviors. Another approach, referred to as
preference learning-based has been proposed to drive the robot toward desirable behaviors, through exploiting the
sparse preferences of the human teacher [4, 3].
A key scientific challenge tackled in AutoMat is to enable safe in-situ skill acquisition. This challenge will be tackled
in two complementary ways. The first one, remotely inspired from developmental learning, starts with a very limited
actuator range of the robot and achieves motor babbling [25]; the robot being in an “infant” state can do little but at
the same time has little chance to endanger itself or others. Gradually, the robot extends the actuator range (“muskuloskeletal growth”) as it acquires the desired skills. The critical issue is to exploit the scarce human teacher’s feedback
to acquire the criteria of danger (what should not be done) and appropriateness (what should be done).
26
26
AutoMat
page 27 of 37
page 27 of 37
AutoMat
The second approach pursues the same line of developmental learning through expertise transfer among different
robotic systems. Note that the state of the art in transfer learning for reinforcement learning [110] to our best knowledge
only considers the transfer among different tasks (formally, reward functions) or environments (state spaces). While
27
27
AutoMat
page 28 of 37
page 28 of 37
AutoMat
the transfer of a controller among two robotic platforms with same sensor systems can be formulated as an inverse
reinforcement learning problem [1], the transfer among two different robotic platforms raises new and critical issues.
Progress beyond state-of-the-art: Taking into account the robot safety besides the controller accuracy has emerged
as a strategic research topic in the last 3 years (see [76, 75] and references therein). However the characterization
of “safe controllers” proposed so far in the literature to our best knowledge is built-in and hardly general (e.g. relying
on the reversibility of the policy, which indeed makes sense for aerial vehicles). The proposed approach in opposition
is based on a most intuitive metaphor: an infant can experience the world while incurring less hazards, due to his/her
own actions, than a child.
The progress beyond the state of the art thus involves two milestones:
• Starting from a very limited but safe initial state (the “robot infant state”), the robot gradually gains power and
knowledge through limited interaction with the human teacher: the active preference-based reinforcement learning setting [4, 3] is adapted to build an internal model of accuracy and danger.
• The skill transfer among different robotic platforms with same sensor systems will be achieved using inverse
reinforcement learning; a key difficulty here is to propose an adequate representation of the search space.
B.1.4.3 “Bulldozer SLAM”- System for Simultaneous Localization and Active Mapping with Environment Modification
In order to achieve our objectives related to performance in the material handling tasks outlined in the validation
scenarios (flattening the surface to create a “road”, making a small hillock to store the material, and transportation the
material), we must develop mechanisms for the goal-directed control of robots that exploit the novel representations
and functionalities developed by the rest of the project. We use the term “cognitive control” to refer to processes that
generate behaviour in a robot based on a higher-level interpretation of the semantic content of its sensors (as opposed
to low-level feedback control).
Most frameworks for goal-directed behaviour generation in robots rely on either reactive or deliberative multi-level
planning frameworks, from the pioneering early work on mobile systems [20, 112] to state-of-the-art cognitive mobile
robots [43, 14, 109]. These systems typically reason about descriptions of space encoded in relational form (e.g.
region connectivity, object placement, human location), but all assume to be descriptions of a static world. In AutoMat
we will develop a novel cognitive control framework for leveraging the long-term experience of the robot in pursuit of its
task-level goals in a dynamic workspace, including workspace modification by the robot itself.
Autonomous exploration to complete static 2D [5] or 3D maps [27] has been well-studied. For those applications of
static mapping well-established measures of information gain have been proposed and successfully employed [52,
103]. Another strand of research in robotic mapping is concerned with the addition of semantic information, where recent approaches build hierarchical representations featuring objects, categorised locations, and rooms [55, 117]. Such
semantic information was exploited for the problem of efficient visual search for objects by combining semantic mapping
and efficient planning [9]. Related approaches have also been applied to the problem of terrain classification for outdoor mobile robots [63]. In AutoMat, we will develop novel approaches for map exploration in dynamic workspace based
on information-gain driven view-point recommendation for dense scene reconstruction during workspace modification.
Additionally we will exploit the semantics of the workspace, by semantic segmentation of the acquired 3D models into
categories such as “landmarks”, “diggable regions”, “paved paths”, “traversable terrain”, etc, which are meaningful
and useful to the robot in achieving its high-level goals. These categories can be considered as “affordances” which
suggest a particular action, and will be learned and adapted based on the robot’s own experience.
Some previous works in robotics have exploited the principle of stigmergy [49], a mechanism that allows the coordination of actions within the same agent or across different agents by means of traces left in the workspace. Beckers
et al. [13] developed a system where a group of mobile robots gathered a set of randomly distributed objects and
clustered them into a pile, and also compared the results to similar clusters observed in ant colonies. Johansson and
Saffiotti [50] introduced a stigmergetic approach to robot navigation in which the robot creates and exploits a navigation
map which is stored in an array of RFID tags embedded in the floor. Our approach is related to stigmergy-based approaches because the robots in AutoMat will effectively create their own “landmarks” by favouring actions that produce
salient 3D features which enhance the robustness and accuracy of 3D mapping and localisation.
28
28
AutoMat
page 29 of 37
page 29 of 37
AutoMat
Progress beyond state-of-the-art: In AutoMat we propose to develop novel approaches for cognitive control of mobile
robots enabling “Bulldozer SLAM” for workspace modification. We will thus create a system that merges continual
29
29
AutoMat
page 30 of 37
page 30 of 37
AutoMat
planning-based control (to achieve the material handling plans obtained through interaction with human supervisors),
path planning using the representations obtained through the above-mentioned perception and distributed 3D mapping
capabilities, and robust execution mechanisms leveraging the skills acquired as action primitives for achieving the
anticipated workspace modifications. The innovations of AutoMat beyond the state of the art will include the following
aspects:
• We will extend the previous work on map exploration by enabling a robot to understand and predict dynamic
changes in 3D maps during workspace modification. Through conducting its own experimental trials the robot
will learn to predict the effects of its actions in modifying the workspace, and further learn which actions are likely
to achieve the best performance.
• We will extend the previous work on autonomous mapping by developing novel approaches for map exploration
in dynamic workspace based on information-gain driven view-point recommendation during workspace modification, and also by exploiting the semantics of the workspace, through segmentation of the acquired 3D models
into categories such as landmarks, diggable regions, traversable terrain, etc.
• We will further extend the previous work on robot map building by incorporating self-modification of the workspace
to actively create useful landmarks for increasing map quality and reducing localisation errors and uncertainty
during online operation.
B.1.4.4 Autonomous material handling
Finaly, let us mention state of the art of systems of autonomous material handling and their level of development
regarding market applicability.
TODO AALTO: please add references regarding material handling SOTA (e.g. related to your AVANT machine)
TODO UoL: please add references regarding material handling in agriculture
Progress beyond state-of-the-art: TODO All
B.1.4.5
Relation to other EU projects
There are many ongoing or finished EU projects related to the research areas investigated in AutoMat. The number
of related projects is even higher due to the necessary inter-disciplinarity and involved technologies from different
scientific fields. However, the proposed project is mostly related to ICT and cognitive robotics. Selected ongoing (and
a few finished) EU projects are listed in Table B.1.1, where their main research areas (also related to AutoMat) are
described.
The table shows that projects dealing with autonomous robotic systems tend to focus rarely on learning activities and
only few of them are focused on adaptation areas (ASCENS, CoCoRo, PANDORA, SYMBRION and REPLICATOR).
We will describe these projects in detail later on. Besides, we have selected several EU projects (SYMBRION, REPLICATOR, PeLoTe, DARWIN, NIFTY, LIVCODE, LOCUST, EYE2, STRANDS) with strong links to the AutoMat consortium to summarize possible sources of knowledge and scientific achievements that will be exploited within the AutoMat
project. We should mention that mainly due to this strong foundation of scientific achievements on which the ambitious
research activities of the AutoMat project may be built, the AutoMat budget remains within such a reasonable size.
The mentioned projects also significantly contributed to the endeavour of relatively high Technology Readiness Level
of autonomous mobile robotic systems designed for outdoor applications. It allows us to bring the level of development
in this area close to commercialization within such a short period.
Let us start the detailed summary of the most related project with SYMBRION and REPLICATOR, which appeared in
both lists due to their high relevance as well as strong inter-connections with the AutoMat consortium.
Symbrion & SYMBRION Enlarged EU
The flagship project of the EU in the area of evolutionary robotics aimed to develop a modular system of multiple robots
with an ability to form complex organisms for solving challenging tasks. The key idea of the project was to simulate
the evolution of complex robots from simple cells on-board of robots in real time and to try out in hardware realization
30
30
AutoMat
page 31 of 37
page 31 of 37
AutoMat
only effective results from the simulations. The AutoMat consortium have direct access to the results achieved in
Symbrion since two partners participate in both consortiums. Particularly, CTU will build on its scientific results in
31
31
AutoMat
page 32 of 37
page 32 of 37
AutoMat
Table B.1.1: Related ongoing and past EU projects with indications of the research areas considered.
Acronym
ALLOW
ASCENS
Adaptation
Learning
×
×
×
×
IM-CLEVER
MASH
NIFTi
×
×
PELOTE
PROMETHEUS
RACE
REFLECT
REPLICATOR
×
×
×
ROSETTA
SCOVIS
×
SEARISE
SF
×
×
×
×
×
×
×
SFLY
SHOAL
SPARKII
SYMBRION
Sensors
HMI
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
HUMOUR
PANDORA
Autonomous
Arch.
×
×
EYESHOTS
FRONTS
Perception
×
×
CO3 AUVS
CoCoRo
Cognitive
(Machine)
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
areas of environment modelling and simulations of complex robots behaviour. INRIA will utilize its achievements in
the area of machine learning and active learning that are used for robots motion evolution in Symbrion, similarly as
in AutoMat, where the deployment of robotic system in demanding outdoor environments is tackled. In comparison
with AutoMat, Symbrion was project of basic research with low TRL and its final verification scenario was realized in
structured laboratory conditions.
Replicator
Robotic evolutionary self-programming and self-assembling organisms were developed in Replicator with a direct correlation with Symbrion. In Replicator, intelligent, reconfigurable and adaptable “carriers” of sensors (sensor networks)
were built, and algorithms and novel approaches for cooperation within multi-robot system were developed.
Three AutoMat partners were involved in the Replicator project. Almende may further exploit their achievements in
mission planning of the modular systems into the approach for planning of the landscape plan realization. INRIA takes
benefit of the knowledge of learning mechanisms for self-assembling of organisms. CTU may contribute with its experience of autonomous navigation in dynamic environments. Similarly as Symbrion, also the Replicator achieved low TRL
in comparison with AutoMat, which is strongly focused on commercialization of project achievements. Although the
AutoMat project brings new challenges of real-world out-door deployment and the gained knowledge in the Replicator
and Symbrion projects may be used only partly, the whole consortium may take benefits of these expertises and also
the planned effort in particular WPs may be significantly lower than if starting from scratch.
CoCoRo
The Collective Cognitive Robots (CoCoRo) project aims to create an autonomous swarm of interacting, cognitive
robots (a swarm of autonomous underwater vehicles - AUVs). Although the used robots are different from the robots
32
32
AutoMat
page 33 of 37
page 33 of 37
AutoMat
considered in AutoMat, both projects share similar ideas of environment mapping. CoCoRo researches the potential
of cognition-generating software, which is supported by a suitable hardware concept. CoCoRo aims to develop an
33
33
AutoMat
page 34 of 37
page 34 of 37
AutoMat
embodied and distributed system of AUVs using collective cognitive capabilities derived from animals (e.g., social
insect societies). Thus, the project differs from AutoMat in the idea of the considering swarm of identical simple robots
in CoCoRo, in AutoMat rather more complex entities and highly heterogeneous units are considered. And of course the
possibility of active environment modifications is not considered in CoCoRo. Nevertheless, research streams in both
projects will be coordinated to maximise the overall outcome.
PANDORA
The PANDORA project (Persistent Autonomy through Learning, Adaptation, Observation and Re-planning) aims to
develop new computational models for reducing frequency of assistance requests, i.e., to make the robots persistently
autonomous. Three directions are investigated in the project to achieve this goal. The first is description of the
world using new probabilistic semantic representations. The second is directing and adapting intentions towards plan
adaptation under resource constraints. The third direction is robust acting learning methods and robot control. Again,
Pandora project differs mainly in lower TRL and therefore challenges associated with commercialization of robotic
system are not solved there. However, project;s achievements will be monitored and considered in AutoMat, especially
towards reliable navigation as it is also desired to have the required data acquisition persistently autonomous.
PeLoTe The PeLoTe project (funded under FP5) was targeted on concepts of creating the presence feature via remote
control and navigation, as well as knowledge sharing in combined communities of living and nonliving entities. Besides
other particular goals, the project aimed to study navigation algorithms in challenging environments, processing data
from standard as well as unique sensors (photonic mixed device camera) and research methods of cooperation, collaboration, and communication within a multi-robot team during the exploration of an unknown environment. Moreover,
graphical user interfaces were developed allowing to share information and knowledge from diverse origins and of different nature. As CTU coordinated the project and AALTO participated in it, the achieved experience will be utilized in
the above mentioned areas. Finally, several experiments were performed in a training facility of fire-fighters that gave
the project team members the experience with performing test in non-standard environments, which is also the case for
AutoMat and the new consortium will build on these results. The obvious difference between both projects represents
absolutely different application and again much lower TRL in PeLoTe.
DARWIN and NIFTY
Human-robot interaction is the subject of projects DARWIN and NIFTY, in which CTU participates. In the DARWIN
projects, two key tasks are solved: efficient interaction between assembling robots and human operators; and the
development of a cognitive architecture for learning from the interaction. The dexterous robots employed in the project
have to perform an assembly task. At the beginning, the robots operate using a detailed plan prepared by humans
and use its cognitive architecture to learn results of the actions. Later, a high level plan is given to the robots, which
translate it to a detailed one using the learned skills. Here, the robots are learned using the interaction with the
humans and objects in the environment. While assembly robots are employed in the DARWIN project, the NIFTY
project focuses on tasks in Urban Search & Rescue, where human-robot teams work in dynamic environments. Teams
of mobile autonomous cognitive robots coordinate their action during missions with humans. The robots learn from
actions performed by humans and adapt to their behaviours. This helps them to operate autonomously and even
estimate behaviors of the humans, which improve efficiency of the cooperation. NIFTi investigates how a cognitive
robot can complement models of its own capabilities and situation awareness, with cognitive user models of task load
and workflow. Although the approach of the projects to the active existence of robots in environments and the targeting
scenario are different from AutoMat, the gained expertise will be used to design efficient ways for communication
between robots and human supervisors, and for creating a common model (language) for exchanging modifications of
plans of material handling. Thanks to these preliminary achievements, it was not necessary to plan an additional effort
in AutoMat for these particular tasks.
LIVCODE, LOCUST and EYE2E
The multi-sensor fusion required for the all weather sensing will be built on achievements and knowledge obtained
in the LOCUST project, in which UoL partner participates. In this project, a novel architecture for bio-inspired visual
perception system has been developed. These results from LOCUST will be partially utilized in AutoMat as a foundation
of the development of the novel approach to the SLAM problem.
For the safe navigation in the outdoor environment, a robust collision detection system is required. It can be implemented using various sensor systems, which are studied in the LIVCODE project (UoL participates). The method
34
34
AutoMat
page 35 of 37
page 35 of 37
AutoMat
developed in the LIVCODE project is inspired by animal vision systems and provides a robust solution for intelligent
mobile systems. To allow fast image-based collision avoidance, a dedicated hardware was developed in the EYE2E
35
35
AutoMat
page 36 of 37
page 36 of 37
AutoMat
project, where UoL participated. The collision detection is realized using a neural network system, which is implemented on a VLSI chip. This allows fast computation and adaptation of the collision detection systems. Moreover, such
a system is more energy efficient. In AutoMat, some preliminary knowledge from EYE2E will be exploited for inclusion
of vision into the complex sensory system for the outdoor experiments.
STRANDS
The IP project STRANDS ”Spatio-Temporal Representations and Activities for Cognitive Control in Long-Term Scenarios”, in which UoL participates, aims to enable robots to achieve robust and intelligent behaviour through adaptation
to, and the exploitation of, long-term experience. The project will develop novel approaches to extract quantitative and
qualitative spatio-temporal structure from sensor data gathered during autonomous operation. The project will also
develop control mechanisms which exploit these structures to yield adaptive behaviour in highly demanding, real-world
security and care scenarios. Although the STRANDS project is concerned with indoor environments, it is expected that
the parallel development of cognitive control and outdoor mapping techniques for the highly dynamic environments in
36
36
Download