Qualitative Representations for Robots: Papers from the AAAI Spring Symposium A Qualitative Representation of Social Conventions for Application in Robotics Diedrich Wolter Frank Dylla and Arne Kreutzmann Smart Environments, Information Systems and Applied CS University of Bamberg, Germany Cognitive Systems, SFB/TR8 Spatial Cognition University of Bremen, Germany Abstract presented that are capable of interacting with individuals or groups of people, putting forward the prominent example of the museum tour guide robot RHINO (Burgard et al. 1999). Systems were designed to pass people (Pacchierotti, Christensen, and Jensfelt 2005) or approaching groups of people (Althaus et al. 2004) in an adequate manner. In (Shi et al. 2008) the change of velocity near other pedestrians is considered, whereas in the Companion framework more general joint human-robot navigation is considered (Kirby 2010). Taking a closer look at these existing social robot systems, they all have in common a behavior that is directly based on quantitative parameters tailored to specific tasks, i.e., they are not aware of their spatial context but their actions are hardwired to the control system. By contrast, social behavior as described in the social sciences involves abstract concepts of space and time that vary across social status, culture, etc. It requires observation and interpretation to align oneself with the current context. Not until awareness of conventions is tackled, a robot will be able to recognize that it is interfering with human activity unintentionally. Viewed from the perspective of intelligent agent design, abstract declarative representations of social conventions could offer appropriate means for such reasoning. Linking such abstract representations to robot control is a difficult endeavor though. With our work we aim to bridge the gap between abstract descriptions of spatio-temporal patterns and robot navigation techniques. The contribution of this paper is to describe how patterns of socially acceptable behavior can be represented using qualitative concepts of space and time. The representation obtained can then provide the basis on which a robot can achieve awareness of social context. We first give a system overview of how the approach integrates with a robotic software architecture. Section 3 reviews the concept of spaces (proxemics) and navigation conventions which are assumed to be socially acceptable. Section 4 shows how spatial knowledge present in the navigation conventions can be captured with qualitative representations. Using these spatial primitives we construct the new logic QLTL (Linear Temporal Logic with Qualitative Spatial Primitives), discuss required reasoning techniques and their realization. Section 5 presents a modeling of the socially acceptable navigation behaviors presented earlier and how they are processed in Section 6. Finally, Section 7 outlines handling of real-world situations on a robotic platform. Acceptance of everyday robots will largely depend on their ability of social interaction. Patterns of socially acceptable behavior have been characterized in social science by means of abstract concepts of space and time. This makes integration with robot navigation a challenging problem. Moreover, an integration should allow the robot to build awareness of these patterns as in reality there will be misunderstandings a robot should be able to respond to. The contribution of this paper is a representation of navigation patterns that is based on qualitative representations of space. We present a logic with clear semantics for specification of navigation patterns and show how it allows several conventions of social interactions to be represented. We also outline integration with robot navigation and give examples with real world situations. 1 Introduction Robots becoming part of our everyday life is a vision that is widely shared among researchers in the robotics community. Aside from technical challenges in design and development of such robots, we also need to acknowledge that such a system would be part of our human society. Acceptance of robots is henceforth also depending on their social acceptance. How does the robot interact with humans? Social interaction goes beyond goal-directed human-robot interaction, it also comprises casual or emergent interaction, for example, how a robot traverses a populated area. Shall it take advantage of its advanced motion and path-planning capabilities to rush through even the most crowded places? We are motivated by the assumption that acceptability of robots will depend on acknowledging established patterns of social interaction. Lindner and Eschenbach claim that people would feel offended if a robot cuts path in front of them (Lindner and Eschenbach 2011). Other patterns may be more imperative. For example, humans recognize and acknowledge that others have queued up in a waiting area and they line up as well, not trying to overtake and press forward if space permits. Programmers realizing a robot’s navigation component need thus to include patterns of socially acceptable behavior. In recent years several robots have been c 2014, Association for the Advancement of Artificial Copyright Intelligence (www.aaai.org). All rights reserved. 34 proxemics or social spaces (Hall 1966; Hall et al. 1968). high-level controller 3.1 Social Spaces Hall distinguishes four different kinds of spaces: social rule base KB Kinect LRF local pp motor global pp localize intimate space touching, hugging, close conversation, personal space interaction with well known people, social space interaction with people, and public space people in this region are ignored in general. features Additionally, for each region Hall defines a close and far sub-region to distinguish social interaction on a finer level of granularity. The nested structure of the spaces is depicted in Figure 2. Hall does not limit public space in its extend and the 7.6m relates to the close public space. Based on his work further investigations were carried out, e.g. asymmetries of personal spaces (Ashton and Shaw 1980), intercultural differences of spaces (Burgess 1983), or on which side people may pass in case walking paths are intersecting (Bitgood and Dukes 2006). Furthermore, Frith and Frith (2006) considered how these findings of social conventions help to predict other people’s behavior. Nevertheless, not much work has been presented on what people exactly do and why they behave like that. That is, the connection between qualitative descriptions and actual behavior remains unclear. In addition to Hall’s view other kinds of spaces which influence social behavior can be distinguished (Lindner and Eschenbach 2011). Affordance Spaces are based on possible actions an agent could perform in that space (see e.g. (Galton 2010), e.g. a park affords to play football in. In contrast activity spaces are defined on the basis of which actions are actually performed in them, for example a space a football game is actually played (see e.g. (Ostermann and Timpf 2007; Kendon 1990). Territory Spaces are regions which are claimed by a single or group of agents. In general, boundaries or extent of territories are marked with markers that are perceivable by potential intruders (e.g. (Lawson 2001)). The last space to mention is Penetrated Space describing spaces which are influenced by other activities, for example, the space one’s voice is perceivable (auditory) or the olfactory penetrated space of a barbecue. Although, the discrimination of all kinds of spaces is important, we restrict ourselves in this work to social spaces as defined by Hall for reasons of simplicity of the examples. In general, our approach is capable to deal with all kinds. A first formalization of social spaces for robotic systems has been presented in Lindner and Eschenbach (2011), purely based on mereo-topological knowledge that is axiomatized within the highly expressive Situation Calculus. In contrast to this approach, we make use of qualitative concepts to link robotic perception with logic primitives. Also, we opt for a more restricted logic that nicely integrates with robotic architectures. ... Figure 1: System overview, showing where social conventions could be integrated into a layered architecture. 2 Approach On the technical level, we develop a new representation language that combines the temporal logic LTL (Pnueli 1977) with qualitative spatial calculi (see Section 4). Based on our representation language, we formalize several conventions of socially acceptable navigation. We discuss how the new logic can easily be integrated with existing navigation components to ensure socially acceptable navigation behavior. For conceiving system integration we adapt the approach of a layered architecture as used successfully in many robotic systems, e.g. for museum tour guide robots (Burgard et al. 1999; Thrun et al. 1999). In the most general interpretation it consists of three layers (cp. Figure 1): 1) the low-level hardware control layer for any kind of sensor or actuator (e.g. wheels or camera), 2) the intermediate navigation and (pre-)processing layer (e.g. path planning and feature detection) and 3) the high-level reasoning and learning layer (e.g. Golog as in case of (Burgard et al. 1999)). The main organizational criteria is that on the lowest level data is completely quantitative and on the highest level abstract symbolic representations are predominant. We adapt this kind of architecture. Our robot setup involves a laser range finder (LRF) for obstacle detection and a Kinect camera for people detection. On the intermediate layer we consider the standard navigation modules. Socially aware navigation mainly affects the local path planning (’local pp’ Figure 1), but for example due to unavailability of socially acceptable paths, different routes on the global level may need to be considered. In this work, we focus on the interaction with local path-planning only. Within the navigation layer we propose adding a new module, which we call social (behavior) module, consisting of a declarative rule base of social conventions, means to keep track of rules applicable in the current situation, and interaction to the path-planning module. The declarative nature of the knowledge eases integration with the high-level controller to foster awareness of the social context. 3 3.2 Classification of Social Conventions In (Dylla, Coors, and Bhatt 2012) a classification of social navigation conventions is presented. It comprises five (not necessarily disjunctive) categories mainly discriminated on the basis of spatial configurations and the number of people involved. The authors don’t claim this taxonomy to be Social Conventions in Human Navigation Human navigation is spatial interaction of individuals based on social conventions (Kirby 2010). Hall described this interaction on the basis of spaces around individuals called 35 Personal Space < 1.2m (4 ft) Intimate Space < 0.45m (1.5 ft) Social Space < 3.6m (12ft) (a) in general moving on the right of a walkway, or (b) no running in a library These conventions may have to be adapted regarding cultural background. For example, in Great Britain pedestrians would evade to the left and overtaking should take place on the right side. Although not complete these kinds of conventions must be formalized in order to enable robots to be aware of these conventions and behave accordingly. (close) Public Space < 7.6m (25ft) Figure 2: Social Spaces as described by Hall (1966) 4 Representation of Coarse Knowledge Social conventions are given in natural language and thus in a vague, coarse, and imprecise manner. By means of qualitative descriptions one can focus on distinctions between objects that make an important and relevant difference with respect to a given task (Kuipers 1994). The field of Qualitative Reasoning (QR) is concerned with capturing such distinctions about objects in the real world, also considered as commonsense knowledge, with a limited set of symbols, i.e. without numerical values (Cohn and Hazarika 2001). These distinctions are captured by relations, which summarize indistinguishable cases into a single symbol. For example, in most cases it is sufficient to refer to the color ‘red’ as it is not of importance whether it is slightly lighter or darker than some prototypical red. Qualitative SpatioTemporal Representation and Reasoning (QSTR) is a subfield of QR, where the underlying physical structure of the domain can be exploited for performing well defined reasoning. In general, topological (e.g. part of) and positional relations can be distinguished (Freksa and Röhrig 1993). Positional relations can be subdivided into orientation (relative: e.g. left, right; and absolute: e.g. south, north) and distance calculi (e.g. close, far). A set of relations together with operations on them is called a qualitative calculus. Qualitative calculi are based on partition schemes of the underlying domain. The set of all possible relations between two spatial entities (or three in case of ternary calculi) is categorized into a set of atomic relationships called base relations (BR) which either represent themselves meaningful relations for the task at hand or which allow these relations to be obtained by means of union of base relations. Since relations are ordinary sets, set-theoretic operations are applicable to qualitative relations. Algebraically, qualitative calculi are related to relation algebras in the sense of Tarski (Dylla et al. 2013). For the purpose of this paper it is sufficient that a qualitative calculus allows us to model binary (or ternary) relations between spatial entities using unions of base relations. Since base relations are defined by a partition scheme, they are naturally disjoint and thus conjunction would be useless. The most widely considered knowledge representation for qualitative calculi is constraint-based. Given a set of variables X = {x1 , . . . , xn } and a set of base relations BR = {b1 , . . . , bm }, a knowledge base consists of constraints (xi {bi1 , . . . , bik } xj ) which say that spatial entities xi and xj are in relation bi1 ∪ . . . ∪ bik . Qualitative spatial reasoning then provides us with (calculi-specific) algorithms to decide whether a constraint-satisfaction problem (CSP) consisting of such constraints is satisfiable or not (Renz and Nebel 2007). The test of satisfiability also allows new con- (a) pedestrians approaching (b) overtaking a with the same head-on evading to the right direction on the left (c) following another pedes- (d) yield for another pedestrian trian in a narrow passage near a passage entrance Figure 3: Four examplary conventions of pedestrian navigation. The triangle depicts the robot and the circle some other agent. complete as, e.g., non-spatial discriminators like gender and cooperative behaviors like ‘ladies first’ are not considered. The five categories and some of the conventions are: 1. approaching head-on or from behind (a) pedestrians approaching in head-on both have to evade to the right (Figure 3(a)) (b) pedestrians moving in the same direction have to be overtaken on the left side (Figure 3(b)) 2. crossing situations (a) other (not necessarily moving) pedestrians on the left or right may be passed in their front with some distance, (b) may be passed in their back (with smaller distance), or (c) the agent lets the other pedestrians pass (by stopping). Here the other agent has to move. 3. bottlenecks or narrow passages (a) follow other pedestrians in a narrow passage (no overtaking possible) (Figure 3(c)) (b) ’crossing’ other pedestrian at narrow passage (c) yield to others near passage entrance (Figure 3(d)) 4. interaction with groups (a) evading or passing a group on the outside (b) crossing large groups (if passing is inappropriate or not possible at all) 5. individual constraints , i.e. conventions very dependent on the context an agent is in, e.g. 36 0 0 disconnected DC partial externaly connected EC overlap PO 2 1 3 equal EQ A 6 8 10 9 7 15 12 B 8 0 14 13 11 10 9 2 3 4 1 B A 15 12 11 10 5 6 14 13 7 8 9 ~ 4∠5 B ~ and A ~ 4∠3 B. ~ Figure 5: OPRA4 relations A 13 tangential non-tangential non-tangential proper proper part proper part inverse TPPI part NTPP inverse NTPPI provide the means to represent a configuration, the representation of a convention also involves inscribed behavior. Conventions inscribe behavior within a certain context, typically triggered by an event or associated with a process (e.g., keep left when standing on an escalator). Among the different options to represent behavior, we choose to apply a temporal logic that links spatial configuration knowledge (snapshots) to temporal sequence. This approach has several nice implications: Figure 4: The eight base relations of RCC-8. straints that follow from a given set of constraints to be inferred, similar to how resolution of logic formulas allows for deduction. An implementation of the methods described above is available via the qualitative spatial reasoning toolbox SparQ (Wolter and Wallgrün 2011)1 . In our formalizations we apply the topological Region Connection Calculus (RCC-8) and the Oriented Point Reasoning Algebra (OPRAm ) (Moratz 2006). • Employing a temporal logic to connect qualitative representations of snapshots allows static as well as dynamic spatial knowledge to be represented with the same vocabulary of spatial concepts. • Like any standard logic, it allows non-spatial and nontemporal knowledge to be represented aside the spatiotemporal knowledge using propositional symbols. RCC-8: The Region Connection Calculus Topological distinctions are inherently qualitative in nature and they also represent one of the most general and cognitively adequate ways for the representation of spatial information (Renz, Rauh, and Knauff 2000). Based on this inherent qualitative nature different qualitative calculi were developed, among them the Region Connection Calculus (RCC-8) (Cohn et al. 1997) which is based on a binary connection relation C(a, b) denoting that region a is connected to region b. Exploiting the connectivity of regions eight base relations are defined (see Figure 4). • Last but not least, temporal logic has been integrated with motion planning and robot control (Kress-Gazit and Pappas 2010; Ding et al. 2011; Kloetzer and Belta 2010) and recognition of processes (Kreutzmann et al. 2013). Of course, this approach has limitations. A technical limitation are missing quantifiers, thus the approach requires a predetermined maximum number of distinct objects to consider. A potential knowledge modeling limitation can be seen in binary truth evaluation of classic logics, e.g., if handling graduated, fine-grained concepts. OPRAm : A Relative Orientation Calculus In OPRAm relative orientation is partitioned into 2m equidistant angular cones and their separating lines with an intrinsic orientation. Within this work we apply granularity m = 4. For simplicity, these are numbered from 0 (intrinsic orientation) to 15. The relation is formed by a tuple of the relative orientation i of object B wrt. object A and vice versa (j), usually written as A 4 ∠ji B. In Figure 5 (left) relation A 4 ∠513 B is depicted. This directly relates to a linguistic expression like ”B is ahead to the right of A moving in the same direction”. If point positions coincide relations are only determined by the segment number s of A the orientation of B falls in, e.g. A 4 ∠3 B in Figure 5 −jm (right). We abbreviate complex relations by m ∠ji11−i = n jm j in ∨i=i1 ∨j=j1 m ∠i with i, j ∈ Z4m , ∗ abbreviates 0 − 4m. 5 7 1 5 13 4 5 tangential proper part TPP 15 2 5.1 QLTL: Linear Temporal Logic with Qualitative Spatial Primitives For representing social conventions we require temporal sequence knowledge, i.e. which situation occurs before/after another. This can be achieved using a lean logic like Linear Temporal Logic (LTL) (Pnueli 1977). LTL is a modal logic that extends propositional logic with modal operators which connect statements about snapshots (called worlds in modal logic) using dedicated modal operators. In order to integrate this logic with spatial primitives, we choose to encode qualitative spatial relations in terms of propositional symbols and we extend the semantics to also include a spatial semantic. Since conventions also depend on the types of objects involved in a certain configuration (person, obstacle, robot, etc.), we also introduce sorts to retain category information in logic formulae. This leads to the following syntax definition for our logic QLTL. The ingredients are: Formalization of Social Conventions We have argued that a representation of socially adequate behavior is essentially based on qualitative concepts of spatial configurations. While qualitative constraint networks • a set of spatial symbols S. Let s be a number of sorts, then Si = {si,1 , si,2 S, . . .}, i = 1, . . . , s are sets of spatial symbols and S := i=1,...,s Si 1 available from http://www.sfbtr8.uni-bremen.de/project/r3/ sparq/ • R = {r1 , . . . , rn } is a set of qualitative relation symbols 37 In this work, we define the set of qualitative relation symbols Q to be the union of OPRA4 and RCC-8 relations. This allows us to represent the social conventions. Since we will obtain an interpretation from the perception of the robot, we can assume it to be spatially consistent. • F = {f1 , . . . , fl } be a set of function symbols • G = {g1 , g2 , . . .} is a set of propositional symbols for representing general, non-spatial knowledge • The set of propositions P is defined as P := G ∪ {r(s, t)|r ∈ R, s, t ∈ (S ∪ {f (si )|f ∈ F, si ∈ S})}. The idea underlying this definition is to use natural notation of qualitative relations so that it is possible to represent spatial knowledge by a single propositional symbol. Propositions are either describing non-spatial facts (G) or some spatial relation r between two objects s and t which can either be sorts or some sort dependent aspect f (si ). For example, NTPP(h, sec(r)) that a human h is standing in the security range sec of the robot r. Formulae in QLTL are then defined recursively: • p is a formula for every p ∈ P • If φ is a formula, so is ¬φ • If φ, ψ are formulae, so is φ ⊗ ψ with ⊗ ∈ {∧, ∨, →, ↔} • If φ is a formula, so is M φ with M ∈ {◦, 2, } • If φ, ψ are formulae, so is φ N ψ with N ∈ {U, R} The semantics of QLTL are similar to LTL, i.e., an interpretation establishes an ordered sequence of worlds. Within each single world, all propositional symbols are mapped to truth values true and false, inducing the interpretation of formulae composed with logic conjuncts (∧, ∨, . . .). For convenience of notation QLTL includes function symbols for relating the individual regions established by an agent, e.g., social space, personal space, etc. The semantics of a function f is a mapping S → S of spatial symbols. In other words, functions are used as shorthand notations for the respective spatial symbols. In QLTL we further require interpretations within all worlds to be spatially consistent, i.e., sensor interpretations must not be contradictory. This defines the spatial semantics of QLTL. Within one world, the interpretation of all (spatial) propositions r(s1 , s2 ) with si ∈ S or si = fj (s) for some fj ∈ F, s ∈ S induces a qualitative constraint network with variables S and according constraints ri (x, y) where x, y are either the spatial symbols s1 , s2 or the symbols obtained by application of fj (si ). The spatial semantics of a relation r is defined by the respective qualitative calculus. Functions F are also assigned with a respective spatial semantic, e.g., mapping the position of a human to its estimated personal space. We say that an interpretation is spatially consistent if, first, all induced constraint networks are consistent and, second, if all mappings in F respect their spatial semantics, e.g., personal(h) is the region that determines the personal space of h as defined by the function personal. The semantics of modal operations connect distinct worlds within a sequence, identical to the sematics of LTL: ◦φ (next) φ holds in the following world 2φ (always) φ holds in the current and in all future worlds φ (eventually) φ holds in a future world, (φ ↔ ¬2¬φ) φ U ψ (until) φ holds at least until eventually ψ holds, but they dont have to hold at the same time φ R ψ (release) ψ holds until and including the world in which φ first becomes true 5.2 Conventions as QLTL formulae We introduce a convenient notation for representing conventions as QLTL formulae. Conventions considered in this work essentially come in an “if-then-until” flavor in the sense that observing a certain event or process triggers (start) a sequence of required configurations (effects) to achieve a behavior in accordance with the regulation until an end state or a break criteria is reached. The end criteria is reached if everybody behaves as expected while the break criteria prevents the system from getting stuck in a rule if something unexpected happens, e.g. a person involved does not behave as expected by turning around and moving away. Break criteria might be a timeout or if involved persons leave the public space of the agent. For reasons of space we do not consider break criteria in further detail. Conventions can easily be represented as QLTL formula if we pursue a declarative description of regulation-compliant behavior using the pattern φstart → ◦ φeffect U (φend ∨ φbreak ) (1) We note that QLTL is expressive enough to allow conventions to be represented which are not effective. If, for example, the sub-formula specifying the trigger condition would refer to a future situation, than it may not be possible to decide whether the trigger condition is satisfied. Roughly saying, we want to exclude conventions of the kind “if this will turn out to be wrong, don’t do it”. Deciding effectiveness of a convention is beyond the scope of this paper—we assume that the conventions are stated in a way that allows trigger conditions to be detected at the time they apply. We point out an inconvenience of directly using modal logic for knowledge representation, namely its lack of variables. The pattern introduced in Equation 1 can involve propositional symbols only. As a consequence, for a convention that says how to avoid an obstacle, we require separate formulae, one for each individual obstacle. To this end, we introduce a shorthand notation for conventions that supports variables. In the following we write x1 : s1 , . . . , xn : sn : φ with variables x1 , . . . , xn of respective sorts s1 , . . . , sn , V meaning xi ∈si ,i=1,...,n φ0 with φ0 obtained by substitution of xi for the respective spatial or propositional symbol. 5.3 QLTL Representation of Social Conventions We now exemplarily formalize social conventions from Class 1 and 2 (see Section 3.2) in QLTL. Throughout this section we use r to denote the robot (r : robot), and h for humans (h : human). In the following we use x and y to denote objects of any of these sorts.2 2 Remark: these symbols need different interpretations regarding the relation they are used for. For RCC-8 they need to be interpreted as regions, e.g. the space covered by an object (obj(x)), and for OPRA4 as oriented point (opos(x)). For brevity we omit this distinction in the presented formalizations. 38 To refer to the social spaces constituted by a human h we use the functional notation intimate(h), personal(h), social(h), public(h) respectively. On the syntactical level these functions denote special symbols for referring to the regions, whereas on the semantic level the specific region must be interpreted based on the physical object specified by h. In order to improve readability of formal conventions we define some macro relations to abbreviate complex relations. First, describing that two agents are in a head on situation: HEAD ON(x, y) := x 4 ∠15−1 15−1 y or are moving in the same direction: SAME DIR(x, y) := 7−9 x 4 ∠15−1 7−9 y ∨ x 4 ∠15−1 y. Next we define x being on the left or right side of y: ON LEFT(x, y) := y 4 ∠∗2−6 x, ON RIGHT (x, y) := y 4 ∠∗ 11−13 x and x being in front or behind y: IN FRONT(x, y) := y 4 ∠∗15−1 x, BEHIND(x, y) := y 4 ∠∗5−11 x. Finally, by OVERLAP we define that there is a partial or complete overlap: OVERLAP(x, y) := PO(x, y) ∨ TPP (x, y) ∨ NTPP(x, y) ∨ TPPI (x, y) ∨ NTPPI (x, y). Using these primitives we can formalize convention 1a as follows: φ1a start φ1a effect := := 6 ON 6.1 := (2) LEFT (h, r) R BEHIND (h, r) BEHIND (h, r) observations |= φstart ⇔ φeffect U φend becomes active (4) As discussed in (Kreutzmann et al. 2013) this task of model checking can be accomplished by first assigning propositional symbols to truth values according to the robot’s observation and then applying an ASP solver (answer set programming) to search a model for a given precondition formula. In this work we are however involved with a more complex logic that additionally involves object sorts. During model checking one needs to ensure that a model also respects the correct association of object sorts, i.e., humans in the formula can only be matched to humans, obstacles to obstacles, etc. A straightforward yet sufficient solution can be realized in ASP-based model checking. ASP supports relational knowledge that allows sort knowledge like “person(x)” to be denoted. Doing so for convention formulae as well as for observation ensures correct association. However, we are only interested in spatially consistent interpretations of the propositional symbols and filter out all spatially inconsistent models, as provided by SparQ (Wolter and Wallgrün 2011). Filtering out spatially inconsistent models in a subsequent step to model checking can lead to large amounts of models that need to be rejected. Therefore, we propose to introduce an intermediate step that responds to partial observations. Prior to grounding propositional symbols with observations, QSR is applied for constraint propagation in order to make implicit knowledge explicit. For example, we might observe two facts: two persons are standing in front, one to the left looking to the right and one to right looking left. Here, constraint propagation in OPRAm would reveal the relation between both persons would be looking at one another. Enriching observations prior to model checking can thus help recognizing whether a convention’s precondition is met. We are aware OVERLAP (r, social(h)) ∧ (ON φ2a effect φ2b effect φ2c effect 2 φeffect φ2end LEFT (h, r) ∨ ON RIGHT (h, r)) := ¬PO(r, personal(h)) ∧ IN FRONT (r, h) := ¬PO(r, intimate(h)) ∧ BEHIND (r, h) := stop(r) U (3) BEHIND (r, h) := φ2a effect := BEHIND (h, r) ∨ φ2b effect Detecting Applicability Testing convention applicability requires perception of the robot to be matched against the precondition of a convention φstart . The convention is applicable if the observations allow for a model of φstart . We are thus confronted with the task of model checking where observations provide (partial) knowledge and the task is to decide whether this partial model can be extended to a spatially consistent interpretation that makes φstart come true: This means, if the robot enters or is in the social space of another agent h and they are head on, the robot must not move into the personal space of h. Furthermore, h has to be on the left of r, thus r needs to turn right until r has passed h, i.e., r is behind h. Convention 1b can be modeled similar except that they are in SAME DIR(r, h) instead of HEAD ON(r, h) and r has to overtake on the left (ON RIGHT(h, r)). All conventions of class 2 are covered by the following: φ2start Processing QLTL Conventions Given a set of QLTL formulae representing the social conventions known to the robot, the two tasks are to (1) identify that the preconditions of a pattern are met and (2) to determine actions which are admissible with respect to the pattern. Although the two tasks seem different at first sight, they both can be tackled with the method of testing convention applicability. OVERLAP (r, social(h)) ∧ HEAD ON (r, h) := ¬PO(r, personal(h)) ∧ φ1a end Dealing with class 5 is straightforward. If the robot is in a specific context, e.g. a library l (φstart = OVERLAP(r, l)), he has to adapt his behavior, e.g., switch to quiet mode (φeffect = q mode(r)), until he is not in the context anymore (φend = ¬φstart ). ∨ φ2c effect If the robot enters the social space of h on the left or right, r has three options. Either he passes h in the front with not entering the personal space, pass behind h with a smaller distance, i.e. it is allowed to enter personal but not the intimate space, or he can stop until the human has passed. For brevity we only sketch the main aspects to consider formalizing the three remaining classes. In case of a narrow passage (class 3) we need to represent an overlap of obstacles with a space of the agent to interact with, e.g. a wall to the left in the social space of h: OVERLAP (w, personal(h)) ∧ ON LEFT (w, h). For dealing with conventions regarding groups (class 4) we need to redefine the spaces with respect to the individuals involved. One approach is to define the group space as the union of all individual spaces, e.g. social(g) := ∪h∈g social(h). 39 Kinect Laser Range Finder Fused View Situation Description NTPP(obj(robot), social(person 1)) OPRA(opos(robot), opos(person 1), {0, 1}, {0, 1, 15}) =⇒ head on(robot, person 1) Figure 6: From the sensor readings to the knowledgebase. Using a kinect and a laser range finder a human is detected and the positions and extends of his social spaces are determined, resulting in the knowledge base on the lower right. Due to uncertainty of the sensor readings, the OPRA4 relation of robot 4 ∠11 person is coarsened to robot 4 ∠15−1 0−1 person and returned by our system as OPRA(opos(robot), opos(person 1){0, 1}, {0, 1, 15}). that due to noisy sensor data qualitative relations might be computed which do not map reality exactly, e.g., jitter at relation transition. For reasons of space we neglect a detailed consideration here and refer to work like (Dubba, Cohn, and Hogg 2010) where sensor data is interpreted robustly by a pre-computation step. 6.2 gers Convention 1a, but it cannot be executed since there exists no path in the free space, that would not cross the persons personal space. Thus the robot will stop until the world changes such that the convention is not applicable any more, i.e., the start condition is not met any more. 8 Planning Admissible Actions As mentioned earlier we reduce action planning to checking convention admissibility. The idea, as discussed in (Wolter, Dylla, and Kreutzmann 2011), is to link the applicability check to a planner. Whenever the planner generates or extends a partial plan, these plans are checked for admissibility, discarding them if they do not match the convention. This kind of integration is easy with many planners like randomized road map planners or lattice-based planners. In order to check that a plan satisfies the inscribed effects of convention φstart → ◦ φeffect U φend , we simply need to modelcheck φeffect U φend with the plan generated by the planner, treating the intermediate configurations of the plan like observations when checking convention applicability. Essentially, this step is the same as process recognition with LTL formulae as described in (Kreutzmann et al. 2013). 7 Conclusions and Future Work The ability to respect social conventions is key for public acceptance of shared human-robot environments. Ultimately, the robot requires the ability to reflect on the social conventions, e.g., enabling it to decide on applicability of certain conventions. A promising approach towards such awareness is to pursue a declarative, abstract representation of conventions that supports abstract deliberation as well as integration with navigation components of a robotic system. By proposing the qualitative spatio-temporal logic QLTL we indicate how such abstract representation can be constructed. Qualitative primitives in the representation provide the important link between robot navigation and abstract logic. By adjoining qualitative spatial reasoning techniques from the SparQ toolbox with answer set programming (ASP) we obtain effective means to reason about QLTL formulae, in particular to recognize applicability of conventions. In future work we aim to exploit the flexibility of declarative convention specification in order to allow the robot to adapt to situations dynamically, in particular to handle necessary convention violations adequately. Examplary Case Study In this section we outline a robot implementation of our approach using a SICK LMS200 laser range finder and a Microsoft Kinect RGBD camera mounted to an Active Media Pioneer 3-AT mobile robot. The Kinect sensors allows recognition of humans and estimating their position and orientation. The orientation estimate in our implementation is not very crisp, but as can be seen in Figure 6 our approach can handle such uncertainty. The laser scanner is used to detect obstacles and determine the free space. Currently, only local reactive behavior is employed. In the example shown in Figure 6 a person is about to enter an office, which the robot is trying to leave. This trig- Acknowledgments Financial support by the Deutsche Forschungsgemeinschaft in the Transregional Collaborative Research Center SFB/TR 8 Spatial Cognition project R3-[Q-Shape] is gratefully acknowledged. References Althaus, P.; Ishiguro, H.; Kanda, T.; Miyashita, T.; and Christensen, H. I. 2004. Navigation for human-robot interaction 40 Kirby, R. 2010. Social Robot Navigation. Ph.D. Dissertation, Robotics Institute, Carnegie Mellon University, Pittsburgh, PA. Kloetzer, M., and Belta, C. 2010. Automatic deployment of distributed teams of robots from temporal logic motion specifications. IEEE Transactions on Robotics 26(1):48–61. Kress-Gazit, H., and Pappas, G. J. 2010. Automatic synthesis of robot controllers for tasks with locative prepositions. In IEEE International Conference on Robotics and Automation, 3215–3220. Kreutzmann, A.; Colonius, I.; Wolter, D.; Dylla, F.; Frommberger, L.; and Freksa, C. 2013. Temporal logic for process specification and recognition. Intelligent Service Robotics 6(1):5–18. Kuipers, B. 1994. Qualitative Reasoning: Modeling and Simulation with Incomplete Knowledge. The MIT Press. Lawson, B. 2001. Language of Space. Taylor & Francis. Lindner, F., and Eschenbach, C. 2011. Towards a formalization of social spaces for socially aware robots. In Egenhofer, M. J., and et al., eds., COSIT, volume 6899 of LNCS, 283– 303. Springer. Moratz, R. 2006. Representing relative direction as a binary relation of oriented points. In Brewka, G., and et al., eds., 17th European Conference on AI, 407–411. Italy: IOS Press. Ostermann, F., and Timpf, S. 2007. Modelling space appropriation in public parks. Proceedings of the 10th AGILE International Conference on Geographic Information Science. Pacchierotti, E.; Christensen, H. I.; and Jensfelt, P. 2005. Embodied social interaction for service robots in hallway environments. In Corke, P. I., and Sukkarieh, S., eds., FSR, volume 25 of Springer Tracts in Advanced Robotics, 293–304. Springer. Pnueli, A. 1977. The temporal logic of programs. In Proceedings of the 18th annual symposium on foundations of computer science (FOCS), 4657. Renz, J., and Nebel, B. 2007. Qualitative spatial reasoning using constraint calculi. In Aiello, M.; Pratt-Hartmann, I.; and van Benthem, J., eds., Handbook of Spatial Logics. Springer. 161–215. Renz, J.; Rauh, R.; and Knauff, M. 2000. Towards cognitive adequacy of topological spatial relations. In Proc. of Spatial Cognition II, 184–197. Springer. Shi, D.; Jr., E. G. C.; Donate, A.; Liu, X.; Goldiez, B.; and Dunlap, D. D. 2008. Human-aware robot motion planning with velocity constraints. In McQuay, W. K., and Smari, W. W., eds., CTS, 490–497. IEEE. Thrun, S.; Bennewitz, M.; Burgard, W.; Cremers, A. B.; Dellaert, F.; Fox, D.; Hähnel, D.; Rosenberg, C. R.; Roy, N.; Schulte, J.; and Schulz, D. 1999. Minerva: A secondgeneration museum tour-guide robot. In ICRA, 1999–2005. IEEE Robotics and Automation Society. Wolter, D., and Wallgrün, J. O. 2011. Qualitative spatial reasoning for applications: New challenges and the SparQ toolbox. In Hazarika, S. M., ed., Qualitative Spatio-Temporal Representation and Reasoning: Trends and Future Directions. Hershey (PA), USA: IGI Global. Wolter, D.; Dylla, F.; and Kreutzmann, A. 2011. Rulecompliant navigation with qualitative spatial reasoning. In Proceedings of the 4th Intl. Robotic Sailing Conference. Springer. tasks. In IEEE International Conference on Robotics and Automation, 1894–1900. Ashton, N. L., and Shaw, M. E. 1980. Empirical investigations of a reconceptualized personal space. Bulletin of the Psychonomic Society 15(5):309–312. Bitgood, S., and Dukes, S. 2006. Not another step! economy of movement and pedestrian choice point behavior in shopping malls. Environment and Behavior 38(3):394–405. Burgard, W.; Cremers, A. B.; Fox, D.; Hähnel, D.; Lakemeyer, G.; Schulz, D.; Steiner, W.; and Thrun, S. 1999. Experiences with an interactive museum tour-guide robot. Artificial Intelligence 114(1-2):3 – 55. Burgess, J. W. 1983. Interpersonal spacing behavior between surrounding nearest neighbors reflects both familiarity and environmental density. Ethology and Sociobiology 4(1):11–17. Cohn, A. G., and Hazarika, S. M. 2001. Qualitative spatial representation and reasoning: An overview. Fundamenta Informaticae 46(1-2):1–29. Cohn, A. G.; Bennett, B.; Gooday, J. M.; and Gotts, N. 1997. RCC: A calculus for region based qualitative spatial reasoning. GeoInformatica 1:275–316. Ding, X. C.; Smith, S. L.; Belta, C.; and Rus, D. 2011. LTL control in uncertain environments with probabilistic satisfaction guarantees. arXiv:1104.1159v2 [math.OC]. Technical Report accompanying IFAC 2011. Dubba, K. S. R.; Cohn, A. G.; and Hogg, D. C. 2010. Event model learning from complex videos using ilp. In Coelho, H.; Studer, R.; and Wooldridge, M., eds., 19th European Conference on AI, volume 215, 93–98. IOS Press. Dylla, F.; Mossakowski, T.; Schneider, T.; and Wolter, D. 2013. Algebraic properties of qualitative spatio-temporal calculi. In Conference On Spatial Information Theory (COSIT13). North Yorkshire, UK: Springer. Dylla, F.; Coors, M.; and Bhatt, M. 2012. Socially compliant navigation in crowded environments. In Spatial and Temporal Dynamics Workshop at ECAI 2012. Freksa, C., and Röhrig, R. 1993. Dimensions of qualitative spatial reasoning. In Carreté, N. P., and Singh, M. G., eds., Proceedings of the III IMACS International Workshop on Qualitative Reasoning and Decision Technologies – QUARDET’93, 483–492. CIMNE Barcelona. Frith, C. D., and Frith, U. 2006. How we predict what other people are going to do. Brain Research 1079(1):36–46. Galton, A. 2010. The formalities of affordance. In Spatial and Temporal Dynamics Workshop at ECAI 2010, 1–6. Hall, E. T.; Birdwhistell, R. L.; Bock, B.; Bohannan, P.; Diebold, A. Richard, J.; Durbin, M.; Edmonson, M. S.; Fischer, J. L.; Hymes, D.; Kimball, S. T.; Barre, W. L.; Lynch, Frank, S. J.; McClellan, J. E.; Marshall, D. S.; Milner, G. B.; Sarles, H. B.; Trager, G. L.; and Vayda, A. P. 1968. Proxemics [and comments and replies]. Current Anthropology 9(2/3):pp. 83–108. Hall, E. 1966. The Hidden Dimension, Man’s Use of Space in Public and Private. The Bodley Head Ltd. Kendon, A. 1990. Conducting Interaction: Patterns of Behavior in Focused Encounters. Studies in interactional sociolinguistics. Cambridge University Press. 41