A Representation Scheme for Managing Complex Knowledge

advertisement
A Representation Scheme for Managing Complex Knowledge
Abstract. A representation scheme called CKR-1 is introduced to deal with the challenges of
representing complex knowledge. Complex knowledge is defined as deep knowledge concerning a
complex object, event, situation or process. The nature of complex knowledge is identified from samples
of knowledge drawn from the insurance industry. These examples are used to highlight the efficacy of
the representation scheme as well as its limitations. CKR-1 is a type of semantic network that supports
class/sub-class and several types of relationships. It derives its expressivity from ability to support
abstractions, elaborations and alternative points of view. Despite this, the scheme suffers from
limitations stemming from the difficulty in expressing abstract concepts.
Keywords: knowledge representation, complex knowledge representation, domain engineering,
knowledge engineering, knowledge management, insurance industry.
1
Introduction
The basis for intelligent behavior in knowledge-based systems lies in the explicit representation of knowledge.
But despite decades of research, knowledge representation remains a standing challenge in the field. Initially,
researchers in the ‘70’s and ‘80’s developed schemes such as frames, rules, logic and semantic networks. These
were subsequently developed and refined in the context of domains such as natural language, qualitative
physics, medicine and software development [1]. The idiosyncracies of the domain dictated the type of
representation scheme employed. For e.g. Schank developed and used scripts for processing natural language.
He introduced a specialized language to describe actions in scripts such as “P TRANS P to POST OFFICE,” “P
MBUILDS LINE POSITION.” Since the objective of the representation is to achieve intelligent behavior in a
specific domain, issues in reasoning associated with these problems were also explored. In many cases,
inferencing dominated the research rather than the representation per se. Researchers attempted to address
problems such as selective inheritance (or subsumption), non-monotonic reasoning, belief revision, probabilistic
reasoning and temporal reasoning [ibid]. In recent times emphasis of researchers has shifted in many cases
completely into procedural approaches with emphasis on neural networks [2], case-based reasoning [3] and into
multi-paradigm approaches utilizing perhaps rules, frames and objects [4]. The semantics of the representation
have not been fully addressed.
Ontologies are also a recent development and define terminologies and
relationships in a specific domain. Their main purpose is to facilitate knowledge sharing and interoperability of
information systems [5] rather than deep reasoning. While numerous intelligent applications have been
developed, the emphasis has been on applying existing techniques in new areas (see for example [6]). The
conceptual and epistemic levels in Brachman’s terminology [7], i.e. the primitives and their relationships have
not been addressed adequately for the business domain. Knowledge engineering is still a challenge in this
respect and especially more so for complex types of knowledge. We will define complex knowledge (CK) as
deep knowledge associated with a complex object, event, situation, idea or process.
Examples of such
knowledge include ‘design knowledge,’ ‘decision knowledge,’ ‘legal knowledge’ etc. In this research, we will
focus on how to represent such knowledge. The knowledge should be of sufficient depth such that an intelligent
agent is able to answer questions about the domain from the resulting knowledge base.
2
Relevant Literature
Seminal work in knowledge representation was carried out in the ‘70’s and ‘80’s. Current literature in this area,
especially as it pertains to CK is sparse for reasons already mentioned. There seem to be two approaches to
knowledge engineering which can be thought of as “top-down” and “ bottom up.” The first is to identify models
of the domain and to implement them using a well known representation scheme [8]. This is the method
primarily used for Expert Systems [9]. ‘Pictorial Knowledge Representation’ for example, uses a domain model
to represent geometric figures such as ‘triangles’ and ‘polygons’. The figures are represented as ‘nodes’ in a
classification network with separate ‘property nodes’ describing properties such as “the sum of interior angles of
a triangle must be 1800” [10] . The scheme supports image recognition of geometric shapes.
In the second approach, the representation itself is part of an application or an environment. KL-One is the most
well-known of these [11]. KL-One arranges “concepts” in a classification hierarchy from the most general
(“thing”) to the most specific (e.g. “Red Ferrari” ). Classes and instances are referred as “general concept” and
“individual concept.” The individual concept needs to be linked to a general concept to be valid. Properties of
concepts are modeled with “roles” (e.g ‘color’ of a car) that are inherited from the parent concept (“vehicle”).
Concepts can be formed from more general concepts by restricting roles. Allowable fillers (“Role sets”) are
defined by the “Value Restriction” property of a role.
For e.g. ‘part’ of a ‘car’ could be the role set
‘door’,‘engine’, ‘trunk’. Thus an enhanced frame-based representation is employed to describe the structure of
concepts. Relationships between roles bind instances to specific roles e.g. ‘John’ as a ‘driver’ of the ‘red
Ferrari’. The most interesting feature from the point of view of CK are “Description Wires,” used to make
assertions of the type “John drove the Red Ferrari at the Grand Prix.” The assertions utilize sets of linkages
between concepts for example, between ‘Ferrari’ and ‘car’ and between ‘race’ and ‘grand prix’ and are stored
outside the representation. Despite its power, the linkages between concepts and roles render it difficult to
partition the resulting network and to comprehend the knowledge in the knowledge base. Not surprisingly,
recent work has shifted to the linguistic level, in the form of ontologies. KL-One has inspired other major and
minor schemes including Telos [12] and KRS [13]. An interesting feature of Telos is the representation of
instances of an object and its properties as predicates e.g. [Car] [Ferrari, red, john]. Recently, an extensive
attempt has been made with multi-layered semantic nets to represent natural language [14]. Multi-nets address
the conceptual and epistemic levels by including an extensive set of objects, events and relationships as well as
constructs for representing quantification. The network is also noteworthy in separating the intensional from the
extensional representation. Despite its power, the representation is unwieldy for business realms. In a different
domain, Zarri proposed a representation language NKRL, whose basic constructs are ‘events’ and ‘concepts’.
‘Concepts’ are objects that can occur in ‘events.’ ‘Events’ and ‘concepts’ are placed in separate classification
hierarchies. Events have a case-frame structure (e.g. ‘SUBJ’, ‘OBJ’, ‘TIME’) and are manually entered from
news stories.
Some relationships between events are defined such as Event 1 causes Event 2.
The
representation is utilized in information retrieval [15]. A similar scheme is proposed by Gomez who uses event
relationships (“sim” – simulataneous, “sprec” – strong precedence) to model complex processes [16]. Yet
another approach, AEI-3 has been proposed recently. AEI-3 uses semantic networks and makes use of two node
types (“class,” “instance”) and two link types (“structural,” “descriptive.”) to represent large volumes of
administrative knowledge (for example, “Manugistics is a client of BSS” or “the van leaves BSS at 11:00 am.”)
[17]. It overcomes some of the traditional limitations of semantic nets such as tractability, but is a minimalist
design owing to the relative simplicity of administrative knowledge. The basic ability echoed in all schemes is
to model concepts and relationships with substantial variations in the way either are dealt with. A few methods
include propositions as well.
Except for Multi-nets the conceptual and epistemic levels are not developed
adequately.
3
The Nature of Complex Knowledge
There is a paucity of literature concerning the nature of complex knowledge. Lacking empirical evidence, we
will hypothesize some characteristics based on samples from a text book by [18]. In their foreword, they state
“The American Institute for Chartered Property Casualty Underwriters and the Insurance Institute of America
are committed to expanding the knowledge of professionals in risk management, insurance, financial services,
and related fields through education and research.” Thus their comments establish the rationale for using the
text as an example of complex knowledge. A few representative samples are illustrated in Table 1.
Table 1. Samples of Complex Knowledge [18]
Item#
1.
2.
3.
4.
5.
6.
7.
Example
Property includes real property and personal property. Real property is land, buildings and other property
attached to it. §1.6.
A liability loss exposure is any condition or situation that presents the possibility of a claim alleging legal
responsibility of a person or business for injury or damage suffered by another party. §1.6.
Types of insurers include stock insurers, mutual insurers and reciprocal exchanges. §1.11.
To be insurable, a loss should have a definite time and place of occurrence… § 1.8
Underwriting expenses include acquisition expenses, general expenses, premium taxes and licenses § 3.8
Depreciation is allowance for physical wear and tear or technological or economic obsolescence §6.14.
A contract of good faith is an obligation to act in an honest manner and to disclose all relevant facts §7.7.
Note: “§” refers to section numbers, there are no page numbers in the cited reference.
Individual instances of CK in the insurance domain appear to exhibit one or more of the following
characteristics:
a)
They describe objects, events, actions, situations, concepts, objectives or policies (item #1, 3).
b) Objects are generally concrete such as “automobile,” “property,” and “underwriter.” Concepts are
abstract such as “loss,” “depreciation” and “indemnify.”
c)
They elaborate or define a concept as in item#2 and 5.
d) The elaboration imposes additional conditions or restrictions (item #4).
e)
Concepts involve other concepts or objects that may or may not be explicitly defined (item#3).
f)
Objects and concepts are related to other concepts/objects through structural, axiomatic, mathematical
or logical relationships (item#5).
The major challenge is to represent abstract concepts since they are complex, amorphous and often defined in
terms of other concepts/objects that are challenging to represent in a meaningful manner. Consider, “The board
of directors consists of elected officials.” The ‘board of directors’ (BOD) and ‘elected officials’ are complex
concepts. that are defined easily, but what about ‘elected officials’?.
4
Knowledge Engineering for Complex Knowledge
The objective of the representation is to serve as a foundation for knowledge-based systems and knowledgemanagement systems [17]. Since visual representations facilitate this task, we are committed to one that has a
graphical notation. Visual representations will be user friendly provided they are intuitive and not overly
complex. This places certain restrictions on the epistemology. Secondly, the representation ought to provide
sufficient representational mechanisms or representational adequacy so that knowledge may be stored and
queries, answered [19]. Here, we will focus mainly only on intensional knowledge (definition) although it will
be seen that the knowledge can be readily used in inferencing and hence satisfying inferential adequacy [ibid].
We also assume that syntactic elements can be enforced at the implementational level, so the current focus is on
the logical and epistemic levels. In view of the nature of CK we will impose further requirements on the
representation scheme.
Complex concepts such as premium are defined in terms of other concepts such as ‘insurance coverage’ which
may themselves be complex. Therefore it is convenient to refer to high level concepts without having to redefine
the underlying concepts, leading to the requirement of supporting abstractions. The ability to demarcate
concepts, known traditionally as network partitioning [20], will ensure modularity. Another issue that arises is
the multiplicity of definitions. A single concept has alternative viewpoints making this a requirement as well.
For example, insurance could be described as a business or as an obligation to provide coverage. Relationships
among concepts can be simple (concrete) or complex (abstract). A class-subclass relationship is an example of a
simple relationship. Abstract relationships are complex because they are qualitative, involve multiple concepts
and involve complex conditions. Thus another requirement is to model both simple and complex relationships.
5
A SCHEME FOR COMPLEX KNOWLEDGE REPRESENTATION - CKR-1
We will refer to our scheme as CKR-1 in honor of KL-One [11] and in recognition of the fact that no solution is
likely to be complete or elegant. The scheme is in development. Following discussion from the previous
section, the scheme will be graphical and will have constructs to model objects, events, activities, situations,
concepts and associations.
Physical objects and concepts are not distinguished in CKR-1 – both are treated uniformly and are modeled by
rectangles with name of the concept/object as the label. Rectangles could also denote variables and these are
represented with the name of the variable starting with a capital letter as in “Value,” “Time.” We will use the
generic term ‘object’ to refer to objects and concepts, ‘event’ to refer to events, actions and situations and
‘concept’ to refer to objects and events. Events are represented by rectangles with a vertical bar towards the left
and marked as “E,” “A” or “S” to denote “events,” “actions,” or “situations” respectively. Because of the nature
of CK, attributes of objects are not given importance, so there are no extensive semantics here. Our philosophy
is that there can be a separate and independent mechanism at the implementation level supporting storage and
retrieval of routine properties of objects. ‘Interesting’ properties of objects are modeled by property links
described subsequently. Objects, concepts and events, situations and activities can be atomic or complex. A
‘disaster’ for e.g. is a complex concept since it could be caused by fire, earthquake etc. and could result in
property loss. On the other hand, ‘fire’ and ‘earthquake’ are simple concepts from the point of view of the
domain. A special construct that is provided is a “named” object or event which is a user-defined concept that is
useful in concept definition as well as in modeling abstractions. For example, loss could be defined as a
situation where an object loses value. The same construct also serves as a mechanism to partition the network
as well as to express propositions.
There are two relationship types, ‘structural’ and ‘descriptive’. The structural relationships are “class-subclass,”
“part-subpart” and “has-a” (‘is-a,’ ‘p-sp’, ‘has-a’). Descriptive relationships are modeled with “rel: <name>”
label on the link where <name> is the name of the relationship. This allows for any arbitrary relationship
without semantic baggage. There are also standard relationship types such as logical relationships (‘GT’, ‘LT’..),
Causal (‘CL’), Temporal (‘TE+’ and ‘TE-‘), probabilistic (‘PR’), business relationships (example ‘payer payee’
– ‘PP’, ‘legal obligation’ – ‘OB’) and case relationships (‘Subj’, ‘obj’..) that are part of descriptive relationships.
Structural relationships are distinguished from descriptive only for the benefit of the user. Additionally, from
the point of view of inferencing, generic reasoning mechanisms would suffice for standardized relationships, but
interpreters will need to provide individual reasoning for non-standard relationships. Properties are modeled
with a “p:” link. An asset could have a property, “value” modeled with a “p: value” between “asset” and a
variable denoting value (e.g. “Value”). A required property is modeled with “rp:” link. Consider, the board of
directors consists of elected officials.
Here the required property is that officers must be elected.
To
accommodate required properties, we will use the “rp:” link type. Since a concept could be related to more than
one concept, we use an oval to represent non-significant relationships between two objects and a double oval for
conjunctive relationships. A dotted double oval would mean the same thing for disjunctive relationships. Thus
some logical propositions could be made with the scheme. There is a special link “elaboration” or “e:”, that
uses the same standard relationship types, used primarily for elaborating on properties of a relationship. It also
serves as a crude mechanism for conditions and quantification.
s:is_a
mortgaged
asset
rp: owned
:
by
mortgagee
bank
s : is_a
individual
business
Fig 1. Modeling simple concepts with CKR-1
asset
We will illustrate CKR-1 with examples of knowledge. For example a mortgaged asset could be defined as an
asset for which some percentage is owned by a bank i.e. rest is owned by the mortgage. As shown in Figure 1,
mortgaged asset is connected by “s:is_a” structural link to asset. It has the required condition that it should be
partly owned by a bank. There is an “rp: owned by” link between the mortgaged asset and the owners which in
this case are both the bank and mortgagee. Note the use of ovals for multiple arguments.
A more abstract concept such as insurance coverage is difficult to represent. Insurance coverage is the legal
obligation of an underwriter to compensate the insured in the event of a loss. The concept is challenging
because it involves a number of abstract concepts such as “legal obligation,” “compensate,” and “loss.” It needs
to be represented as three assertions:
‘Compensate-1’: Insured suffers loss. It is understood that the loss amount is quantified.
‘Compensate-2’: Underwriter compensates insured for the loss amount.
Coverage: Compensate1 and Compensate2.
The first two propositions are shown in Figure 2. ‘Compensate-1’ is represented as a user-defined ‘event type’.
The insured has a ‘relationship’ with ‘loss’ i.e. the ‘subject’ of the ‘loss’. The ‘loss amount’ is a required
property for this concept; it refers to a variable ‘Loss amount’. ‘Compensate-2’ is similar. ‘Underwriter’ and
‘Insured’ share a ‘Payer-payee’ relationship with an ‘elaboration’ link (‘e:’) on the relationship to show the
compensated amount. The reader must note that ‘loss’, ‘insured’ and ‘underwriter’ are complex concepts.
‘Loss’ will be represented as a degradation of ‘state’ of an ‘entity’ - which could be a person, object, animal etc.
This method while awkward, neatly partitions concepts and keeps relationships manageable; it can be used for
more complex concepts such as ‘salvage rights.’ However, concepts such as ‘liability insurance’ defined as the
obligation to pay on behalf of the insured for damage or injury to others’ property for which the insured is
legally responsible will challenge the representation.
6
Conclusions
CKR-1 is an initial attempt to model complex knowledge in business domains. It derives its power from the
ability to define arbitrary concepts in terms of other concepts using the ‘object’ types, relationship types and the
concept definition facility. Given this power, it may be easy to define contradictory knowledge. Knowledge
consistency is therefore an issue. There was an explicit concern for simplicity to ensure that the scheme does
not suffer from the weight of its own features. It is for this reason that record type structures for objects and
relationships among these were not included. Properties of interest could be modeled with the “p:” and “rp:”
links. Also we started out with a limited set of constructs that could be subsequently expanded. For instance,
there is no construct for a conditional assertion. Being completely graphical it is expected that a system based
on CKR-1 would be easy to comprehend and at the same time easy to implement.
We have allowed
object/event properties to be concepts. At the implementation level, the property of an object when modeled as
a concept should be treated as an ‘alias’ or ‘derivation’ of the actual property. One could also conceive of an
‘extensional layer’ for the representation that has assertions about instances of the definitions in CKR-1. All of
these are obviously future directions for the research.
COMPENSATE-1
rel: subj
E
insured
loss
rp: loss
amount
Loss amt.
COMPENSATE-2
rel: pp
E
insured
underwriter
E: payment
amount
Loss amt.
Fig 2. Modeling complex concepts with CKR-1
References
1. Brachman, R.J. The Future of Knowledge Representation. Proceedings of AAAI90 1082-1092 (1990).
2. Kurfess, F.J. Neural Networks and Structured Knowledge: Knowledge Representation and Reasoning.
Information Sciences 13, 5-13 (1999).
3. Zeng, Y., Wang, R., Zeleznikow, J. & Kemp, E. A Knowledge Representation Model for the Intelligent
Retrieval of Legal Cases. International Journal Of Law And Information Technology 15, 299-319 (2006).
4. Shaalan, K., Rafea, M., Rafea.: A, KROL: a Knowledge Representation Object Language on Top of Prolog.
Expert Systems With Applications, 15, 33-46, (1998).
5. Welty, C.: Ontology Research. AI Magazine 24, 11-12, (2003).
6. Vranes, S., Stanojevic, M.: Design Knowledge Representation in Prolog/Rex, Engineering Applications of
Artificial Intelligence 12, 221-228, (1999).
7. Brachman, R.J. On the Epistemological Status of Semantic Networks. Associative Networks Representation
and Use of Knowledge by Computers, 3-50, (1979)
8. Morik, K. Acquiring domain models. International Journal of Man-Machine Studies, 26(1), Jan 1987, 93104.
9. Hayward, S. A., Weilinga, B. J., and Breuker, J. A.: Structured analysis of knowledge, Int. J. Man-Machine
Studies 26, , 487-498, (1987).
10. Lee, E.T.: Pictorial Knowledge Representation using Pictorial Semantic Networks, 6, 155-160, (1988).
11. Brachman, R.J., Schmolze, J. G.: An Overview of the KL-ONE Knowledge Representation System,
Cognitive Science 9, 171-216 (1985).
12. Mylopoulos, J., Borgida, A., Jarke, M., Koubarakis, M.: Telos: Representing Knowledge About Information
Systems, ACM Transactions on Information Systems, 8, 325-362 (1990).
13. Marcke, K. V., Jonckers V., Daelemans, W.. Representation aspects of knowledge based office systems. In
Esprit 87, Achievements and Impacts. New York: North Holland, 1226-1238. (1987)
14. Helbig, H.: Knowledge Representation and the Semantics of Natural Language. Springer-Verlag (2005).
15. Zarri, G. P.: A Knowledge Engineering Approach to Deal with Narrative Multimedia Documents. Lecture
Notes in Computer Science, 2244, 363 (2001).
16. Gomez, F.,. “A representation of complex events and processes for the acquisition of knowledge from
texts.” Knowledge Based Systems, 10, 237-251 (1998).
17. Amaravadi, C. S.: Knowledge Management for Administrative Knowledge, Expert Systems, 25 (2), 53-61,
May (2005).
18. Luthardt, C. M., Wiening, E. A. and Fraolli, M. J.: Property and Liability Insurance Principles, American
Institute for Chartered Property Casualty Underwriters / Insurance Institute of America Malvern, PA, fourth
edition (2005).
19. Davis, R., Shrobe, H., and Szolovits, P.: What is a Knowledge Representation? AI Magazine, 14(1), 17-33
(1993).
20. Hendrix, G. G. (1979). “Encoding knowledge in partitioned networks,” in Findler, Nicholas, Associative
Networks: Representation and Use of Knowledge by Computers, Academic Press: New York, 51-92, (1979).
Download