Knowledge Acquisition and Modelling Knowledge Acquisition and Elicitation Ref: Knowledge Acquisition in Practice: A step by step guide, Milton, Springer-Verlag Knowledge Engineering Transfer View Human knowledge transferred to knowledge base =>knowledge exists and is accessible Typically interviews and task execution and observation used for KA End result set of rules that exercise knowledge made explicit Modelling View Need to build models Incremental, evolutionary process Model is an approximation of reality Models are subjective unstructured interview interviews KA Typology semi-structured interview structured interview natural techniques observation techniques group meetings questionnaires card sorting three card trick rep grid technique KA techniques limited time contrived techniques constrained tasks limited information 20-questions commentating teach back laddering process mapping modelling techniques concept mapping state diagram mapping Natural Techniques Interview Techniques Knowledge engineer asks questions of the expert or end user. Essential method for acquiring explicit conceptualisations and knowledge, but poor for tacit knowledge. Variations: Unstructured interview Semi-structured interview Free flowing, used in early stages of elicitation, can produce basics of knowledge domain, basically broad chat Main technique for elicitation Pre-defined questions sent to expert prior to interview, supplementary questions asked at interview. Can be used as part of validation. Structured interview Pre-defined set of questions, can simply be filling in a questionnaire at the interview. Interview Techniques Dependent on The questions asked Ability of the expert to articulate the knowledge Model built on knowledge elicited during interview Model reviewed by the expert Modelling Techniques Modelling Techniques Use of knowledge models with experts Used as validation and refinement Can show a basic model to an expert and prompt them to modify. Can show a complete model of knowledge provided by one expert to a second expert to cross-validate. Can create one from scratch with an expert – start with a blank page Model Based Knowledge Acquisition Each model emphasizes certain aspects of the system to be built and abstracts from others. Each model is indicative of one view of the world Models provide a decomposition of knowledgeengineering tasks: while building one model, the knowledge engineer can temporarily neglect certain other aspects. Knowledge Modelling Process Knowledge Modelling Use skeletal models Or generic tasks Generic tasks are templates of problem-solving activities that can be configured together to describe any intelligent activity. Modelling Frameworks Knowledge Modelling At least five different types of knowledge are distinguished: Tasks-goals Problem-solving methods describe the primitive reasoning steps in the problem solving process. Ontologies ways to achieve the goals described in tasks. In some knowledge modelling frameworks, problem-solving methods define subtasks to which other problem solving methods can be applied. We will call such a decomposition a task instance. Inferences correspond to the goals that must be achieved during problem solving. describe the structure and vocabulary of the static domain knowledge. Domain knowledge refers to a collection of statements about the domain. Principles Divide and conquer. Configuration of an adequate model set for a specific application. Models evolve through well defined states. The model set supports project management. Model development is driven by project objectives and risk. Models can be developed in parallel. Recommended Reading Knowledge Engineering: Principles and Methods Rudi Studer, V. Richard Benjamins and Dieter Fense Data & Knowledge Engineering (1998) Volume: 25, Issue: 1-2, Publisher: Elsevier http://www.hubscher.org/roland/courses/hf760/readings/st uder98knowledge.pdf Contrived Techniques Knowledge Capture – Specialised Techniques Contrived Techniques Primarily for deep, tacit knowledge Involve the expert performing tasks they would not normally do as part of their job. Most of these techniques come from psychology Knowledge Capture – Specialised Techniques Important types: Concept (card) Sorting Three Card Trick (Triadic) Repertory Grid Technique Constrained Tasks 20-questions Commentary Teach Back Usually involve expert doing two types of task: Tasks they normally perform Commentary is useful here Tasks designed to probe the expert Concept sorting or Triadic Concept (Card) Sorting Way of finding out how an expert compares and orders concepts Works best in small groups Simplest form is card sorting Can reveal knowledge about classes, properties and relations Collection of concepts (or other knowledge objects) are written on separate cards Cards sorted into piles by an expert in to piles - each card in a pile must have something in common Each time the cards are sorted it will be based on an attribute and each pile will represent a value Enables significant elicitation of properties and dimensions Used to capture concept knowledge and tacit knowledge Use in conjunction with triadic method Can also sort objects or pictures instead of cards Concept Sorting – How To ? Decide what classes of concepts you want to explore (in particular their properties – attributes and values) Write the name of each concept on a separate card At the session explain to the expert what is going to happen Ask the expert to name the piles Write down (or record) the results of the sort Collect the cards and ask the expert to sort again Repeat until the expert can’t sort anymore Triadic Elicitation Method (3 card trick) Used to capture the way in which an expert views the concepts in a domain. Present three random concepts and ask in what way two of them are similar but different from the other one. Answer will give an attribute. A good way of acquiring tacit knowledge. How does it work ? Select 3 cards at random Identify which 2 cards are the most similar? – Why? – What makes them different from the third card? Helps to determine the characteristics of our classes Picking 3 cards forces us into identifying differences between them There will always be two that are “closer” together Although which two cards that is may differ depending on your perspective Triadic Elicitation – How To? Explain to the expert that you are trying a technique to draw out deeper knowledge Collect all cards previously used Shuffle cards and randomly select 3 Place them on the table, two close together one further away Ask how the two close together are similar and the other different Write down (or record) what the expert says using an attribute Use the results to find another card sort to find the values of all concepts for this attribute If the expert can’t identify an attribute, just pick another 3 cards Repeat until the expert can think of no more differences 20-Questions Expert asks questions of the engineer The Knowledge Engineer thinks of an object/concept in the domain Expert asks yes/no questions to the knowledge engineer in order to deduce an answer. Knowledge Engineer notes the questions and the order in which they are asked need not know much about the domain, or have an answer in mind, just answer “yes” or “no” randomly The questions asked provide a good way of quickly acquiring attributes in a prioritised order. Can provide an insight into the key aspects, properties or categories and their relative priorities. Note that the main purpose of this exercise is not really to try and find out what the Engineer is thinking of, but to determine the important properties! 20-Questions – How To? Decide on set of concepts you need to explore in more detail Explain to the expert what is going on Ask the expert to imagine that you the engineer have the same level of knowledge they do about the set of concepts Instruct the expert that they should ask the least number of questions to deduce the answer Engineer can only answer yes and no Explain that the best way is to ask questions which split the concepts in half so that each question eliminates half the possible solutions Start As each question is asked write it down (or record it) When a number of questions have been asked take the expert back to an earlier question and change the answer you gave to prompt the expert to ask further questions After the session extract the attributes and values (or new concepts) from the questions asked and these will be added to the knowledge base Laddering Involves the construction, modification and validation of trees. Accessing personal construct system Take a group of things and ask what they have in common Then what other ‘siblings’ (brothers/sisters) there might be A valuable method for acquiring concept knowledge and, to a lesser extent, process knowledge. Can make use of various trees: concept tree composition tree attribute tree process tree decision tree cause tree Example Source: Bourne and Jenkins , Eliciting Managers' Personal Values: An Adaptation of the Laddering Interview Method, Organizational Research Methods, SAGE 2005 Concept Tree Hierarchical diagram of concepts showing classes and members Activities to create Move nodes (concepts) around the tree Add new node Deleting nodes Renaming nodes Difficulty is avoiding the problems which requires knowing: All links on the tree represent an ‘is-a’ relationship Terminology to describe the tree What classes to use in the tree Naming conventions to use How to deal with complex cases (e.g. multiple parents, synonyms) Concept Tree – ‘is-a’ relationship Is-a = is a type of Different to ERDs lorry car traffic steam ship vehicle ship sailing ship shipping lanes pollution traffic issues congestion Road safety What are the mistakes in this tree? Concept Tree - Terminology Root node Leaf node Branch Parent Children Descendants Concept Tree – What classes to use? Class is a concept which has children on a tree Other concepts are related to it by an is-a relationship To develop classes use either a top-down or bottom-up approach Top-down start with a set of general classes and refine Bottom-up start to develop classes by grouping those concepts that are similar Repertory Grid technique Used to elicit attributes for a set of concepts Used to rate concepts against attributes using a numerical scale Uses statistical analysis to arrange and group similar concepts and attributes Allows the expert to provide a rating of each concept for an attribute in concept sorting A useful way of capturing concept knowledge and tacit knowledge When many ratings are provided using many attributes statistics can be applied to find clusters and correlations Requires special software Repertory Grid – How To? 1st stage Concepts are selected (between 6 & 15) Set of approx. same no. of attributes is also required 2nd stage Expert provides a rating for each concept against each attribute Numerical scale is used 3rd stage Should be such that values can be rated on a continuous scale (e.g. small to large) Chosen from knowledge previously elicited Ratings are applied to cluster analysis to create a visual representation of the ratings called a focus grid Concepts with similar scores will be grouped together, attributes with similar scores will be grouped 4th stage Engineer walks expert through the results to gain feedback and prompt for further knowledge about the groupings If needed more concepts and attributes are rated and included in the grid Repertory Grid Example Domain elements are certain types of crime: petty theft, burglary, drug-dealing, murder, mugging and rape. This is one expert’s view on the issue. Consider carefully whether any pair of attributes are very similar, by comparing horizontal lines in this grid. Beware, when making this comparison, that the expert may have inadvertently ‘inverted’ the scale for just one of two similar constructs. The closest is probably the personal/impersonal one and the major/petty one. For example, in the example the major/petty construct has a value of 5 for ‘major’. If the expert had chosen 1 instead, and 5 for ‘petty’, then this construct and the personal/impersonal one would look very different. Further analysis may lead you to omit one pairing of constructs. Following that you would draw up a table showing how similar or dissimilar each domain element is from the others. For example, when the absolute-value metric is used, the (numeric) difference. Constrained Tasks Expert performs a task they would normally do, but with constraints. Variations: limited time limited data Useful for focusing the expert on essential knowledge and priorities Commentary and protocol generation Expert provides a running commentary of their own or another’s task performance. A valuable method for acquiring process knowledge and tacit knowledge. Variations: self-reporting imaginary self-reporting self-retrospective shadowing retrospective shadowing Knowledge Analysis and Modelling Knowledge Analysis Identifying the elements needed to build the knowledge base Concepts Attributes Qualities or features belonging to a class of concepts Values Things that constitute a domain Main elements of the k-base Specific qualities or features of a concept that differentiate it from other concepts Relations Way in which concepts are associated with one another Concepts Physical concepts Experts, roles of experts Organisations and groups Producers, suppliers, consumers, departments Areas of knowledge Marketing, physics, chemistry Problems, solutions, advantages, disadvantages Physical phenomena Activities performed by experts Issues Purpose of components or roles Tasks Documents, databases, websites People and roles Functions Plans, goals, requirements Sources of information Products, components, machines Pieces of information Mechanisms and forces Other issues Constraints, behaviours, states Attributes Of physical objects Of information Source, format, importance Of people Shape, age Gender, age, personality Of organisation Size, turnover, product range Values Come in different varieties Dependent on type Categorical For values that are numbers Text For values that are adjectives Numerical Adjective, number, sentence, paragraphs, hyperlinks, images, pictures For values that are one or two sentences Hypertext For values that are chunks of hypertext Relations Has part Performs Followed by Requires Causes Produces Can have an inverse relation Short exercise Think of something that illustrates each one of these Knowledge modelling K-model = way of viewing the knowledge in the k-base Each model provides a different perspective on the knowledge Helps clarify the ‘mess’ that is the knowledge Can be used in elicitation Trees Diagram showing hierarchical arrangement of nodes Node = concept Link = relationship Concept tree Composition tree Cause tree Mixed tree Concept tree Each link is an is-a relation Taxonomy Read from right to left Taken from www.pcpack.co.uk Other types of tree Composition tree Process tree Special form of composition tree All nodes are tasks Attribute tree All links are has-part Used to show components and sub-components of a concept Shows attributes and values to describe a concept Mixed tree Contains more than one type of relation Matrices Attribute matrix Presents set of properties of a concept (attributes and values) Concepts on vertical axis Attributes and values on horizontal axis Relationship matrix Shows two sets of concepts related to one another using a specified relationship Cells show which pairs of concepts have the relationship Maps Shows an arrangement of nodes linked by arrows Each node represents concept Link represents relationship Concept maps Process maps Concept map Many different types Knowledge Analysis – How to? How do you identify concepts from interview transcripts and documents? Need some codification Highlighters – different colours for different things Typical project 47 steps proposed by Milton Knowledge Acquisition in Practice: A step by step guide, Milton, Springer-Verlag Phase I Phase II Initial capture and modelling Phase III Start, scope and plan the project Detailed capture and modelling Phase IV Share and store knowledge Phase I – Start, Scope and Plan Identify a project How it can benefit, what it involves Gather opinions from relevant people Document ideas as project proposal Seek agreement on proposal from key people Start knowledge capture With domain experts break the domain into different topics and rank against key criteria Identify a proposed scope and finalise Identify sources of knowledge Define and understand the type of project to be able to create a schedule Collate the proposal, scope and schedule into a project plan and disseminate with other materials to team Phase II – Initial Capture and Modelling Learn the basics of the domain from documents and informal conversation with experts Prepare for semi-structured interviews then execute and transcribe Analyse results to identify concepts, create a concept tree to develop a taxonomy and validate with experts Create a k-page for each concept K-page = 2 column table showing all knowledge associated with a concept Create a glossary Build a meta-model showing the relationships between concepts and relationships Build appropriate k-models Continue with validation models Phase III – Detailed Capture and Modelling Use further interviews and modelling to capture more detailed knowledge Finalise k-model Prepare prototype end product used to carry out assessment exercise with sample of end-users Produce a completion plan defining what needs to be done to complete the project Use specialised techniques to do detailed knowledge capture If needed cross-validate between experts and resolve conflicts Phase IV – Share and store knowledge Define and create format of end-product Create provisional end-product using k-base Give to experts for full validation Create final end-product and release for use After use for some time assess impact on organisation and document it Conduct complete product review to learn lessons and make suggestions to change methodology Ensure end-product is useful, usable and used End-users must find Find product useful Find product easy to use Actually use it