LIBRARY OF THE MASSACHUSETTS INSTITUTE OF TECHNOLOGY ^l&n-=teaSl Hewoy mss. i;!si JAN ALFRED P. SLOAN SCHOOL OF MANAGEMENT CONTROL STRUCTURE FOR A QUESTION-ANSERING SYSTEM Ashok Malhotra January 1974 #693-74 24 TECii 74 CONTROL STRUCTURE FOR A QUESTION-ANSERING SYSTEM Ashok Malhotra January 1974 #693-74 This work was supported by Honeywell Information Systems, Inc. 200 Smith Street, Waltham, Massachusetts 02154. RECEIVED JAN 28 1974 M. I. T. LIURARIES PAGE 2 INTRODUCTION Despite my best efforts at explanation, my grandmother still thinks her, of the computer as a machine that answers questions. this is can do intelligence far more manifestation of a doing "sums"; than "sums"! It question-answering not surprising is therefore, to me, systems have as long powerful answers questions, while everybody for a To few that history in artificial intelligence as problem solving. describes the This paper design of a proposed answering system that will answer questions from Ue base. and the strategies corporate data discuss the rationale behind the proposed system will design assumptions. a question- decisions Further, we developed for that will follow discuss this system from the initial question-answering the basic principles and the underlying them. EARLY QUESTION-ANSUERING SYSTEMS flcCarthy importance (1) of seems verbal to have learning been the first and reasoning. described the design for an "Advice Taker"; accept natural language input and be able a to realize In 1958 the he machine that would to solve problems PAGE 3 involving deduction attention to much of number a contemporary seminal contribution. natural the parse sentences it subset of prestructured tree of SYNTHEX a of One of that (4) was in English. It format a an data questions in a data base on a able to those from that questions in a separate of the answering it. was an ambitious system that contained the childrens encyclopedia as a data An base. prepared listing the location of each "content word". that built system ingenious thus by the declarative of to was inspired by BASEBALL (3), operated base discussed (2), answered response and was to this roots it with those the input how surprising is and Phillips system a it trace its can the problems statistics creating questions about of work collaboration, provided baseball problems issues and A.I. called paper This had received earlier as data. Chomsky's limited of basic built work, matching with premises. understanding language Chomsky's given on text index was Sentences could be relevant to the question were then retrieved using this index and subjected to an analysis procedure to screen out the irrelevant sentences. The analysis considerations, procedure the system was incapable of consolidate was however, and was based on simple syntactic Further, not very effective. deduction and could not, the information contained in a number of Since the grammar was independent of the program it therefore, sentences. could easily PAGE 4 be extended but the program suffered from knowledge and the underpinning of SAD-SAM and SIR (5) (B) a lack of semantic of the uorld. a model accepted information in the form of a restricted set of sentence types and created a semantic network of objects and relations. questions. Mas capable of accepting and analysing ownership information. input formats in SIR It was but the considerably more difficult Each This network was then used to answer SAD-SAPl was resticted to familial of these early set, relations while SIR position part, and relatively simple to increase the semantic analysis routines were to extend. than one of the systems attacked more fol lowing problems. a. Creation and modification of (restricted) natural b. c. a world model from language input. Answering questions from a data base. Solution of simple problems using deduction on data base e ements. 1 In addition, understanding some most of subset of difficult each of these problems of or systems took these English! is Ue on the now task realize and can admire the of how ingenuity these early systems that achieved encouraging results by more less frontal attacks. PAGE 5 LATER SYSTEnS It surprising tinerefore, that later systems were more is not limited in their objectives and tended to focus of development U. natural in mainly is "transition network" grammar for syntactic parser very is are real, simple, if practicability and he has world situations. geologists lunar system. .. been able "The prototype was run twice a day for three days .. (of His data bases to demonstrate were invited . the systems in the moon-rocks system) and during this time the ... to During this demonstration . and although its efficacy powerful, computer-based question-answering of (7) Woods' from the chemical analysis of the moon-rocks (8). limited by its rudimentary semantic knowledge. real sentences to analyze a parser He used this parser in a system to answer it. from the data base of an airline schedule book questions later, a languages and implemented accordance with great there has been a the other areas as well. in Hoods developed A. of language Ue shall describe only the natural the above problems. question-answering systems here although deal on only one ask questions 80% of the of the questions were asked and that fell within the scope of the data base which were parsed and interpreted correctly in form in by Uinograd (9) exactly the which they were asked ..." A for the question and world of demonstrates a powerful command system implemented set of children's capabilities of blocks on sentence a table-top analysis and PAGE S comprehension. small part Further, to most the world contributes The simplicity of the impressi veness the knowledge of procedure and this makes it In developing a novel Carbonnel developed of in the is no however. system, the system in encoded as difficult to extend. approach to computer-aided instruction the prototype of a knowledge-based system that was capable of answering questions in addition to asking and evaluating them. A limited system based around knowledge of the geography of South America subset of English. The system is undergoing further development. is operational (10) with a limited PAGE 7 A QUESTION-ANSWERING SYSTEH FOR nANAGEMENT negligible The routine than documentation. agreed The reasons computers upon: must be instructed operate they impact of computer-based systems decision-making in at a is are also behind this are difficult to fairly well with and Although powerful, with a series of specificity and of need communicate excrutiating detail. level on other to knoun uell too elaborate conventions that the manager cannot tolerate. To be useful, deal about the Furthermore, must a environment within should not it be capable of manager which the performing functions that a question-answering decided to allow great must "understand" a manager operates. programming require procedural are useful and to the Starting with these premises we his day-to-day work. in proposed computer system system as questions that required a Ue management aid. retrieval from a data base as well as the evaluation of functions of retrieved data. restricted class of pre-programmed models could the form of subroutines but apart from A be executed in was these, no deduction permi tted. Starting from these guidelines and the system should be easy to computer the general premise that learn and use for managers with experience we developed a specifications, many of which proved more detailed to be quite no set of system different from earlier question-answering systems. 1. Si nee it is important that neophyte users should be able to PAGE 8 learn to use the system easily and painlessly the system should answer questions about itself. unique in (He believe this to be question-answering systems.) 2. The user should be protected from system errors and should not 3. Attempts should be made be expected to take any remedial action. as far as possible. to continue the dialog If a with the user question cannot be answered the system should attempt to say something relevant and intelligent to convince the user that request. If a question is it has understood his ambiguous or unclear the system should ask a question or make suitable assumptions (which should be intimated to the user) and generate an answer. 4. In a real situation questions from a it is not possible to answer data base containing only objects and events. Aggregate data must be included. For example, sales figures cannot be computed by checking and aggregating values from the set of individual sales events. The inclusion of aggregates, however, gives rise to name ambiguities. problem and its solution are described (To our knowledge, in a This later section. no existing question answering system incorporates aggregate data and the knowledge necessary to answer questions about 5. The subset of natural it.) language that can be used to address the system should possess adequate power and flexibility. Similarly, response times to typical questions should be PAGE 3 reasonable. These criteria have not been formalized because It impossible to postulate satisfactory conditions a priori. is Ue propose to establish satisfaction along these dimensions exper mental i ly. Ue are convinced that to have a great deal of particular Minsky setting it makes an (11) knowledge. This a question-answering system uould need knowledge about the business world, the was to be used in and about the data base. excellent case for too can be considered need the such for initial design to be an decision for the system. Figure is a 1. Functionally, the schematic system can parser and the processor. This paper a a model MAC. and the theory case of grammar of the world, the design is answering is its point systems, such parts, the rely of a model the semantic, sentences. and of The the processor. being implemented by Prof. group at U. A. Project "case oriented" approach to (See Fillmore Celce-murcia implementation.) Similarly, the processor and this of Automatic Programming It has a strongly the analysis of English two data base describes parser has been designed and nartin (12,13) divided into These two operating sub-systems upon three knowledge bases: scenario situation and system. diagram of the proposed be departure as Uood8'(7,8), is from and (15) the (14) for for an early also knowledge-based earlier question- its main strength. PAGE 18 In addition, more the system powerful uii I considerably operate on a data base I attempted with a question-answering than anything system to date. The parser examines parse for A it. the parse ly identified standard format. attempts response. general in meaning of the parse as the between encoded in a input and the input request and generate an appropriate Understanding the sentence therefore, takes place in a sense within the parser and much more specific sense in a the processor. Ue shall say only processor that the is known to the the parser since few words about a central routine examines each word is and creates a sentence the of the set of relations is a it parts processor uses The to fulfil some sense, in sentence, but more accurately, semantical to the system input is, in paper. to this Unknown is The morphology the input request and checks world model. it words are if it analyzed to determine whether they are variants of known words or idioms. recognized by the morphology routine a If a word cannot be message printed out indicating the offending word and the user is is asked to retype his request. Once to the complete sentence is find the sentence main verb accepted, and to arrange as cases of the also made of the case-frame of the the noun phrases in the Initial prepositions that main verb. mark some of the noun phrases assist the parser attempts in this process, verb which but use is determines the PAGE 11 cases contained in participate world the Most of in. the sentence If his request. valid parse. which is In some cases the set processor isolates the parser prints scenario knowledge to of parses cannot be message out a word be parsed, or cannot the rephrase to more than one processor these parses are transmitted to the All If ungrammat icai and words the scenario world. in a message to the user and asks him uses specialized them. this knowledge is Specialized model. meanings and some idioms are contained parser prints out determine meanings of the nouns which can take and the it cases they can the eliminate some of reduced to stating that one, the sentence is the ambi guous. THE DATA BASE This sub-section describes language a knowledge. Later organization on question-answering strategies sub-sections discuss for impact the encoding of this and consider some alternative organizations. The knowledge representation language (which may be the same programming language) as the objects and must allow their properties. It the representation must also be able property information of similar objects distinct. it is base convenient if the in which a to keep addition, language has facilities to query the data and extract information from HAPL (IB) In of it. manufacturing Ue use a language called corporation called The- PAGE 12 Battery-Company can be represented sotneuhat as ( fol lous: (THE-BATTERY-COnPANY) (THE-BATTERY-COnPANY) (IS (KIND CORPORATION)) (MANUFACTURE (THE-BATTERY-COHPANY) (SELL (THE-BATTERY-COnPANY) (KIND PRODUCT)) (KIND PRODUCT)) (EMPLOY (THE-BATTERY-COMPANY) EMPLOYEE)) (THE-BATTERY-COMPANY) MATERIAL)) (BUY This states corporation. buys manufactures and sells product can be THE-BATTERY-COMPANY that It a into this a kind employees of and The properties of the kind of product. inserted is employs material, definition or described separately as: ((KIND PRODUCT) (IS (KIND PRODUCT) LEAD-BATTERY) (COUNT (KIND PRODUCT) 5)) This states that the product is five types of (lead-battery being used as a name). could be described further as Massachusetts but law Similarly, a sub-chapter this may not be lead-batteries the corporation 15 corporation neccessary in and (KIND CORPORATION) may be sufficient. If the definition, product MAPL ui II definition keep definitions that may exist In the same way, in it is imbedded distinct from in other the main product the system. we can use MAPL to represent events. The statement "John met Mary at the corner." can be represented as; PAGE 13 ((MET) (AGENT (NET) JOHN) (AGENT (HET) MARY) (LOCATION (MET) (KIND CORNER)) (DETERHINER (KIND CORNER) DEFINITE)) flary can be or as a represented as kind of object verb "met". either another agent depending on our (as above) interpretation of the PAGE 14 QUESTION-ANSUERING STRATEGIES A semantic analysis of English questions is presented in the questions in that needed to concentrates on what is questioned and about it arrive The at them. meaning determined syntax important to the analysis. is The analysis questions in part by somewhat selfish in that is is, it mainly analyzes proposed question-answering system can Questions beginning with "why", for example that the expect to encounter. question of the the syntactic form and therefore however, , of rather than upon the syntactic form and the transformations what the linguistic analysis This differs from traditional Appendix. are not treated, as the system does not expect to be able to answer them. UH-QUESTIONS general, wh-questions In "which Object etc.) " ask properties (possibly for are modified by with "what", (questions starting the properties questioned tense words) by of objects "be" or while event events. or verbs "have" properties are generally questioned by the main verb of the event. After the questioned sentence is analyzed to the property can be or event exists in the knowledge is explicitly available. determine the retrieved directly base and if if property the object the property value At this point we should recognize that the knowledge base may be organized by object or by event. If it PAGE 15 is organized by object then each property however, one of of organization be represented as a I as its agent. If is also of the existence, verb. or "be", possible in which information is be A stored This allows a choice of retrieval paths and world or both ways. scenario knowledge can be used least searching. the I the knowledge base is organized by event objects will represented as properties dual event ui participants such its are questioned Thus, After somewhat complex. can be leads select the path that to to finding the entity whose properties the entity is located the value of the questioned property must be retrieved or deduced. This can be even more complex. Different question strategy but the general finding require types basic tasks of Note that in all finding the entity requiring will that embodies some level wh-questions in different from answering them not exist. It may be profit last year?" from are possible to the data base but be our profit next year?" requires a predictive model "Uhat requires answers objective restricted to the past and present tenses. answering and remain cases. wh-questions answer "Uhat was our the above of of the questioned property or deducing the value more or less central variants of subjective the future in the past judgement. tense is or present Thus, completely tense and a search for suitable subjective models that may or may PAGE 16 AGGREGATE DATA Some properties, itself, gives rise this means little; it to naming a be qualified by the parameters has to manufacturing it facility, product, customer, salesman and time-period. These may be be aggregated. to by "Sales", problem. over which is are production figures The aggregation can be made in several best kept as aggregates. ways and costs and such as Some of these parameters, or keys, as they are sometimes called, have be picked up from the various cases of the parse, to others can be or filled in by default and still others may be assigned typical commonsense values. TO SUB-ENTITIES there fact, In default that operates THEN is a powerful, general ATTRIBUTABLE IF A PROPERTY IS in English: THE ABSENCE OF SUB-ENTITY SPECIFICATION IMPLIES SUnriATION OVER THE SET OF SUB-ENTITIES, I.E. IT IMPLIES A VALUE FOR THE ENTITY AS A UHOLE. example, if there sales for all products Typical values periods, for complete period must contain, is product then on the data usually be unspecified. therefore, a in will list of key assumed a case grammer because each to be the last item variables which must be In addition, where the key variable be contained in the sentence. Time- question. Knowledge about each data specified before the data can be retrieved. contain information as to the sum of required. depend example, can if for In a sales specification, mention of a is no it must specifications This is not too difficult kind of information has a in special PAGE 17 assigned to case occur in the time periods For SKample, it. and customers "during" case usually the in "recipient" case. Typical value information must also be contained in the knowledge each data item but this about is better organized by classes of data items rather than for each item. one of the dimensions of In the case of "cost" is the reason for variously named transportation single name in This gives rise to expenditure. or type of costs such expense" and as "interest As each cost category cost". the system it aggregation is "product stored under a becomes necessary to determine the unique name by operating upon the noun modifiers of the word cost (or expense) and what can transportation cost for lead" done the our system in checking if context. This it is is a is "lead"). type of named by context its the context by finding a subset of called be data item (in "the This and then one of the modifiers continued recursively till is or no further subset ting is possible. Some sentences can be constructed so that the context serves to determine the retrieval. unique name as For example, be assumed to ask for the well as provide a key for its "Show me the cost for all products" can direct-manufacturing-cost as this is the only cost available that has "product" as a key. Since purposes an fairly different aggregations may obvious strategy disaggregated be required for different dictates that state and aggregated data be in kept in a response to the PAGE 18 is stored and data This requires knowledge of how the needs of the question. the keys maintained for each data look to for the in sentence to be But maintaining data at one item. level The aggregation of annual of aggregation may not be sufficient. sales for five products from five plants to five customers stored This can mean a by month requires 1508 probes of the data base! long wait at console for Thus, user. the higher levels of level of aggregation must be maintained as well. system The should aggregation required "association list" is the highest In our system, has time data every created to be This list contains the range over which aggregation retrieved. has recognize to question and use the the of aggregation for which data is available. level an able be to answer to be performed for each key variable on the assumption that the lowest aggregation whether can be level of aggregation is performed, will however, the list Before used. be is the processed to check data with a higher level of aggregation is available and used. If the so, list is modified and the data-name changed to a higher aggregate. Uoods' found that when analysis of a rock, they wanted, the ten most common elements. distribution of sales lunar geologists in fact, asked for the its analysis for only Similarly, a manager may want the by product whenever he asks for "sales". Thus, particular questions may come to have iconic meanings above and beyond thier literal meaning. The present design of the PAGE 19 system does not recognize iconic meanings. the direct concerns of not are and aggregation of data It can be argued that retrieval the question-answering system and it should restrict itself to extracting the key information from the sentence and passing knowledge retrieval efficiently. problem and hope He that of the data used be will it files to perform the this decoupling agree with uses system that a data retrieval on to it about the structure the of future in implementat ions. MODELS The use of meaning as a the word model, replica. relationships that result, Typically, establish stems from its management, in models the dependencies variables on independent, or decision, dependent variables are often figures are of set of target, or a health of the enterprise or the success of some portion of the model used to assist is the it and decision-making process by in the predicting the effects of changes The variables. of merit that measure in the Independent variables on the dependent variables. Management shapes and structures depending on the nature of the process they model and nature the of models the can take a decision-making variety they of support. consider only one class of models, those that can be by a set described can, of mathematical operations in the next sub-section) therefore, be encapsulated (such as I4e shall represented the functions on specified items of data and in a subroutine. (Other types of PAGE 28 models, and in particular, very important and very useful. modified by the manager are They have been excluded here reasonable size. Models See, order to limit the problem to a in however, Krumland (17).) usually referred are specialized those that can be calibrated, Input data is not specified unless neither exceptional and is it are the operations to be performed on the question. name in to by This information must, it. therefore, be available to the system as a property of the model. Key information relevant however, be specified A request search is for the question. using subroutine that execution will, key the calculates the possible to cause a therefore, After these have been located the data input names. retrieved always in model for input data must, retrieval of to the information and fed Although output value. mathematical specify models as into the is it functions of customary, data only, this is therefore, to specify models that use the output of other models when the as input. Thus, encounters an input evaluation routine always convenient. not that input retrieval is a to evaluate model output it. It is routine for it models calls the model The structure of correctly specified models ensures that this recursion always terminates. In addition knowledge about also a an argument list model, like knowledge to contain information as to where found in the sentence. Each model and a subroutine, about data items, the must key variable values can be must also have a definition PAGE 21 can that be used calculated?". questions questions such to answer In fact, as "How is profit more than one definition is required if about various aspects of the model have to be answered properly. FUNCTIONS required may be the property In some cases a function of data such as percentage, distribution or average. Unlike models, functions therefore, can operate on names of the inputs to the data and, variety of a the the function have to be specified in sentence along with key specifications for their retrieval. A number of are used conventional devices data items on which the function is to operate. to specify the For example, the question the percentage of will name the data item and the subsetting characterict ic: percentage of distribution a subset of plant capacity is required, is over all utilized?". the data item will the key variable or subsetting aggregated an item is required, Similarly, "Uhat if a be named along with The data will have to name. if be the unspecified variables and retrieved as a function of the specified characteristic. In general, names occur determination in data names, key different of the of key specifications for into account the function did parts specifications and to be executed. sales increase over 1972?" and sentence subsetting and the each argument must take Consider, "How much "How much did sales increase PAGE 22 To Interpret and answer these two questions correctly In 1972?". understand that the system must function to the Typically, only one set of Key variables is values. the the arguments are both of the same type with different key variable "increase" others being picked up specified, The defaults, however, by default. depend on knowledge about the function. the analysis Thus, data has to ask for functions of questions that depend heavily on the function to knowledge about determine data name and key variable information. been Once this has the data items can be retrieved and fed done, however, of into a subroutine that evaluates the function. Functions are usually specified by name exception is "How many" which objects or events. This is, in the asks for the question. count of however, an unusual a set An of function since it does not operate on data. DEFINITIONS Consider the questions "Uhat information do you have about product cost ?" and "Uhat do you know about product cost ?" After product cost possession is located directly and checked to be a subset of a of "you" (information) or located by a search from among the possessions of "you" what does the system reply? The problem known with "know" is similar, although the may be called knowledge and the search avoided. what is required is, in some sense, the definition of object In fact, product PAGE 23 This is implicit cost. in the question but must be made explicit for the system. convention examples these fact, In in English: ITS MOST IMPORTANT being the principal requires general more a IS ABOUT AN QUESTION REQUIRED. 1972?" asks for OBJECT THEN example, For value the value of sales: characteristic of data. "What dog?" is a the dominant characteristic of dogs in general, perhaps that they are animals. the demonstrate CHARACTERISTIC IS were sales in "Uhat IF THE this dog?", however, "Uhat is breed. Carbonnel Collins and in these represent (10) concepts" for each noun explicitly asks for perhaps its dominant characteristic of this particular dog: "super the data base. YES-NO-QUESTIONS As discussed restricted truth of stated appendix, the in to the present and the past propositions. yes-no-questions, if tense inquire about the Propositions may concern the properties of stated objects and the identity of actors and other particulars Yes-no-questions in the future tense about events. involve predictions and judgements about the future and are, therefore, excluded from the scope of this system. In a data base organized inquiring about the truth answered by selecting by nouns, a of stated properties the appropriate nouns properties against the stated properties. yes-no-question of nouns can be and matching their Ue find a recursive PAGE 24 matching property routine very be checked against those useful for this. must be It mentioned in the questioned remembered that the properties in the data must base and not vice versa as the data base may contain properties other than those questioned. Event matching Since events possessor is somewhat more complex in such a data base. stored are as properties of the agent or located in the question and in these nouns have to be the data base and their properties searched for the event. this is accomplished, or a set of possible events selected, properties have to be matched against those alternate organization The decision as to should be ("be") verb. whether objects stored under the the in the that requires a participate in under event and/or their question. or verb, by event, Once an event the existence Direct retrieval for identity-questions and yes-no- questions requires both facilities: "Who robbed the bank?" property of "rob" "Did Jack rob the bank?" property of Jack If properties retrieval is are only required for indirect retrieval required stored by the event then second question for events in an indirect much like the a noun-oriented data base. There events in contents seems to be an the world but essential duality between objects and depending on of the data base there or vice versa. Thus, retrieval the question and the may be more objects than events through one or the other will PAGE 25 lead to less searching. If files is lees of data length the retrieval important than the amount of searching required in the process, then a dual organization in which properties are stored under both events uith the help and objects and In fact, MAPL is selected, path search will to minimize Knowledge base, of the yield the best results. organization the retrieval designed for the dual properties under both the and automatically stores object and the event. IDENTITY QUESTIONS Questions different start that yes-no-questions. questions is Uhat is identity the properties stated i required "Who" while "which" answer as an that quite more like fact, in these to satisfies the answering the process of yes-no-questions on a set of a set of questions ask are "which" or are, object of the of satisfying the objects that are capable in general. objects "who" the question and in can be likened to answering candidate with wh-questions and from other for the questions can conditions identity of ask for animate either animate or nam mate identities. i Answering routines for identity questions, therefore, with the selection of a name for this set in is set of candidate objects. start The generic invariably specified as the main object noun identity questions with "be" verbs that specify the properties of the required object rather than the event it participated in. PAGE 2G candidate set The is the generic name. agent "Uho questions that There candidate set is, the ask the identity of any direct clues to the candidate of an event do not give set of objects. the the set of all objects that are "a kind of" however, the rule that the objects in given event and to perform the must be able this can often be used to narrow down the set from the set of all animate objects. is t i fairly small and so through all of them is not very search a me-consuming. Once the candidate set is objects In many data bases the set of animate is the selection process established, much like performing a yes-no-question matching on each except of each matching is that the result candidate or "no". The reply final the identity of the can be the question to event created from the set of identities that have been returned by the individual yes-no-questions. the appropriate answer is If this set "None of them" is set then the null for "which" questions and "No one" for "who" questions. THE WORLD MODEL PROBLEn Every data base and every question-answering system embodies a particular model of the world. Further, it expects questions about concepts and properties that are sensible in terms of model. if the user has a A severe problem can arise, model of the world which in significant ways. is therefore, at variance with that of the this system For example, our data base contains direct PAGE 27 manufacturing costs and overhead costs for activities that cannot The overhead costs are be directly attributed to manufacturing. not allocated to products, since we feel this therefore break-even points user who likes to have no meaning think in terms it A system. in the the system if merely can't provide break-even data. Ideally, a artificial, and of break-even quantities will ask for them and may not be able to proceed says is the system should be able to realize that there discrepancy between world models and since model of the world it should explain it it is cannot change its user to try to the to influence his. The proposed system approach to this problem. as break-even, world. that Every is simple-minded maintains a list of concepts, such models of the to variant question asks for one of these concepts responds by printing out why the question It extremely an knows belong it time a adopts an explanation of its world model it and inappropriate. IMPERATIVES Besides asking questions, the user of the system should be able to request services from the system by using commands. allows the dialog action to be much more and as the actions possible are the types of commands that it. The services provided natural. Commands ask by the system are may serve as meaningful by the system This for limited so input to are limited to data PAGE 28 retrieval, model execution and the provision of information about itself. Typical commands the to therefore, system, are as fol lou: "Show me the sales to Sears for 1973." "Display the names of customers with outstandings of over 85808." These questions seem to ask for data retrieval. "Compute the profit for '73." "Calculate the return on investment last year." These questions seem to ask for the value of functions of data or the execution of models. The distinction is specious, base will determine computed. what can The user will, of the database and, The structure of the data however. be retrieved in general, has to and what be be unaware of the structure therefore, his choice of verbs should not be considered significant. Other verbs have semantic significance and will require special routines to process data and generate answers. "Compare the distribution of sales to Gulf and Sears by unit." "Contrast the sales for each quarter of this year and last year. "Sort the customers by outstandings." CONCLUSION Host of the question-answering systems built to date have PAGE 29 started by devising data base answering strategies structures and defining question- from scratch. some extent, To this was necessary because each system addressed a particular kind of data base and answered limited sets of question types using parsers of very different styles and capability. great deal about knowledge from these systems and it We have however, representation and question is learnt a analysis desirable that future developers of question-answering systems build upon this experience rather than replicate it. Ue hope that our analysis, limited as it is to our situation and our goals, will be useful to future researchers. PAGE 38 APPENDIX ANALYSIS OF INTERROGATIVE FORHS IN ENGLISH This appendix attempts to classify the types of questions and commands possible in English. The classification is not primarily by syntactic structure, since that rather by semantic structure; structure of the sentence provides many clues to is the concern of the parser, uhat is asked and about uhat. is still but The rather important, houever, as it its semantic function. To the best of my knowledge, such an analysis has not been attempted before. For reasons of brevity presentation is restricted to the sentence types that are expected to be encountered by the question-answering system. English allows a wide variety of question types that can be analyzed along a number of dimensions. Dost basically, questions can be analyzed according to the answers they require: objective, i.e., mathematical and logical functions of known data, and subjective, i.e., requiring the formation of inferences and judgements beyond those possible from the models and data contained proposed system will not attempt in the system. to answer questions that subjective answers. This does not mean that it will The require be unable to answer questions that ask for reasons or opinions, but merely, that it will be restricted to providing such answers that can be constructed by sets of rules and data contained within the system. Questions requiring objective answers can be further subdivided according to the analysis required to generate the answer and the type PAGE 31 of sentence construction used. Uh-questions are so called because a word starting in "uh" ("how" is an exception) is used to ask the question. Such questions Yes-no-questions, on the other hand, ask for data or property values. are distinguished by an auxilliary verb, such as "do" or "can", preceding the noun group. These questions enquire about the truth of stated propositions and expect either "yes" or "no" as an answer. Sometimes, of course, the appropriate response is "I don't know". UH-QUESTIONS l.Also called adverb questions, these usually start with a question adverb. Each adverb refers to a particular type of attribute. "Uhy" asks for reason or causality and generally requires subjective answers. "Uho broke the window ?" Asking for the Identity of the causal agent. "Where did the accident happen "Uhen did the accident happen "What" is special in that ?" ?" Asking for the location. Asking for the time. the attribute and the object must both be specified for existence ("be" type) verbs. "What is the hieght of the Empire State Building ?" For other verbs, "what" asks the value of the object. Another case, typically the agent, must also be specified. "What did you see ?" "What did Alex write ?" PAGE 32 Sometimes the object can be assumed as "our" or filled in from context. "What Here sales in 1972 ?" "Uhat" can also be used to ask for definitions. "Uhat is a dog ?" "Uhat is contribution margin 7" "How" is somewhat special and comes in four forms: "How many people attended the ball ?" "How much beer will fill this glass ?" "How tall is Maria Asking for a count. Asking for quantity. Allowing the attribute ?" to be named after "How is profit defined In the final form, main verb and, 7" "How". Asking for method. the exact property questioned depends on the in some cases, even on its object. Question adverbs can occur by themselves as a sentence provided the context 2. is known: "Uhy ?", "Uhere ?", "How ?". Attribute values can also be questioned by an initial preposition phrase with a question adverb in place of the noun. "To whom did you entrust the keys 7" "From whom did you get the keys 7" Asking for recepient. Asking for source. PAGE 33 ?" "Uith what did John hit him Asking for instrument. In some cases the preposition may be shifted to the end. ?" "Uhat did John hit him with IDENTITY-QUESTIONS Questions starting with "who" or "which" are somewhat different from other Uh-questions in that they asl^ for the identity of objects whose properties satisfy certain criteria. "Uho broke the window ?" "Who is our largest customer ?" "Uhich plants had costs over 10% of budget last year ?" The first of these three questions requires a selection from among all animate objects. This is the case with "who" questions with semantic verbs. The second question requires a selection from the set of customers. Specification of a be selected from is common noun that determines the set of objects to typical of "who" questions with "be" verbs, (questions of the type "Uho is Guru Haharaj Ji ?" are an exception.) "Uhich" questions always specify a common noun which determines the set of objects to be searched. YES-NO-QUESTIONS Yes-no-questions come in a variety of forms. If restricted to the past and present tense they enquire whether objects have stated properties and capabilities and whether stated events PAGE 34 occurred. Another form They ask only if is a variation on Identity questions. the set of objects that satisfy the criteria is non-empty. If the future tense is also alloued, Yes-no-questions take on a considerably more complex character and use modal constructions to question stated events and courses of action are plausible, likely, recommended, neccessary, etc. Future tense yes-no-questions will not be discussed further in this appendix. The auxiliary verbs that signal yes-no-questions may be semantical ly empty. If not, they specify the following types of questions: Modals (could, uould, shall) The potential or expectation to act. "Have" type verbs (has, had) Possession or control over. "Be" type verbs (is, are, were) Existence. The following are typical examples of yes-no-questions: 1. Questions asking about the existence of a property. "Can monkeys sing ?" "Is the car colored ?" 2. Questions asking the truth of statements. "Is contribution margin the difference between list price and standard cost ?" "Are monkeys green ?" PAGE 35 3. Questions asking whether attribute values satisfy certain relation. "Is the car red ?" "is profit this year greater than a million dollars ?" "Is profit this year greater than last year ?" 4. Questions asking whether a command can be "Can you stand on your head carried out. ?" "Can you show me the profit for 19B9 ?" 5. Asking whether there is a non-empty set of objects with given properties. "Is there a house with three windows ?" "Are there two cars are identical ?" "Did any plants have costs greater than budget last year ?" "Are there two cars that are identical ?" Each of the above types of yes-no-questions can be formulated In a different manner that specifies alternatives to be selected from: "Do they have children or don't they ?" "Can you show me the profit for 1969 or don't you have that data ?" "Is his car green, black or red ? Clearly, these questions do not require "yes" or "no" as an answer but rather the selected al ternativeCs) or a negative. COnPOUND AND COMPLEX QUESTIONS The above examples are all simple questions, but questions PAGE 38 that occur as complex and compound sentences can be analyzed similarly. In complex questions, dependent clauses can be used to specify time, place or other features of the entity being questioned. "Uhat uas Soars' profit in the year Ajax lumber wont bankrupt ?" "Uhat car, did your brother say, he wanted us to tell Jane to buy An important class of complex questions are those for which an "if clause" specifies hypothetical conditions. Usually these are parameters to a model; unmentioned parameters being picked up by defaul t. "If sales went up by 10%, what would happen to profit ?" "If we closed down warehouses 2 and 5 and fed their customers from 3, how would this affect distribution cost ?" Compound questions can usually be broken up into two or more questions with the unrepeated conditions being common to them all. "Uhat was the temperature at 4 p.m. ?" in Phoenix on Saturday at 12 noon and ?' PAGE 37 BIBLIOGRAPHY (1) McCarthy J., "Programs with Common Sense," Proc. Symposium on Mechanisation of Thought Processes, H.M.S.O., London. 1959 (2) Phillips, A. v., "A Question-Answering Routine, Unpublished Masters Thesis, Mathematics Department, M.I.T., Cambridge, Mass., 19B0 (3) Green, Jr., B. F. , et a!., "Baseball Question Answerer," Proc. UJCC, Vol. (4) j An Automatic 19, 1961 Simmons, R. P., et al., "Indexing and Dependency Logic for Answering English Questions," American Documentation, Vol. 15 1964 (5) Lindsay, R. K. "Inferential Memory as the Basis of , Machines that understand Natural Language," in "Computers and Thought," Feigenbaum, E. A., Feldman, (B) Retrieval," (7) eds. J, Raphael, B. , and McGraw Hill, New York, 19G3 "A Computer Program for Semantic Information , in Uoods, U. A., reference (11) pp 33-145 "Procedural Semantics for a Question-Answering Machine.", Proceedings of the FJCC 19G8 PAGE 38 Uoods. U. A., "An Experimental Parsing System For (8) Transition Netuiork Grammars." in "Natural Language Processing", Randall Rust in Ed., Algorithmics Press New York, N.Y. 1973 (9) Uinograd T. "Understanding Natural Language," , Academic Press, New York, 1972 (10) Carbonell J. R. (11) (12) and Collins A. H. , Artificial Intel I igence, " Proc. , "Natural Semantics in IJCAI., 1973 "Semantic Information Processing," flinsky f1. I1,I.T. Press, Cambridge Mass., 19G8, pp 1-38 , ed. , Martin, U.A., "Translation of English Into HAPL using Uinograd' 3 Syntax and a Semantic Case Grammer", Automatic Programming flemo 11 Project nac, M.I.T., April 17 1973 (13) Martin, U.A., "Things That Really Matter — a Theory of Prepositions, Semantic Cases and Semantic Type Checking, Automatic Programming Memo 13 Project Mac, M.I.T, August 15 1973 (14) Fillmore, C. in Uni versa Is T. "The Case for Case," , Linguistic Theory, Bach and Harms ed. in Holt, Rinehart and Uinston, 19G8 (15) Celce-Murcia, M. in , "Paradigms for Sentence Recognition," System Development Corporation final report No. HRT-15892/7907 PAGE 39 (IG) Mart in, U. "riAPL 2," A., Automatic Programming demo 12 Project Mac, (17) Krumland R. M. I.T., B. , August 1 1973 "Concepts and Structures for a Manager's Modelling System," Thesis Proposal, Sloan School of Management, M.I.T., Oct 15 1973 H\ I Lib-26-67 lUOU UUJ OCf 13 -J ,,/-73 ||i||||||ii||i|ip|,.j,||T||fr|||"| I ^DflD 3 0D3 TQb DM '^O&O DD3 7°ih 3 31 6^1-7^ 3 TQfiD DD3 7Tb 6=?^-7'^ 3 TDflD DD3 fl?7 ? 3 TDflD DD3 627 t ^'??'- 7^i 6^fo'-7<^ 3 lOaO DD3 fl57 llill'!l'i'illi!i'i'"i'!''ir''i'lt'rr'i fc^??- 3 TDflQ DD3 fl2? 6?<?-7'-/ 3 TOflO 3 DOS 7Tb { ^Oflo'do'Vfl'^';"'^''