Practice guidelines and clinical effectiveness: A unified methodology

Decision support and disease management: a Logic Engineering approach John Fox and Richard Thomson Imperial Cancer Research Fund, PO Box 123, Lincoln’s Inn Fields, London WC2A 3PX email for correspondence: jf@acl.icnet.uk Abstract: This paper describes the development and application of PROforma, a unified technology for clinical decision support and disease management. Work leading to the implementation of PROforma has been carrried out in a series of projects funded by European agencies over the past thirteen years. The work has been based on logic engineering, a distinct design and development methodology which combines concepts from knowledge engineering, logic programming and software engineering. Several of the projects have used the approach to demonstrate a wide range of applications in primary and specialist care and in clinical research. Concurrent academic research projects have provided a sound theoretical basis for the safety-critical elements of the methodology. The principle technical results of the work are the PROforma logic language for defining clinical processes, and an associated suite of software tools for delivering applications such as decision support and disease management procedures. The language supports four standard objects (decisions, plans, actions and enquiries), each of which has an intuitive meaning with well-understood logical semantics. The development toolset includes a powerful visual programming environment for composing applications from these standard components, for verifying consistency and completeness of the resulting specification, and for delivering stand-alone or embeddable applications. Tools and applications that have resulted from the work are described and illustrated with examples from specialist cancer care and primary care. The results of a number of evaluation activities are reported to illustrate the utility of the technology. 1. Introduction Quality and consistency in most healthcare systems are highly variable, and occasionally lamentable. Traditional clinical practices are proving unsustainable (e.g. overuse of drugs and investigations, waiting lists measured in years), and the costs of medical services continue to rise inexorably. In addition, highprofile media exposés of clinical errors, and increases in litigation following North American trends, are forcing many European governments and healthcare agencies to acknowledge that their services have structural problems. It is not a sufficient response simply to blame doctors, as tends to happen in litigation. Most doctors are doing their best, but have too much to do and too little time to ensure that all their decisions attain an optimal balance between achieving the best care for their patients and the most efficient use of resources. Many organizations are therefore looking for new solutions, and many are asking whether information technology can help doctors carry out their job more effectively, without usurping their traditional professional roles and responsibilities. This paper reviews a line of work that aims to show that advanced techniques for supporting clinical decision making and disease management at the point of care can make a significant contribution to the process of patient care 1. While the objectives of the work are common to many other projects worldwide, the solutions proposed exploit technological and other strengths that are particularly European. The idea that computers can be used to provide various forms of assistance to clinicians, such as better clinical records, timely prompts and reminders, and assistance in following care pathways, is far from new. A number of well-known centres, particularly in North America, have pioneered such systems, in some cases over decades. We are now beginning to see that their faith was justified, as objective benefits of computer support on clinical effectiveness [1] and cost-effectiveness [2] are emerging. Until quite recently, the funding available in Europe in this field was relatively modest, but about ten years ago the European Union (EU) established a series of research and development programs in medical informatics. Since then the level of activity and range of projects has expanded substantially. Although the emphasis of European projects can be different from those in North America (e.g. a greater concentration on primary care) common themes are also apparent. For example, a renewed interest in established methods (e.g. Arden Syntax [3] ) and a desire to develop new methods for standardising medical knowledge and disseminating best clinical practice (e.g. clinical coding systems, guideline interchange formats [4] and technologies for delivering decision support [5, 6]). In the field of decision support, European developments have paralleled North American trends. There has been a radical shift in emphasis away from providing support for diagnosis decisions (as described in de Dombal’s classic work on diagnosis of acute abdominal pain [7]) towards support for patient management (e.g. the UK health service’s experimental prescribing system, Prodigy [8], and the Dutch trial of the Bloedlink system for assisting GPs in ordering blood tests [9]. As elsewhere, current developments are profoundly influenced by the emergence of the Internet, as indicated by the explosion of clinical guidelines published on the World Wide Web. Work on advanced clinical information technologies is now developing strongly in Europe. Mainstream R&D in new technologies (e.g. multimedia), communications (e.g. the Internet and intranet technologies), 1 We acknowledge the contributions of the following to the work described in this paper: Pedro Barahona, Ahmed Benlamkadden, Andrew Coulson, Subrata Das, Paul Ferguson, Claude Gierl, David Glasspool, Colin Gordon, Nicky Johns, Andrew Newbigging, Peter Hammond, Ian Herbert, Jun Huang, Andrew Jackson-Smale, Peter Johnson, Philippe Lagouarde, Joanne Lau, Rob Martil, Mike Morris, Mees Mosseveldt, David Pitty, Ali Rahmanzadeh, Phil Reeves, Jean-Louis Renaud-Salis, Pierre Robles, Derek Shafer, Paul Taylor, Johan van der Lei, Robert Walton. and object-oriented programming languages and tools (e.g. CORBA, JAVA) are strongly influenced by US developments. However, Europe is also showing distinctive strengths in certain areas. For example, formal methods for software engineering (e.g. Z [10], VDM [11]) have always been strong, and these are being transferred into knowledge engineering (e.g. ML 2 [12], DESIRE [13]). There is also a strong European tradition in the development of software technologies based on mathematical logic (e.g. logic programming languages like Prolog, deductive databases, constraint solving systems), and these are having considerable influence on new technologies for machine learning and software agents. There is a growing interest in safety-critical applications using both conventional methods [14] and logic-based and AI techniques [15, 16, 17]. Our own work on medical systems has been particularly influenced by formal software engineering [18], and the PROforma technology described below exploits a number of ideas from these areas. The work described in this paper is characterised by an emphasis on rigorous design and systematic development methodologies for clinical applications, based on an integration of methods from logic programming and software engineering, or logic engineering [19]. 2. Logic engineering in medicine Figure 1 summarises a line of research and development that began in 1986, aimed at creating a unified technology for clinical decision support and disease management based on the logic engineering methodology [15]. The diagram provides a map of the projects referred to in the paper and their interdependencies. CREW RED Oxford system of medicine LEMMA DILEMMA MACRO InferMed PROMPT RAGS PRESTIGE CADMIUM CADMIUM II Figure 1: Map of projects covering period 1986-98. URLs for project WWW sites are as follows: PROMPT: http://www.acl.icnet.uk/lab/rtp_home.html; MACRO: http://www.eortc.be/macro; PRESTIGE: http://www.rbh.nthames.nhs.uk/rbh/itdept/r&d/projects/prestige.htm; CADMIUM: http://www.chime.ucl.ac.uk/HealthI/CADMIUM/; DILEMMA: http://www.acl.icnet.uk. RED: http://www.augusta.co.uk/safety/listing/red.htm. InferMed is at: http://www.infermed.co.uk/. The origins of the research are in a decision support system called The Oxford System of Medicine (OSM), sponsored by Oxford University Press and aimed at general practitioners (GPs) [20, 21]. The OSM set out to demonstrate how retrieval and browsing functions for electronic versions of conventional published material could be combined with decision support. The Oxford Textbook of Medicine, a comprehensive medical reference widely used by GPs in the UK, was the target publication. The resulting demonstrator system included a range of decision making functions (for differential diagnosis, selection of tests and investigations, therapy selection and simple prescribing decisions), together with a flexible text browser. The main innovation of the OSM was a general decision procedure expressed in first-order logic rather than in traditional quantitative formalisms based on probability. This procedure used meta-level reasoning techniques to automate a number of steps in decision making which are not addressed by conventional methods. These include the generation of decision options, identification of relevant information sources, and the construction of arguments for and against alternative options [22]. Although at the time, the method did not have the formal strength of statistical decision theory, it proved to have great flexibility and versatility. Later projects provided a secure theoretical foundation for the technique as well as many applications. 3. EU funding of Advanced Informatics in Medicine In 1987, the EU funded the LEMMA2 project whose main objective was to carry out technical refinements of the logic engineering approach, and to establish whether the decision making techniques developed for general practice could be applied more widely, notably in cancer care [e.g. 23]. The project was significantly influenced by the well-known work on ONCOCIN at Stanford University, but made use of the OSM’s logical decision procedure for cancer diagnosis, staging and protocol eligibility decisions. In addition, as cancer protocols typically require a number of tasks to be carried out over time, the OSM model had to be extended to manage control of task execution. Standard logic programming techniques were used to implement rule-based scheduling for protocol execution, based on a task life-cycle model and pattern directed state-transition rules. These methods were also adopted in CREW 3, a prototype system for supporting protocol-based clinical trials, and the MACRO clinical trials system, described below. The LEMMA project laid the foundations for the DILEMMA project (1992-1994) whose objectives were to:  Develop a decision making and task management engine to permit applications to be embedded in third-party and legacy systems;  Integrate decision support and task management with electronic medical records;  Transfer task management and decision support techniques to new domains, specifically cardiology, and multidisciplinary shared care. Dilemma's main results in the areas of tools and applications were as follows: 2 Logic Engineering in Medicine: Methods and Architectures. Collaboration with Institut Bergonié, Bordeaux and the New University of Lisbon, Portugal. 3 Clinical REsearch Workstation Generic tools   A 'knowledge based' electronic medical record (EMR) system [24], using a standard patient data model and providing storage of care plans. The EMR provided a variety of views of the data, including diagnosis, problem-, treatment- and time-oriented views. An integrated decision support and protocol execution engine (DSPE) which was interoperable with other applications, such as graphical user interfaces and patient record systems. Applications The DSPE was successfully used to implement the following applications: 1. A breast cancer protocol manager, integrated with the DILEMMA EMR system [25]. 2. PARSEC4, which demonstrated decision support functions in a cardiology setting [25]. 3. CAPSULE5, a decision support application for routine prescribing in primary care [25]. This used a knowledge base of drugs and their attributes (uses, side-effects contraindications, costs etc.) to generate a short-list of potential medications appropriate for a specific patient. 4. The SYNERGY system [25], which was linked to a commercial EMR to deliver guidelines (for the management of acute asthma and otitis media) at the point of care. The DSPE was responsible for data acquisition, task control and decision support. Research work in DILEMMA also explored the use of qualitative reasoning in drug prescribing and dosage decisions [26], and developed techniques from distributed AI and multi-agent systems to explore problems in supporting multidisciplinary care [27]. In the latter activity, software agents were designed to act as assistants for several clinical staff in a scenario of shared care of breast cancer patients. Each agent contained a DSPE, extended to include a set of transaction rules for implementing communication of patient data, delegation of tasks and management of workflow. Related projects The RED6 project, which is discussed in detail later, aimed to develop a formal foundation for the concepts and techniques of logic engineering. It was especially concerned with developing techniques for ensuring that decision support systems are safe. A focal application for the work was in the management of chemotherapy protocols. The OaSiS7 system [17] built on ONCOCIN and earlier work in LEMMA and DILEMMA, introducing a logic-based decision support function (figure 2). 4 PAtient Records for SharEd Care, developed at the Royal Brompton Hospital, London Computer Aided Prescribing Study Using Logic Engineering 6 Rigorously Engineered Decisions, funded by the UK’s Engineering and Physical Sciences Research Council within a national research programme on safety-critical software. 7 Oncology Support System using Artificial Intelligence 5 Figure 2: A view of the OaSiS chemotherapy protocol manager. This provided support for routine collection of data and prescribing of cytotoxics, together with functions for hazard alerts and decision support. OaSiS initiated an important line of work: the systematic investigation of requirements for managing therapeutic procedures with possible safety implications. At the start of the study, Hammond reviewed in detail some fifty cancer therapy protocols to identify clinical circumstances where safety issues arose (such as where the absence of treatment might be hazardous, or where inappropriate or continued use of particular treatments might be dangerous). For example, Hammond identified the following remarks in two unrelated protocols: “Use of pre-hydration or anti-emetics [is recommended] to counteract the dehydration due to vomiting, or the vomiting itself arising as a side-effect of many chemotherapeutic drugs”; “Folinic acid rescue carried out in conjunction with administratio of high dose methotrexate chemotherapy to try to balance the bone marrow side-effect”. He was able to translate these into a clear general principle that it is necessary to: counteract any expected, serious, toxic effect of a planned chemotherapeutic action. Hammond identified nine domain-independent principles of this kind [17] and expressed them in the form of first-order logic rules, such as: IF component(Plan,Action1) AND AND AND AND THEN produces(Action1, Effect) unsafe(Effect) OR undesirable(Effect)) can_ameriorate(Action2,Effect) not incompatible(Plan,Action2) add_to_plan(Plan,Action2). Such rules can be executed in a logic programming language, such as Prolog, for automatic modification of the design of therapy protocols, and dynamic repair of care plans. The DILEMMA project also led to an effort to address important issues in medical image processing and interpretation. Although many patient data are symbolic, much clinical information comes in analog form (e.g. signals from monitoring devices and imaging systems). The integration of signalanalysis and clinical decision making in a general way remains an unsolved problem. Furthermore, the fields of imaging and signal processing and those of knowledge based systems, decision support and protocol based care are sharply separated, in terms of the functionality which is required and the technologies which are used [28]. The CADMIUM8 project produced a prototype “radiology workstation” (figure 3) which introduced a number of innovations [29], notably a demonstration of the use of logic programming techniques to link processes of reasoning over qualitative symbolic data (e.g. patient history data) with problem-directed measurements and descriptions of image features. CADMIUM has been shown to improve decision making by radiographers looking for abnormal micro-calcifications in mammograms (see below). The work has also resulted in a knowledge model of several important decision tasks in radiology - detection, measurement and description. The CADMIUM II project is now starting, funded by the UK’s Engineering and Physical Sciences Research Council and the Imperial Cancer Research Fund. The major objective is to reengineer the prototype as a practical clinical system and to assess systematically its value in routine radiology. 8 Computer Aided Decision-Making and Image Understanding in Medicine. Figure 3: A view of the CADMIUM system being used in breast cancer screening. The background shows a mammogram, together with a patch of a breast which contains micro-calcifications. These have been automatically located by the software; an enhanced image appears top right. The inset panel at bottom left shows the state of execution of the screening protocol (the tasks which have been complete or are intended for execution), and the panel at bottom right presents an assessment of the abnormal features in terms of their support for malignant or benign conditions. The execution of the image processing is carried out automatically as part of CADMIUM’s decision procedure. A more recent application of logic programming and symbolic decision processes has been developed for assessing and communicating genetic risk information. As more and more genetic markers for serious diseases become available, healthcare professionals may be expected to carry out a number of tasks in this area: systematic collection of family history data; construction of an appropriate family tree; calculation of the appropriate disease risk, and finally explanation of the results to the enquirer. The RAGs system, illustrated in figure 4, has been designed to support all these activities, focusing initially on breast and ovarian cancer. Figure 4: A view of the RAGs risk assessment system. RAGs automatically generates a pedigree from family data, assesses the arguments for and against elevated risk, and generates a text report. The program generates appropriate forms for collecting information about the client and her family, and uses this information to construct a family tree. All information gathered is then used to appraise the person’s risk level (high, medium or low), and a report is generated which summarises the arguments for and against this assessment. The application is currently undergoing evaluation in a study with general practitioners. 4. Industrial strength technologies and commercialisation The goal of the PROMPT9 project (supported by the EC’s 4th Framework Health Telematics programme, 1996-8) is to develop the core technology of a clinical workstation that incorporates the functionality developed in the earlier projects. The structure of the PROMPT workstation is summarised by the cloverleaf motif in figure 5. 9 PROtocols for Medical Procedures and Therapies PROforma Reference data and knowledge Patient record system Comms & shared care Figure 5: Component parts of the PROMPT clinical workstation The patient record system [24] is based on the patient data model and prototype EMR developed during DILEMMA. The Reference data and knowledge component is a browser providing access to a collection of information sources, held locally or remotely on a network. The shared care component includes a set of conventional communication functions. Initially, this will permit multiple users to share patient information and care plans. Later, we plan to introduce support for delegating and managing clinical tasks between members of a care team, based on the technology described by Huang et al [27] and the KQML agent transaction language which is widely used in multi-agent systems research [30]. The fourth component of the PROMPT workstation, the PROforma toolset, is the main current focus of our logic engineering methodology. It is designed to add value to the other functions by adding reasoning, decision making, task scheduling, plan management and other capabilities to conventional patient data retrieval and analysis, medical knowledge and protocol libraries, intersystem and inter-agent messaging. We shall look at PROforma technology in more detail in section 6, but briefly its main components are as follows:  The PROforma formalism, an extended logic language oriented toward the specifications of decisions, plans and other medical tasks. It is a knowledge representation language in the artificial intelligence tradition, and a formal specification language as that term is used in software engineering.  Composer, a graphical design environment for developing and verifying PROforma specifications.  Performer, an encapsulated execution engine which can enact a PROforma specification and can be embedded in point of care or legacy applications. PROforma technology also provides a central software component for designing and managing clinical trials, which are being developed in a further EU funded project, MACRO10. The principle services that MACRO offers to researchers are a graphical study design tool which is used to create data collection schedules, and electronic case report forms incorporating data verification (figure 6). Each study is defined in an extended version of the PROforma11 language such that a definition can be automatically exported to centres collaborating in a trial via the World Wide Web. Each collaborator can access the study definition via an MS Windows client application, or via a standard WWW browser. Whatever software is used, the client can run the trial protocol as required, e.g. for displaying active forms for entering and validating data, or for Figure 6: A view of the study design component of the MACRO multicentre clinical trials software. A trial is modelled as a collection of information recording tasks carried out during patient visits (such as completing electronic forms to report patient data or adverse events during chemotherapy sessions in an evaluation of therapeutics in breast cancer). The 'visit by task' matrix 10 co-ordinated by the European Organisation for Research into the Treatment of Cancer, Brussels. The PROforma language is an interchange language for tasks only and is not a formalism for specifying visual layouts, such as forms or graphical user interfaces. 11 (shown here) and the data entry forms are created with "drag and drop" design tools. The PROMPT DSPE is used for task management and for data validation at run time. exporting valid data sets in encrypted form to the trial administration centre. At the time of writing, MACRO Trial Manager has been adopted for use in cancer therapy trials by the British Medical Research Council and the European Organisation for Research into the Treatment of Cancer, though the software can be used for managing controlled trials in any medical field. An interactive demonstration can be found at http://www.infermed.com/macro/demo. 5. Evaluation studies Empirical evaluation studies of a number of the applications described earlier have been carried out during a number of the projects described. In 1990, a special issue of Medical Informatics on "Validation and testing of medical decision aids” [31] published results from ten European projects, many of which are still of interest. O’Neil and Glowinski [32], for example, considered the special methodological difficulties of evaluating large applications, using the Oxford System of Medicine as an illustrative case. They considered central issues like medical performance (accuracy, sensitivity and specificity of a decision aid), and other important questions such as the coverage, robustness, flexibility and relevance of the advice offered by a system. Their paper also included an evaluation of the OSM decision procedure applied to the question of admitting a patient into a coronary care unit on the basis of his/her classification into particular diagnostic categories. Using ROC analysis, the performance of the OSM was compared on a number of dimensions with a standard Bayesian method. While noting that a single comparison is not sufficient to address all the relevant assessment criteria, they found that in terms of decision making, the performance of the two methods was indistinguishable. Many in the decision making community will find this result surprising, since the logical procedure essentially ignores the strength of individual pieces of evidence, while the Bayesian procedure uses all the available information. Nevertheless, similar results have now been reported in a variety of studies [33, 34, 35]. On the basis of considerable experience, we would make the following claim: in much practical medical decision making, the crucial thing to get right is the logical structuring of the problem. If it is wrong, no amount of precise calculation will produce a correct decision; if it is correct, in very many applications quite simple decision procedures will produce the same recommendation as those based on precise calculations. In rare situations where the procedures produce different results, the evidential basis for the different recommendations will be marginal, and it may be unwise to take important decisions without further corroboration. The performance of the CAPSULE prescribing system has also been quantitatively assessed. In a carefully designed crossover study [36], 42 general practitioners were presented with 36 simulated patient cases, constructed from actual consultations. Computer support in the control condition consisted simply of an alphabetic list of all available medications. Two further conditions were defined: in the first, the computer suggested a short list of drugs appropriate for the specific patient; in the second, the arguments for and against each of these alternatives were also made available. Computer support significantly improved the quality of prescribing in a number of respects. With decision support, decisions made by the doctors agreed with the recommendations of an expert panel 70% more often than without support. In the control condition. the doctors ignored a cheaper, equally effective drug in 50% of cases; this figure reduced to 35% when decision support was provided. In terms of acceptability, 88% of the GPs found the system easy to use, and 59% said they would be likely to use it in practice. A demonstration of CAPSULE can be found at http://www.infermed.com. A further evaluation of the use of a decision procedure based on logical argumentation has been carried out in decision support involving image interpretation. One application of the CADMIUM radiology workstation is in breast cancer screening. A protocol has been developed which sets out the routine tasks required, including patient registration, mammography, a review of mammograms and an assessment of whether there is any reason to suspect the presence of a malignancy. When CADMIUM gets to the point where this latter decision is to be made, the decision procedure automatically launches the appropriate image processing and interpretation programs (as described in [37]). The imaging functions segment out the breast, then search for abnormalities, notably calcifications (illustrated in figure 3). Calcifications can result from both malignant and benign conditions, but their density, form, clustering and other characteristics provide information which can be used to distinguish their type. Taylor et al [38] carried out a study of whether the decision support that CADMIUM provides can improve the classification of micro-calcifications by radiographers. Twelve radiographers with specialist training in mammography were presented with a number of standard reference images from patients with known diagnoses. In the decision support condition, CADMIUM carried out the image processing and provided the radiographers with a report on any calcifications present (in the form of arguments for and against various malignant or benign conditions) based on their features. Decision support produced a significant improvement in the radiographers’ decisions, by increasing hits and correct rejections and by reducing misses and false positives (table 1). TABLE 1: EFFECT OF DECISION SUPPORT ON RADIOGRAPHERS' CLASSIFICATIONS OF MAMMOGRAMS Without decision support With decision support True positives False positives True negatives False negatives 11 8 58 11 15 4 62 7 Taylor’s work demonstrates a principled way of integrating the diverse worlds of decision support and imaging by using logical argumentation to achieve two goals simultaneously: to interpret information extracted from the image, and to control the execution of the image processing software which carried out the extraction. The results suggest that this technique may also have practical diagnostic value. Finally, a pilot study of the impact of a disease management guideline is being carried out in a primary care setting within the PROMPT project12. This is a multi-task guideline for patients presenting with dyspepsia. The guideline is broadly similar in structure to those implemented in the Synergy system mentioned earlier. It provides the GP with prompts and reminders for collecting relevant information in carrying out a number of decisions, including whether to refer a patient for specialist investigation, whether to investigate with endoscopy, or whether to prescribe h2-antagonists, proton-pump inhibitors etc. Table 2 summarises the results obtained for 80 patients seen by GPs in the researchers’ own practice. All the doctors in this practice have computer systems on their desks, and they could access the support system by simply pressing a button on the computer screen. Patients presented in the usual (unpredictable) way and when the doctor realised that the patient was complaining of dyspeptic symptoms then s/he would either manage the patient according to routine practice, or by using the computerised guideline, according to a predetermined schedule. TABLE 2 EFFECTS OF DYSPEPSIA GUIDELINES ON GP DECISIONS Av. No. of consultations for this problem Av. Cost of medication Av. No. Blood tests Av. No. Endoscopy/XR Av. No. referrals Without Guideline (n=40) With guideline (n=40) 1.65 1.5 £26.6 £8.45 0.05 0.2 0.1 0.2 0.1 0.4 The results shown in table 2 suggest that decision support has a clear effect on practice by reducing numbers of consultations and medication costs, and increasing investigations and referrals. The participating doctors consider that the guideline is simple to use, and suggest that it could help improve quality of care by enabling:   12 Earlier pick up of cancers; A shorter period of maintenance medication with expensive proton pump inhibitors or H2 antagonists; The study is being carried out by Drs. Colin Lyons, Peter Wilson and colleagues at North End Medical Centre, London.   Earlier eradication and cure of proven ulcers; Better targeting for prescribing. These results demonstrate the versatility of the logic-oriented approach to the design of medical decision support systems, and suggest that they have a useful contribution to make in practical patient care. 6. Towards a sound and rigorous methodology for logic engineering As with any medical technology, the efficacy of individual applications cannot be considered in isolation; in our view we also have an obligation to show, as far as possible, that the general techniques are formally sound and that the technology is safe. This position is based on an analogy with engineering in other safety-critical areas such as aerospace engineering. Just as practical trials are important before introducing any new clinical technology, test flights are an essential part of aircraft engineering. However, the high safety demands on aircraft builders have led them to develop formal design theories to guide and verify the design and construction processes. The value of such theories is that they can help to predict the properties and failure modes of an aircraft design before it is built. This has major cost as well as safety benefits. The RED13 project explored the use of formal theories of reasoning and decision making, rigorous specification techniques, and systematic design and implementation of software in order to optimise soundness and safety. The definition of a formal framework for describing clinical decision making and process management was the main focus of the theoretical work in this project. It is summarised by the “domino model” shown in figure 7. Clinical goals Patient data Actions Possible solutions Decisions Care plans Figure 7: The “domino” model of clinical decision making and process management Each node of the domino represents information that is relevant to a particular clinical situation, such as facts about a patient’s history, decisions and other tasks in progress, actions which are 13 Rigorously Engineered Decisions. planned. Each arrow represents an inference procedure that uses information of the type shown at the arrow's tail in conjunction with information from a patient record and/or general medical knowledge base, in order to generate information of the type shown at its head. This model is an abstraction of the various decision types explored in the applications projects described earlier (in general practice, oncology, cardiology and radiology), and the task management functions required for enacting practice guidelines, therapy protocols and other clinical procedures (in oncology, radiology, clinical trials). The decision logic is subsumed by the four nodes to the left of the diagram, and the processes required to schedule and enact tasks are summarised by the four nodes to the right. The RED project formalised the inference processes of the domino model in terms of nonclassical logics, and demonstrated the consistency and completeness of the logics [19, 39]. To illustrate this work, we may consider a central concept in medical decision making, that of uncertainty management (e.g. uncertainty about the diagnosis or prognosis of a patient, the severity of a condition, which tests to do or medication to give). The intuitive method of reasoning under uncertainty used in all the applications outlined above emphasises analysis of the reasons for decisions, rather than the classical framework of abstracting uncertainty into a number. This idea can be given a clear interpretation in terms of a logic of argument (LA [40, 41]), a kind of labelled deduction system [42]. Formal argumentation can be used in combination with classical quantitative decision parameters (probabilities and utilities) or alone when quantitative information is not available [41, 43]. A second important result of the RED project was the development of a systematic methodology for developing applications based on the formal knowledge representation, decision making, and task-scheduling theory developed during the project and summarised in the domino model. This provides a strong set of constraints for checking consistency and completeness of an application knowledge base. The primary methodology incorporates a conventional software life-cycle model (figure 8) which is grounded in the conceptual framework summarised in the domino model. The PROforma toolset supports steps 2-5 of the development methodology shown in the figure. The toolset combines the logic-based representation developed in RED with an object-oriented representation of clinical tasks in order to achieve a balance between the clarity and precision of a formal specification language, and a more intuitive representation of clinical concepts and procedures. The domino model provides a good framework for defining the formal semantics of PROforma [39], but it is rather abstract. We have therefore developed a complementary view in which clinical procedures are conceived in a way that emphasises the clinical activities that the application is designed to support. These are captured naturally using the concepts of objectoriented programming. 1. Generalised design theory 2. Application task analysis 3. Detailed logic specification 4. Analysis & verification of PROforma specification 5. Execution testing Figure 8: The RED logic engineering development lifecycle In the object-oriented view of PROforma, an application such as a protocol or clinical guideline is modeled as a plan made up of one or more tasks. PROforma supports four basic classes of task. 1. A decision is a task that involves a choice of some kind, such as a choice of investigation, diagnosis or therapy. The PROforma specification of a decision task defines the decision options, relevant information and rules by which the pros and cons of different options can be derived. 2. An action is a clinical procedure that is to be enacted outside the computer system, such as the administration of an injection. 3. An enquiry is an action that yields information. The specification of an enquiry includes a description of the information required (e.g. the type, range and other properties of a clinical parameter) and a method for getting it (e.g. by creating a data entry form, by querying a local patient record or a remote laboratory database, or by processing an image). 4. A plan is a set of tasks, which are to be carried out to achieve a clinical (e.g. therapeutic) goal. Plans are composed of any number of PROforma tasks, including sub-plans, and are usually ordered to reflect logical, temporal or other constraints. Generic task Figure 9: The task ontology supported by the PROforma guideline specification language Tasks are defined with a class hierarchy or ontology, as shown in figure 9. They are modelled in an object-oriented style in the usual sense that: (a) Any specific clinical task is seen as an instance of some more general class of tasks. Each class is eventually a specialisation of a root task, (at the top of the figure). Any class can be further specialised into particular sub-types. For example, decisions can be specialised into diagnosis decisions and therapy decisions; plans may be specialised into research protocols and routine guidelines. (b) Every task inherits part of its specification from the classes above it in the hierarchy, expressed as a set of attributes. A task is distinguished from its parents by the possession of additional attributes, and distinguished from its siblings by different values for their common attributes. (c) When tasks are enacted, the communication of information between tasks is achieved by passing messages between encapsulated task objects, rather than by explicit procedure calls or other mechanisms that require access to the internal definition of a task. Detailed technical definitions of the PROforma task hierarchy are given in [19]. Figure 10: The PROforma Composer (composition tool). The centre panel is used to lay out the basic structure of an application using icons for the four classes of task. Each recursive task (a plan) adds another design area which can be accessed as required. Arrows are used to define logical and temporal relationships between tasks, and details of each task are provided by means of tools accessed via the task table on the right. A global overview of the whole task structure is shown via the tree on the left. Automatic verification of any task, or the whole application, can be invoked at any time. The inset panel (bottom left) shows the result of calling the task checker. This uses the strong task model which underpins PROforma to identify incomplete and inconsistent definitions in the task specification. The main concepts of the domino model are embodied in PROforma. The model can be used to specify the knowledge required in medical decision making and the management of clinical procedures, and is sufficiently well defined to permit a high degree of automatic analysis and verification of applications. Figure 10 shows the guideline Composer, which supports the representation of clinical procedures in terms of the four standard tasks. Here it is being used to integrate several tasks concerning the management of breast cancer (which were discussed earlier as separate applications). Further tools are provided to populate these outline task definitions with medical content, and verify the consistency and completeness of the definitions. The process of building an application consists of laying out a structure of this kind, and populating the details of each task using a standard set of templates for each task class. The system then checks the application definition, generating an error and/or warning report if necessary. When the application is complete, a comprehensive set of test functions is provided to permit the designer to step through the application to ensure that its behaviour is as intended. The PROforma toolset has been successfully used to implement a wide variety of applications in primary, secondary and tertiary care. It is available as a commercial product and for academic use from InferMed [44].14 7. Conclusions The emergence of knowledge based systems, and European strengths in formal software engineering and logic programming, created an opportunity to develop the new approach to clinical decision support and disease management described in this paper. There have been many promising results from the work, of which a mature example is the PROforma language and toolset. Like other researchers in medical informatics, we have found that there are many pressures to produce quick results. But in a safety-critical field like medicine research must be both broad and deep. The creation the PROforma technology was only possible because of longterm funding provided through national and European research programs. REFERENCES [1] M. Johnstone, K. B. Langton, B. Haynes, & A. Matheu, “Effects of computer-based clinical decision support systems on clinician performance and patient outcome. A critical appraisal of research”. Annals of Internal Medicine, vol. 120, pp. 135-142, 1994. [2] R.S. Evans, S.L. Pestonik, D.C. Classen, et al. “A computer-assisted management program for antibiotics and other anti-infective agents”. New England Journal of Medicine, vol. 338, no. 4, Jan 22nd, 1998. [3] G. Hripcsak, P. Ludemann, T.A. Pruor, O.B. Wigertz, P.B. Clayton, “Rationale for the Arden Syntax”. Computers in Biomedical Research, 27, 291-324, 1994. [4] http://dsg.harvard.edu/public/intermed/glif_overview.html : "Introduction to the InterMed Common Guideline Model and Guideline Interchange Format (GLIF)". [5] M.A. Musen, S.W.Tu, A.K. Das, Y. Shahar, “A component based architecture for automation of Protocol-Directed Therapy”. Proceedings of the 5th Conference on Artificial Intelligence in Europe, Springer, Berlin, 1995. [6] J. Fox, N. Johns, C.Lyons et al, “PROforma: a general technology for clinical decision support systems”. Computer Methods and Programs in Biomedicine vol. 54, pp. 59-67, 1998. [7] F.T. De Dombal, “Computer-assisted Diagnosis of Abdominal Pain”. Advances in Medical Computing, Proceeding of the 3rd International Symposium on Computers in Medicine, Blackburn, UK, J. Rose, & J.H. Mitchell, (eds), 1975. 14 InferMed is a joint commercial venture by the Imperial Cancer Research Fund and Integral Solutions Ltd. It has been established to supply commercial products based on the generic PROforma technology and specific applications, including CAPSULE and MACRO Trial Manager [8] I.N. Purves, PRODIGY: Implementing clinical guidance using computers, editorial, British Journal of General Practice, in press. [9] M. van Wijk, M. Mosseveldt, J. van der Lei, "The impact of guideline-based decision support on general practitioners' test ordering: a randomised control trial". To appear in: proc. of EPRIMP, the international working conference on Electronic Patient Records in Medical Practice (Rotterdam, 8-10 October 1998). [10] J. Spivey, "The Z Notation. A Reference Manual", Prentice Hall International, London, 1988. [11] C.B.,Jones, "Systematic Software Development Using VDM", Prentice Hall International, London 1986. [12] F. van Harmelen, J.R. Balden, “(ML)2: a formal language for KADS models of expertise”. Knowledge Acquisition (special issue: The KADS approach to knowledge engineering) vol. 4 no. 1, 1992. [13] F.M.T. Brazier, B. Dunin Keplicz, N. Jennings, J. Treur, “DESIRE: Modelling Multi-Agent Systems in a Compositional Formal Framework”. International Journal of Cooperative Information Systems, special Issue on Formal Methods in Cooperative, Information Systems: Multi-Agent Systems, (eds. M. Huhns and M. Singh), vol. 6, pp. 67-94, 1997. [14] F. Redmill, & T. Anderson, (eds.), Safety-critical Systems, Chapman & Hall, 1993. [15] J. Fox, “On the soundness and safety of expert systems”. Artificial Intelligence in Medicine, vol 5, pp.159-179, 1993. [16] D. Robertson, & P.J. Krause, (eds.), Knowledge Engineering Review, special issue on safetycritical systems, vol. 12, no.3, Sep. 1997. [17] P.Hammond, A.L Harris, S. K. Das, & J.Wyatt, “Safety and Decision Support in Oncology”. Methods of Information in Medicine, vol 33, pp. 371-381, Apr. 1994. [18] P. Krause, J.Fox, M. O’Neil, & A. Glowinski, “Can We Formally Specify a Medical Decision Support System?”, IEEE Expert, pp56-61, June 1993. [19] J. Fox and S. Das, “Artificial Intelligence in Hazardous Applications”, forthcoming, 1998. [20] J. Fox., and A. Rector, “Expert systems for primary medical care?” Automedica 4, pp. 123130, 1984. [21] J. Fox, A.Glowinski, C.Gordon, S. Hajnal, & M, O’Neil, “Logic Engineering for Knowledge Engineering: Design and implementation of the Oxford System of Medicine”. Artificial Intelligence in Medicine, vol. 2, no. 6, pp. 323-339, Dec 1990. [22] J. Fox, D.A. Clark, A.J. Glowinski, M.J. O'Neil, "Using predicate logic to integrate qualitative reasoning and classical decision theory. IEEE transactions on systems, man and cybernetics, vol 20, no. 2, pp347-357, 1990. [23] P. Barahona, "A causal-functional model for medical knowledge representation". In: ed. E. Keravnou, Deep models for medical knowledge engineering, Elsevier, 1992. [24] P. Lagouarde, R. Thomson, J.L. Renaud-Salis et al, “The PROMPT electronic health care record”. In: J.Brender, J.P. Christensen et al (eds) Medical Informatics Europe ’96, IOS Press, Amsterdam, pp. 1055-1059, 1996. [25] R. Thomson, "DILEMMA: Decision Support in Primary Care, Oncology and Shared Care". In M. F. Laires et al (Eds), Health in the New Communications Age, IOS Press, 1995. [26] P. Barahona, "A causal and temporal reasoning model and its use in drug therapy applications". Artificial Intelligence in Medicine, vol. 6, no. 1, pp. 1-27, 1994. [27] J. Huang, N.R. Jennings, & J. Fox,"An Agent-based Approach to Healthcare Management”. International Journal of Applied Artificial Intelligence, vol 9, no 4, 1995. [28] P.Taylor, “Computer aids for Decision Making in Diagnostic Radiology – a literature review”. The British Journal of Radiology, vol 68, pp. 945-957, Sep. 1995. [29] P. Taylor, J. Fox, A. Todd-Pokropek, “A model for integrating image processing into decision aids for diagnostic radiology”. Artificial Intelligence in Medicine, vol. 9, pp. 205-225, 1997. [30] T. Finin et al. “ Specification of the KQML Agent-Communication Language”. The DARPA Knowledge Sharing Initiative, 1993. [31] J. Wyatt, D. Spiegelhalter, editors, special issue of Medical Informatics on validation and testing of medical decision aids, vol. 15, no.3, Sep. 1990. [32] M. O'Neil, A. Glowinski, "Evaluating and validating very large knowledge-based systems". Medical Informatics, vol. 15, no.3, pp. 237-251, Sep. 1990. [33] J. Fox, D. Barber, KD. Bardhan, "Alternatives to Bayes ? A quantitative comparison with rule-based diagnostic inference". Methods of Information in Medicine, Vol. 19, No. 4 pp. 210215, Oct. 1980. [34] T. Chard, "Qualitative probability versus quantitative probability in clinical diagnosis: a study using a computer simulation". Medical Decision Making, vol .11, pp.38-41, 1991. [35] M. Pradhan, M. Henrion, G. Provan et al, "The sensitivity of belief networks to imprecise probabilities: an experimental investigation." Artificial Intelligence, vol. 85, pp.365-397, 1996. [36] R. Walton, C. Gierl, P. Yudkin et al., “Evaluation of computer support for prescribing (CAPSULE) using simulated cases”. British Medical Journal No. 7111, vol .317, 27 September 1997. [37] P. Taylor , J. Fox, A. Todd-Pokropek, "Evaluation of a Decision Aid for the Classification of Microcalcifictions. Proc. of the 4th International Workshop on Digital Mammography,,Elsevier Science, (in press), 1998. [38] P. Taylor, J. Fox, A. Todd-Pokropek, "The development and evaluation of CADMIUM: a prototype system to assist in the interpretation of mammograms", Medical Image Analysis (submitted). [39] S.K. Das, J. Fox, D. Elsdon, & P. Hammond, “A flexible architecture for autonomous agents”. Journal of Experimental and Theoretical Artificial Intelligence, vol. 9, pp. 407-440, 1997. [40] J. Fox, P. Krause, S. Ambler, “Arguments, Contradictions and Practical Reasoning”. In: B. Neumann (ed) ECAI ’92, Proc. of 10th European Conference on AI, pp. 623-627, 1992. [41] J. Fox, S. Parsons, "Arguing about beliefs and actions". In: eds. A. Hunter and S.Parsons, Applications of uncertainty formalisms, Springer-Verlag, 1998 (in press). [42] D. Gabbay, "Labelled deductive systems", Oxford University Press, Oxford, UK, 1996 [43] P. Krause., S. Ambler, M. Elvang-Goransson, & J. Fox, “A Logic of Argumentation for Reasoning under Uncertainty”. In: Computational Intelligence, vol 11, no.1, 113-131, 1995. [44] http://www.infermed.com.

Practice guidelines and clinical effectiveness: A unified methodology

Related documents

Products

Support

Practice guidelines and clinical effectiveness: A unified methodology

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib