Practical Natural Language Processing Harvesting Low Hanging Fruit Jeffrey P Ferraro, PhD (Jeffrey.Ferraro@imail.org) Scott L DuVall, PhD (Scott.DuVall@hsc.utah.edu) Outline • • • • • • • • • Application of NLP - involves interpretation of radiographs. Effects on NLP with and without radiology interpretations. Some issues around obtaining good NLP results. Difficult interpretations due to ambiguity in radiographs. What’s been easy and what’s hard. Alternatives tried to NLP that have failed. Describe some different approaches to NLP. Describe the mechanics of the method used. Working Example Early Detection Community Acquired Pneumonia (CAP) Significance • CAP along with influenza 8th leading cause of death in the United States. • ~6 million cases annually / about 500,000 – 1.1 million hospitalizations annually. Challenges • Diagnostic error rate for Pneumonia: 10% - 25% • High variability in hospitalization decisions among clinicians (38% - 79% of hospitalizations could not be explained by illness severity. • 10.7% treated as outpatients secondarily admitted within 7 day. • Early Detection – compliance w/Joint Commission (JCAHO) quality accreditation Benefits • Reduction in diagnostic errors. • Reduction in unnecessary hospital admissions (20 times more costly). • Rapid diagnosis and severity assessment for proper care & treatment. CAP Real-time Screening and eProtocol CAP eProtocol Vitals Labs Predictive Diagnostic Screening Likelihood of Pneumonia +/- Physical Exam Radiological Reports 1) Pleural Effusion IP vs. OP Treatment 2) Multilobe Infiltrates Severe CAP Criteria IP – ICU Tretment 3) Cavitary Disease MRSA Risk Factor Treatment Protocol Real-time Predictive Diagnostic Screening Bayesian Network NLP Interpretation Effect Average Merit 6951.986 +-26.887 1618.487 +-22.655 1120.249 +-26.959 1078.845 +-11.476 662.104 +-31.555 496.629 +-17.675 489.792 +-15.667 422.224 +-14.469 224.394 +- 9.062 203.584 +- 9.537 AUC with/NLP: 0.92 AUC without/NLP: 0.65 Average Rank 1 +- 0 2 +- 0 3.1 +- 0.3 3.9 +- 0.3 5 +- 0 6.5 +- 0.5 6.5 +- 0.5 8 +- 0 9.1 +- 0.3 10.3 +- 0.46 Attribute NLP Finding Temperature Heart Rate Chief Complaint Age SPO2 Respiratory Rate WBC Systolic BP Mean BP Obtaining Good NLP Results Conclusion • Radiology interpretations are necessary for good diagnostic prediction. Challenges • Reduction in ambiguous language results in better predictive capabilities • Standardization of Terminology Radiologists Response • Need complete and accurate clinical contexts. • Accurate protocol (film) selection NLP Challenges - Ambiguous Language Possible Pneumonia Clinical History: Cough and dyspnea. Study: PA and lateral chest on … Findings: No comparison studies available. Heart is normal in size. Aorta is moderately tortuous. Lung volumes are normal. Minimal airspace disease is identified within the region of medial right middle lobe, seen on frontal and lateral views. This opacity is consistent with focal atelectasis or possibly inflammatory process. No significant edema. No pleural effusion or pneumothorax. Impression: 1. Mild airspace disease in right middle lobe, as above. NLP Challenges - Ambiguous Language Positive Pneumonia PA and lateral chest radiograph Comparison: None Indication: Fever, cough Findings: The lungs are symmetrically inflated there is obscuration of the cardiac apex on the frontal projection with increased attenuation on the lateral view suggesting subsegmental lingular airspace disease. The diaphragm appears well visualized. Mild calcification noted at the level of the aortic arch. Mild ectasia of the descending thoracic aorta. Mild cardiomegaly. Osseous structures and soft tissues are unremarkable. Impression: 1. Subsegmental airspace disease of the lingula. Recommend a followup erect PA and lateral chest radiograph. 2. Mild cardiomegaly. 3. Mild atherosclerotic disease and ectasia of the thoracic aorta. NLP – What’s Easy and What’s Hard Clinical Findings using NLP • +/- Pneumonia – Fairly Good (ambiguity / standard terminology) Sen: 0.95, Spec: 0.81, PPV: 0.93, Acc: 0.91 • +/- Pleural Effusions – Good (succinct language) Sen: 0.93, Spec: 0.97, PPV: 0.84, Acc: 0.96 • +/- Cavitary Disease – Good (succinct language) Sen: 0.88, Spec: 1.0, PPV: 1.0, Acc: 1.0 • Single lobe or Multi-lobe Infiltrates – Poor (must be inferred by locations: RLL, RML, RUL, LLL, LUL, Right Lung, Left Lung) Error Propagation Problem Single Lobe: Sen: 0.65, Spec: 0.78, PPV: 0.82, Acc: 0.70 Multi-lobe: Sen: 0.78, Spec: 0.65, PPV: 0.58, Acc: 0.70 NLP Alternatives that Failed Templating <report body> .... Pneumonia Quality Assurance ----------------------------------------Parenchymal opacity c/w pneumonia in the appropriate clinical setting (yes | indeterminate): or No parenchymal opacity to suggest pneumonia. Multilobar or bilateral involvement (yes|no): Cavitation (yes|no): Pleural fluid (yes|no): No Compliance w/Templating • Complex cases • Proper Protocol (Film) & Clinical Context • Workflow / Productivity Impact (Template Selection) Approaches to NLP Direct Machine Learning (ML) Classification • Course Grained Approach – bag of words, sentences, n-grams, chunking phrases • Supervised Learning Methods (Need to know Truth) - Support Vector Machines - Gaussian Mixture Models - Bayesian Networks - K-Nearest Neighbor - Decision Trees - Random Forests Information Extraction & Rule based • Fine Grained Approach – extract clinical concept categories (e.g., appliances, state change, locations, clinical findings) • IE: Pattern Matching, NER, statistical inference models) • Classification (Inference Rules) Information Extraction & Machine Learning • Fine Grained Approach – extract clinical concept categories (e.g., appliances, state change, locations, clinical findings) • IE: Pattern Matching, NER, statistical inference models) • Classification (ML: Supervised Learning Methods) Course Grained Method + / - Pneumonia • Machine Learning Approach Random Forest (ensemble of decision trees) • Decision Trees from various perspectives (randomly constrain available features) • Majority Vote Rules Required Artifacts - Segmenter - decompose document to sentence - Sentence Level Annotation (positive evidence, negative evidence, no information gain) - Statistical Methods (10-fold cross validation, Bootstrapping) - Training and test data sets Course Grained Method -’ve +’ve ~IG The upper lobes are well aerated and normal . 0 The right lung appears clear . 0 The remainder of the lungs is clear . 0 The right lung and apical portion of the left upper lobe remains clear . 0 Left lung is clear . 0 No other areas of air space opacity are seen to suggest other regions of pneumonia . 0 There is no segmental or lobar consolidation . 0 There is an area of parenchymal lung opacity present in the left lower lobe and pneumonia is questioned . Left lower lobe pneumonia . 1 However , there is an area of patchy parenchymal opacity present posteriorly in the right lower lobe . 1 The appearance of this is suggestive of pneumonia . 1 There is confluent opacification of most of the posterobasal segment of the left lower lobe . 1 Left lower lobe consolidation , consistent with pneumonia . 1 There is complete opacification of the superior segment of the right lower lobe , with air bronchograms . Frontal view gives this consolidated segment a round appearance . 1 Segmental consolidation , consistent with pneumonia . 1 There is confluent opacity in the left base which blurs the diaphragm . 1 Left basilar subsegmental consolidation , consistent with pneumonia . 1 Spondylitic change is noted of the thoracic spine . 2 On the lateral projection this appears to be in the lingula and the lower lobe . 2 Heart size and vascular pattern are within normal limits . 2 Cardiac size is normal . 2 No extraventilatory air is seen . 2 Underlying bilateral severe bullous emphysema is also noted , with marked hyperinflation in the upper lobes . Central pulmonary arterial hypertension is noted . 2 Expansion is within normal limits . 2 Osseous mineralization is diffusely diminished . 2 Thoracic kyphosis is accentuated and there is severe collapse of several mid thoracic vertebrae . 2 1 1 2 Course Grained Method Feature Set (X) upper right lingula size lateral lobe pneumonia air seen 1 0 0 1 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 1 0 0 1 0 0 1 0 0 0 0 0 0 0 1 0 0 0 Truth (Y) 1 1 0 0 2 2 1 0 0 0 0 0 0 0 1 0 0 0 Course Grained Method Example Questions ?