Can AI & Law Contribute to Managing Electronically Stored Information in Discovery Proceedings? Some Points of Tangency Position Paper ICAIL 2007 Workshop on Supporting Search and Sensemaking for Electronically Stored Information in Discovery Proceedings (DESI) Kevin D. Ashley Professor of Law and Intelligent Systems Senior Scientist, Learning Research and Development Center University of Pittsburgh ICAIL 2007 DESI Workshop © Kevin D. Ashley. 2007 1 Contributions from AI & Law or AI research with legal domain implications 1. Textual Case-Based Reasoning (TCBR) methods to analyze electronic corporate textual documents. 2. Natural Language Processing (NLP) techniques to recognize subjectivity and sentiment in text. 3. Techniques for visualizing evidential reasoning. 4. Incorporating AI & Law ontologies as a basic framework. ICAIL 2007 DESI Workshop © Kevin D. Ashley. 2007 2 TCBR methods to analyze electronic corporate textual documents • Extract info from textual narratives w/ underlying structure in terms of knowledge-related tasks (See Mustafaraj, et al. 2006) • E.g., machine maintenance diagnostic reports: – Observing, Explaining, and Acting. – Inspector typically observes a measurement which may be abnormal, explains it in terms of the problems it indicates, and recommends an action. • Hidden Markov Models (HMMs) capture ordering info. – Lee and Barzilay's (2004) Probabilistic Content Models: HMMs – States correspond to characteristic domain info – State transitions correspond to information-presentation orderings. • Assumes large corpus of comparable documents. • Apply in product liability litigation involving defective machinery. ICAIL 2007 DESI Workshop © Kevin D. Ashley. 2007 3 NLP techniques to recognize subjectivity and sentiment in text • Develop conceptual representation of private states and attitudes (Wilson, et al., 2006). • Human annotators code expressions of attitudes and private states, their sources and intensities in training set. • Machine learning programs learn to recognize sentiment and argumentative attitudes in new documents from the corpus. • Assumes corpuses contain the same types of documents, for instance, news reports. • Use to identify memos in securities or fraud litigation with subjective opinions or emotionally-charged recommendations. – Filtering them and target for further human analysis. ICAIL 2007 DESI Workshop © Kevin D. Ashley. 2007 4 Techniques for visualizing evidential reasoning • Conference showcased systems for visualizing legal implications of evidentiary documents in litigation. “Graphic and Visual Representations of Evidence in Legal Settings,” (2007) (http://tillers.net/home.html) – SEAS (Lowrance, 2007) supports template-based structured argumentation in a collaborative software tool – Structured arguments based on hierarchy of questions used to assess situation. – Model competing claims/defenses in negligence claim. • In large-scale litigation, attorneys employ on-line tools like CaseMap to organize/integrate physical/electronic documents. • Natural to link SEAS and CaseMap to help litigators collaboratively construct their legal arguments and integrate the documents. – Structured arguments as goal of electronic discovery management tools. – See (Ashley, 2007) for suggested SEAS adaptations to litigation. ICAIL 2007 DESI Workshop © Kevin D. Ashley. 2007 5 SEAS depiction of claim & affirmative defenses from legal rules to evidence From Lowrance, J. (2007) “Graphical manipulation of evidence in structured arguments” ICAIL 2007 DESI Workshop © Kevin D. Ashley. 2007 6 Incorporating AI & Law ontologies as a basic framework • Important “lesson learned” in AI & Law • Represent legal concepts in ontological frameworks and domain ontologies: – – – – – – – helps acquire, evaluate, share, and re-use knowledge bases overcomes brittleness of legal expert systems makes explicit assumptions about legal knowledge mediates storage of legal rules manages distinctions among concept types coordinates physical and legal institutional descriptions of events generates natural language explanations. • Detailed legal ontologies: – basis for indexing schemes to make sense of electronic documents discovered. (See, e.g., Breuker, et al. 2002) – Seeds for fleshing out ontologies automatically from electronic documents in corpus. ICAIL 2007 DESI Workshop © Kevin D. Ashley. 2007 7 How can AI help? How can anything help? • Better in-depth, public analysis of successes and failures in TREC results. • Focus on creating the mapping from the legal claim’s elements to the specific ways they might be instantiated in the documents of this dispute. – Examine how attorneys teach associates and paralegals what to look for in making that mapping. – Role of common sense violations of expectations given the job roles of the participants. – Working backward from the client’s injury to the possible causes and the documents they would likely generate. • Educating lawyers about how the document retrieval systems work, what they can/cannot do. – They need to understand system’s analysis so they understand limitations of results and can play to system’s strengths. ICAIL 2007 DESI Workshop © Kevin D. Ashley. 2007 8 How can AI help? How can anything help? (2) • Visualizations of information – Social networking of communications – Organizational structure of party (formal and informal), including command and control points. – Chronologies of events – Lawyer’s visualizations of their developing arguments • Good facts and bad facts. • Integrating computerized document management tools for organizing chronologies (e.g., Casemap) with argument visualizations. – Flag linkages among documents based on the information above and different assumptions about the importance of different events, times, and participants. ICAIL 2007 DESI Workshop © Kevin D. Ashley. 2007 9 How can AI help? How can anything help? (3) • Information extraction from text – – – – Automatically identifying entities and their relations Learning patterns in corpus and use to identify documents Flagging emotional language, polarity, and intensity But can machine learning be done from heterogeneous document corpuses? • Ontologies – How does one learn an ontology of applied terms in a legal domain case? How can it be done automatically? – Ontologies of different kinds: a-kind-of, a-part-of, means-same-as – Functional ontologies (e.g., in terrorism domain, frames for identifying money raising activities.) – This is an important part of sense-making and brings us back to the mappings from legal claim elements to the kinds of documents that may instantiate them. ICAIL 2007 DESI Workshop © Kevin D. Ashley. 2007 10 How can AI help? How can anything help? • Thanks to: – – – – – – – – – Raju, Sreenu Panegeotis, O. Palau, Raquel Mochales Moens, Marie-Francine Lee, Lawrence Bauer, Robert Austin-Hollands, Chris Attlee, Simon Ashley, Kevin ICAIL 2007 DESI Workshop © Kevin D. Ashley. 2007 11