Syed Farrukh Mehdi Reza Fathzadeh S. M. Faisal Abbas (Presenter) {fmehdi,reza,fabbas}@cs.dal.ca 1 Ontology ◦ Machine readable information Text ◦ Human readable information, most of the current information is text. Ontology Learning ◦ (Semi) automatic extraction of relevant concept and relations Medical Domain 2 Syntax based concept learning augmented with domain specific subject corpora Syntax Based Extraction Domain Specific Knowledge base 3 Medical Domain Terminology ◦ OpenGalen project GALEN Terminology Server For Other domains, domain specific terminology corpus should be used. 4 Paul Buitelaar 5 Parsing ◦ Linguistic Method Using Production Rules specified by linguists ◦ Statistical Method Using statistical models derived from written text. We used Stanford NLP Parser which is a statistical parser Dependency Trees instead of Parse Trees 6 Domain Specific Terminology Corpus Language corpus for general concepts ◦ GRAIL Terminology Server for Medical Domain ◦ WordNet for English Language 7 Intension ◦ Formal and information definition of terms Extension ◦ Deriving concepts Linguistic Realization ◦ Concept coverage 8 Terminal Concept ◦ Nouns, Noun Phrases Compound Concepts ◦ Defined Rules 9 Concepts are related Defined Rules 10 IN subordinating conjunction (FUNC_WORD) or preposition (PREP) ◦ “of” Candidate for Taxonomy 11 CC coordinating conjunction ◦ “and”, “or” etc ◦ Compound concepts, broken into terminal concepts 12 RB adverb and adverbial phrase DT determiner/demonstrative pronoun Ignored in our work so far 13 Verb is used as a relation between subject and object 14 JJ adjective NN common noun 15 Recursive, until dependency tree is exhausted Create compound concepts and relate them with the rule and then apply the rules on the sub phrases 16 Framework Institution Reference ASIUM INRIA, Jouy--‐en--‐Josas Faure and Nedellec 1999 TextToOnto AIFB, University of Karlsruhe Madche and Volz 2001 HASTI Amir Kabir University, Teheran Shamsfard,Barforoush2004 OntoLT DFKI, Saarbrucken Buitelaar et al. 2004 DOODLE Shizuoka University Morita et al.2004 Text2Onto AIFB, University of Karlsruhe Cimiano and Volker 2005 OntoLearn University of Rome Velardi et al. 2005 OLE Brno University of Technology Novacek and Smrz 2005 OntoGen Institute Jozef Stefan, Ljubljana Fortuna et al., 2007 GALeOn Technical University of Madrid Manzano-Macho et al. 2008 DINO DERI, Galway Novacek et al.2008 OntoLancs Lancester University Gacitua et al. 2008 RELExO AIFB, University of Karlsruhe Volker and Rudolph 2008 OntoComp University of Dresden Sertkaya 2008 17 [Buitelaar05] Paul Buitelaar, etal. Ontology Learning from Text, October 3 rd , 2005 [Kim09] Jin-Dong Kim et al., Overview of BioNLP’09 Shared Task On Event Extraction [Stuck] Semantic Technologies, Ontology Learning, Prof. Dr. Heiner Stuckenschmidt, Dr. Johanna Völker [Biemann] Chris Biemann: Ontology Learning from Text: A Survey of Methods [StanParser] http://nlp.stanford.edu/software/lexparser.shtml [WordNet] http://wordnet.princeton.edu/ [OpenGALEN] http://www.opengalen.org/ 18 Please provide us Comments and Directions 19