St. Petersburg State University Faculty of Philology and Arts International Conference «CORPUS LINGUISTICS – 2008» October 6–10, 2008, St. Petersburg CONFERENCE PROGRAM October 6, Monday 9-30…14-00 – Registration of participants 14-00…14-15 – OPENING CEREMONY Greetings from Sergey I. Bogdanov, the Dean of the Faculty of Philology and Arts Opening Speech by Alexander S. Gerd, the Head of the Organizing and Program Committees 14-15…15-45 – PLENARY MEETING Savchuk S. (Moscow) The Corpus of the Early XXth Century Russian Texts: from Experience in Creating a Corpus Boguslavsky I., Iomdin L., Valeev D., Sizov V. (Moscow) ETAP Parser and its Evaluation Using a Deeply Annotated Corpus of Russian Texts Paducheva E. (Moscow) Direct and Indirect Diathesis of a Mental Verb: Corpus-Based Approach 15-45…16-00 – Coffee-break 16-00…17-30 – PLENARY MEETING Gindin S., Ivanova E., Krasnikova A. (Moscow) Integral Systems of Philological Maintenance: Goals, Principal Components, Constructive Problems and Cultural Value Rychkova L. (Belarus, Grodno) Sociocultural and Ecolinguistic Aspects of Corpora Blanco X. (Spain, Barcelona) Using NooJ for Multipurpose Analysis of Romance Languages Corpora October 7, Tuesday 10-00…11-15 – PARALLEL CORPORA Andreeva Е. (StP) «Only» and «Alone» – a Corpus-Based Sense Differentiation Bobkova T., Gridneva L., Lebedev K., Perebeynoss V. (Ukraine, Kiev) Principles of Part of Speech Tagging in the English-Ukraine Parallel Corpus of NATO Documents Krizhanovsky A. (StP) Experiments on Corpus Index Generated from Wikipedia 11-10…11-30 – Coffee-break 11-30…12-45 – SEMANTICS IN CORPORA Kustova G., Toldova S. (Moscow) The Russian National Corpus: Semantic Filters for the Verb Sense Disambiguation Lapshinova-Koltunski E. (Stuttgart, Germany) Data Extraction from Corpora as a Tool to Classify Predicates by their Subcategorization Properties Kustova G. (Moscow) The Computer Dictionary of Combinability of Degree Words Based on the Russian National Corpus 12-45…13-00 – Coffee-break 13-00…14-15 – SYNTAX AND MORPHOLOGY IN CORPORA Boguslavsky I., Iomdin L., Mityushin L., Sizov V. (Moscow) Length of Dependency Links in the Annotated Corpus of Russian Petkevič V. (Czech Republic, Prague) Agreement in Czech as Manifested in the Czech National Corpus Šimkova M. (Slovak Republic, Bratislava) Part of Speech Tagging in the Slovak National Corpus and its Application in Explanatory Dictionary Creation 14-15…15-00 – Lunch-break 15-00…16-15 – CORPORA IN THE STUDY OF SPEECH Kuehnast M. (Germany, Berlin) Acquisition of Null Anaphora in Bulgarian – a Coprus-Based Study Saveljeva-Trofimova O., Kibrik A. (Moscow) Postpositive Subclausal Units in Spoken Language (Based on the «Dream Stories» Corpora) Korotaev N., Kibrik A. (Moscow) «Sentence» in Oral Narrative: A Corpus-Based Study 16-15…16-30 – Coffee-break 16-30…18-10 – SPECIAL ASPECTS OF CORPORA DEVELOPMENT AND APPLICATION Belyaeva L. (StP) Parallel Corpus as a Database for Applied Lexicography Gerd A., Azarova I., Dmitriev A., Nikolaev I., Sokolova A. (StP) Toponymic and Cartographic Data in the Project «Complex Toponymic Mapping of Ingermanland Historic and Cultural Zone (South-West of Leningrad Oblast)» Nikolaev I. (StP) Ingermanland Folk Songs Corpora as Data Source for Izhorian Language Grammatical Description Badmayeva L., Badagarov Zh., Tsydypov B. (Ulan-Ude) General Problems of the Buryat Corpus Organization October 8, Wednesday 9-30…11-10 – SPECIAL ASPECTS OF CORPORA DEVELOPMENT AND APPLICATION Gerd A. (StP) The Russian National Corpus – Card Index – Academy Dictionary Rogov A., Gurin G., Kotov A. (Petrozavodsk) Some Peculiarities of the Grammatically Annotated Corpus of the Late XIX century Russian Journalistic Texts Kamshilova O., Kapotova N., Rasumova V. (StP) Sentence Length and Structure Analysis in a Special Corpus Rodionova E., Khozyainov S., Mitrofanova O. (StP) Text Corpora in Attribution of Literary Works 11-10…11-30 – Coffee-break 11-30…12-45 – SEMANTICS IN CORPORA Azarova I., Bichineva S., Vakhitova D. (StP) Automatic WSD for Frequent Nouns (in terms of RussNet Structural Units) Mitrofanova O., Lashevskaya O., Panicheva P. (StP, Moscow) Experiments on Statistical WSD for Russian Nouns in a Corpus Lezin G. (StP) The Ontological Representation of the Meaning of the Sentence in RDF/OWL 12-45…13-00 – Coffee-break 13-00…14-15 – SYNTAX AND MORPHOLOGY IN CORPORA Iagounova E. (StP) Analyzing Context Predictability of Textual Units by Means of Corpora Grishina E., Savchuk S. (Moscow) The Russian National Corpus as a Tool for Studying Grammar Variation Kopotev M. (Finland, Helsinki) Creating a Frequency Grammar of Russian 14-15…15-00 – Lunch-break 15-00…16-15 – MACHINE TRANSLATION AND PARALLEL CORPORA Garabík R. (Slovak Republic, Bratislava) A Simple Russian-Slovak Machine Translation System Vavřín M., Rosen A. (Czech Republic, Prague) INTERCORP: a Multilingual Parallel Corpus Klyueva N., Bojar O. (Czech Republic, Prague) UMC 0.1: Czech-Russian-English Multilingual Corpus 16-15…16-30 – Coffee-break 16-30…18-10 – CORPORA IN THE STUDY OF SPEECH Waclawičová M., Křen M. (Czech Republic, Prague) ORAL2008: New Balanced Corpus of Spoken Czech Shestopalova V., Petrova T., Bolgov M. (Vladivostok) The Regional Variety of Spoken Russian as a Subject for Corpus Linguistics Vydrin V. (StP) Towards an Electronic Corpus of Bamana: Tonal Marks Smirnov A., Smirnova O. (Voronezh) Foreign Language Phonotactics Software October 9, Thursday 9-30…11-10 – DIACHRONIC CORPORA Azarova I., Alexeeva E., Alexeev V. (StP) Automatic Annotation of the Hagiographic Corpus SCAT Ufland E., Alexeeva E. (StP) Reducing Spelling Variation in Service Components of the Hagiographic Corpus SCAT Rogozina E. (StP) XML-encoding of the Hagiographic Text Content Structure for SCAT Corpus Nekipelova I. (Izhevsk) The Analysis of Types of Lexical Meanings as a Step in Reconstruction of Lexical Semantics of the Old Russian Language (based on Slavonic Triodion) 11-10…11-30 – Coffee-break 11-30…12-45 – SYNTAX AND MORPHOLOGY IN CORPORA Grebenkov A. (StP) Internet Resource www.wordform.ru as a Tool of Morphological Tagging of Russian Texts Boyarsky K., Kanevsky E. (StP) Software Development for Semi-Automatic Morphological Tagging Bobichev V. (Moldova, Chisinau) Automatic Morphological Disambiguation for Corpus Annotation 12-45…14-00 – Lunch-break 14-00…14-50 – SPECIAL ASPECTS OF CORPORA DEVELOPMENT AND APPLICATION Bugakov O. (Ukraine, Kiev) Using UNLC in Linguistic Research Sidorova E. (Novosibirsk) An Approach to Construction of Subject Domain Dictionaries via Corpora 14-50…15-00 – Coffee-break 15-00…15-50 – SPECIAL ASPECTS OF CORPORA DEVELOPMENT AND APPLICATION Konstantinova N. (StP) On the Choice of Means for Ontology Representation (A Case Study of «Financial Management» Domain) Vinogradova N., Mitrofanova O. (StP) Formal Ontology as a Tool for Data Systematization in the Corpus of Russian Texts on Corpus Linguistics 15-50…16-00 – Coffee-break 16-00 – CLOSING CEREMONY Closing Speech by Alexander S. Gerd, the Head of the Organizing and Program Committees ROUND TABLE 17-00 – Buffet TIME LIMIT: plenary talks – 30 minutes (with discussion), workshop talks – 25 minutes (with discussion). A computer and a projector will be at the disposal of participants. CONFERENCE LOCATION: October 6 – Faculty of Philology and Arts, Universitetskaya embankment, 11; registration – room 193, plenary meeting – Cinema Hall. October 7, 8, 9 – Institute of Linguistic Studies, Tuchkov pereulok, 9; workshops – Conference Hall.