April 15th, 2013 Ms. Amany AlKhayat TLC session: Corpus Linguistics (A Practical Session) Corpora tasks 1- Can you guess the most common words in English? Write only 3 of them. What's your evidence? Compare your answers with your partner. ______________________________________________________________________________ ______________________________________________________________________________ _______________________________________________________ Check your answers against this evidence from the Corpus of Contemporary American English (COCA) (This is a corpus of 450 m words http://www.englishclub.com/vocabulary/common-words-5000.htm Tip! Why Frequency is important? http://www.lextutor.ca/research/ Information! Tokens, Types Types are word-forms and tokens are occurrences of word-forms. So, for example, in the sentence 'The cat sat on the mat', there are two tokens of the type 'the' and one token each of the types 'cat', 'sat', 'on', and 'mat'. 2- Take a few minutes to think of which words collocate with these 3 verbs: take, break and catch. ___________________________________________________________________________ ___________________________________________________________________________ ___________________________________________________________________________ ___________________________________________________________________________ ___________________________________________________________________________ ___________________________________________________________________________ __________________________________________________________________________ 3- How do you deal with grammatical problems in class? (E.g., Punctuation, Collocations, colligation, pronouns, reference resolution, coherence markers…etc.) ______________________________________________________________________________ ______________________________________________________________________________ ______________________________________________________________________________ ______________________________________________________________________________ ______________________________________________________________________________ ______________________________________________________________________________ ______________________________________________________________________________ April 15th, 2013 Ms. Amany AlKhayat TLC session: Corpus Linguistics (A Practical Session) Corpora tasks 4- Now look at the word cloud below and try to decide which words collocate with take, catch and break. ______________________________________________________________________________ ______________________________________________________________________________ ______________________________________________________________________________ ______________________________________________________________________________ ______________________________________________________________________________ __________ It’s time for practicing corpora Now let's open http://corpus.byu.edu/coca Check your answers for questions 2 and 4 using COCA. April 15th, 2013 Ms. Amany AlKhayat TLC session: Corpus Linguistics (A Practical Session) Corpora tasks Grammar Intuition vs. Corpus data: Main website: http://www.lextutor.ca/ ****Grammar: http://www.lextutor.ca/corpus_grammar/ Quiz Builder http://www.lextutor.ca/concordancers/multi/ Just a description of corpora used in Lextutor: http://www.lextutor.ca/concordancers/corpus_descriptions.html Please feel free to search Lextutor or COCA for any words or phrases that you need information for.