LGCS 121: Psycholinguistics Lexical decision programming To get a sense for how to design and program an experiment, we’ll be programming a pilot lexical decision experiment. The lexical decision task is a task in which a subject must decide whether a string of letters is a word of English (e.g., horse) or not (e.g., troim) as quickly as possible. There are many variations, e.g., semantic categorization, but the basic task is pretty much the same in most cases. You’ll want to decide on an experimental hypothesis and a suitable manipulation – the crucial difference in materials that you want to test. Since we can’t test every English word on a subject, we’ll need to sample words while controlling for possible confounds between the groups. Sampling just means that we’re picking out a manageable, representative portion of a population to test. A confound is a difference in experimental items (or procedure) that could possibly explain the results in different way than the experimental hypothesis. In other words, having confounds prevents you from concluding that the results confirm or disconfirm your experimental hypothesis, since there are other possible explanations for the result of interest. Suppose you want to test the effect of frequency on lexical decision times and accuracy – subjects are known to respond more quickly and accurately to more frequent words. Your manipulation is frequency and so you want to make sure that there is a difference in frequency between your items. For concreteness, call the list of low frequency words the Low condition and the list of high frequency words the High condition. To get a measure of frequency, you could go to the English Lexicon Project, which has many lexical characteristics for words. Use the HAL or SUBTLEX frequency measure rather than the outdated Kucera-Francis (KF) measure. We’ll discuss how to use ELP more in class. http://elexicon.wustl.edu/ One easy confound to control for is word length. You can limit yourself to words of only one length, or have even variation across Low and High. To determine whether the Low and High conditions differ, you enter the values into an Excel sheet and use the TTEST() function to compare values. For each manipulation (experimental variable) and control you consider, you should collect the mean for the each condition, the standard deviation (the amount of variation in your items), and run a t-test, which is a simple test comparing the means of two lists to estimate whether they are likely to be drawn from the same population (in which case the means do not differ significantly) or not (in which case they do differ significantly). For example, suppose I have 10 items in each of the Low and High conditions in Excel. I’ve created some numbers here on the next page. (See Excel sheet for formulae.) Item Low 1 2 3 4 5 6 7 8 9 10 High 12 5 100 234 98 11 3 54 44 87 982 4300 436 7784 209 3212 555 332 1200 6500 You can take the mean by using the AVERAGE() function and giving the array of values as its argument. Low High = AVERAGE(B2:B11) = 64.80 = AVERAGE(C2:C11) = 2551 The standard deviation can be calculated in much of the same way: Low High = STDEV(B2:B11) = 70.78 = STDEV (C2:C11) = 2781.46 To test whether the two groups are significantly different in terms of frequency, you can use a t-test which takes 4 arguments: = TTEST(array1, array2, Tails, Type) In the example above, array1 is simply the array corresponding to the Low items, B2:B11, and array2 corresponds to the High items C2:C11. The value for Tails depends on what you’re testing, but we can just use a two-tailed test in all cases here. So set Tails to 2. Finally, we can set the Type to 3 (for two-sampled test, without assuming equal variance). Note that if you want to control for confounds and are looking for no difference between samples, use Type = 2, since you’ll be assuming that the samples have the same variance (that is, are distributed in roughly the same shape). = TTEST(B2:B11, C2:C11,2,3) which gives us a value p < 0.05, indicating that the difference of means between Low and High is likely to have been sampled from different populations. That is, the mean differences are genuine. You can read more about the t-test here: http://office.microsoft.com/en-us/excel-help/ttest-HP005209325.aspx http://en.wikipedia.org/wiki/Student's_t-test To report this difference, you could write something like: In a paired t-test, items from the Low condition (M = 64.80, SD = 70.78) were significantly less frequent than items from the High condition (M = 2551, SD = 2781.46), p < 0.05. In addition, your experiment should have enough fillers to mask the manipulation from the subject. Ideally, you’ll have an even number of Yes and No responses in the experiment, so that subjects don’t develop a strategy (for example, a bias towards Yes responses if the majority of items are genuine words). So, if you’re showing your subjects 20 words each, then you should have 20 fillers, for 40 items total. You’ll want to think carefully about how to describe the experiment to your subjects. It’s crucial that they understand the task well enough before starting the experiment. I recommend programming in written in instructions and a practice session for them to complete before starting the task. You may wish to give the instructions to a friend to spot any ambiguities or misleading language. Write up, due October 11. You should write up a very short report (3-4 pages) detailing what you’ve done. The report should have the following sections. Please use the section numbering indicated here. 1. Introduction Lay out the background here along with any assumptions you’re making. Cite any relevant work and clearly state your hypothesis and the prediction you are testing. 2. Experiment 2.1 Materials and Method Provide examples of the items you used, and the number of items presented, along with fillers. Give means, standard deviations, and t-test probabilities for relevant lexical characteristics. Describe your method. 2.2 Participants How many people did you test? How did you recruit and compensate them? Did you exclude any one? If so, why? 2.3 Results Briefly describe your results in terms of means, standard deviations, and ttest probabilities for reaction times (time it took to make the lexical decision). Report the percent accuracy for the entire experiment and by condition. Provide a graph to illustrate the results. 3. General discussion What can we conclude from your experiment? What open questions are there? 4. References Complete citations in APA format if possible: http://myrin.ursinus.edu/help/resrch_guides/cit_style_apa.htm The writing style should be simple, impersonal, and direct. In addition, you’ll need to provide your program, the items, any data you collected, and the Excel sheet you used to calculate the means, standard deviations, and t-tests.